Multivariate data imputation using trees.

  1. Tusell Palmer, Fernando Jorge
  2. Bárcena Ruiz, María Jesús
Zeitschrift:
Documentos de Trabajo BILTOKI

ISSN: 1134-8984

Datum der Publikation: 2002

Nummer: 5

Art: Arbeitsdokument

Andere Publikationen in: Documentos de Trabajo BILTOKI

Zusammenfassung

We address the problem of completing two files with records containing a fully observed common subset of variables. The tecnique investigated involves the use of regression and/or classification trees. An extension of current methodology (the intersection-seeking or ``forest-climbing'' algorithm) is proposed to deal with multivariate response variables. The method is demonstrated and shown to be feasible and have some desirable properties.