Aufsatz(elektronisch)24. Dezember 2018

Simultaneous Edit and Imputation For Household Data with Structural Zeros

In: Journal of survey statistics and methodology: JSSAM, Band 7, Heft 4, S. 498-519

Verfügbarkeit an Ihrem Standort wird überprüft

Abstract

Abstract
Multivariate categorical data nested within households often include reported values that fail edit constraints—for example, a participating household reports a child's age as older than his biological parent's age—and have missing values. Generally, agencies prefer datasets to be free from erroneous or missing values before analyzing them or disseminating them to secondary data users. We present a model-based engine for editing and imputation of household data based on a Bayesian hierarchical model that includes (i) a nested data Dirichlet process mixture of products of multinomial distributions as the model for the true latent values of the data, truncated to allow only households that satisfy all edit constraints, (ii) a model for the location of errors, and (iii) a reporting model for the observed responses in error. The approach propagates uncertainty due to unknown locations of errors and missing values, generates plausible datasets that satisfy all edit constraints, and can preserve multivariate relationships within and across individuals in the same household. We illustrate the approach using data from the 2012 American Community Survey.

Sprachen

Englisch

Verlag

Oxford University Press (OUP)

ISSN: 2325-0992

DOI

10.1093/jssam/smy022

Problem melden

Wenn Sie Probleme mit dem Zugriff auf einen gefundenen Titel haben, können Sie sich über dieses Formular gern an uns wenden. Schreiben Sie uns hierüber auch gern, wenn Ihnen Fehler in der Titelanzeige aufgefallen sind.