标题:Multiple imputation for demographic hazard models with left-censored predictor variables: Application to employment duration and fertility in the EU-SILC
摘要:Objective: A common problem when using panel data is that individuals’ histories are incompletely known at the first wave. We demonstrate the use of multiple imputation as a method to handle this partial information, and thereby increase statistical power without compromising the model specification. Methods: Using EU-SILC panel data to investigate full-time employment as a predictor of partnered women’s risk of first birth in Poland, we first multiply imputed employment status two years earlier to cases for which employment status is observed only in the most recent year. We then derived regression estimates from the full, multiply imputed sample, and compared the coefficient and standard error estimates to those from complete-case estimation with employment status observed both one and two years earlier. Results: Relative to not being full-time employed, having been full-time employed for two or more years was a positive and statistically significant predictor of childbearing in the multiply imputed sample, but was not significant when using complete-case estimation. The variance about the ‘two or more years’ coefficient was one third lower in the multiply imputed sample than in the complete-case sample. Contribution: By using MI for left-censored observations, researchers using panel data may specify a model that includes characteristics of state or event histories without discarding observations for which that information is only partially available. Using conventional methods, either the analysis model must be simplified to ignore potentially important information about the state or event history (risking biased estimation), or cases with partial information must be dropped from the analytical sample (resulting in inefficient estimation).
关键词:employment spells;fertility;left-censored data;multiple imputation;panel data