
Basic article information

  • Title: Direct and Unbiased Multiple Imputation Methods for Missing Values of Categorical Variables
  • Authors: Yuanhui Xiao; Ruiguang Song; Mi Chen
  • Journal: Journal of Data Science
  • Print ISSN: 1680-743X
  • Electronic ISSN: 1683-8602
  • Publication year: 2012
  • Volume: 10
  • Issue: 3
  • Pages: 465-481
  • Publisher: Tingmao Publish Company
  • Abstract: Missing data is a common problem in statistical analyses. To make use of information in data with incomplete observation, missing values can be imputed so that standard statistical methods can be used to analyze the data. Variables with missing values are often categorical and the missing pattern may not be monotone. Currently, commonly used imputation methods for data with a non-monotone missing pattern do not allow direct inclusion of categorical variables. Categorical variables are converted to numerical variables before imputation. For many applications, the imputed numerical values for those categorical variables must then be converted back to categorical values. However, this conversion introduces bias which can seriously affect subsequent analyses. In this paper, we propose two direct imputation methods for categorical variables with a non-monotone missing pattern: the direct imputation approach incorporated with the expectation-maximization algorithm and the direct imputation approach incorporated with a new algorithm: the imputation-maximization algorithm. Simulation studies show that both methods perform better than the method using variable conversion. An application to real data is provided to compare the direct imputation method and the method using variable conversion.
  • Keywords: Bias; categorical variable; HIV; missing values; multiple imputation.