期刊名称:International Journal on Computer Science and Engineering
印刷版ISSN:2229-5631
电子版ISSN:0975-3397
出版年度:2013
卷号:5
期号:2
页码:56-61
出版社:Engg Journals Publications
摘要:A common problem encountered by many data mining techniques is the missing data. A missing data is defined as an attribute or feature in a dataset which has no associated data value. Correct treatment of these data is crucial, as they have a negative impact on the interpretation and result of data mining processes. Missing value handling techniques can be grouped into four categories, namely, complete case analysis, Imputation methods, maximum likelihood methods and machine learning methods. Out of these imputation methods are the widely used solution for handling missing values. However, there are situations when imputation methods might not work correctly. This study studies and analyzes the performance of two algorithms, one imputation based and another without imputation based classification on missing data.
关键词:Missing Values; Imputation; Non-imputation; Classification with missing data.