期刊名称:International Journal of Data Mining & Knowledge Management Process
印刷版ISSN:2231-007X
电子版ISSN:2230-9608
出版年度:2011
卷号:1
期号:2
出版社:Academy & Industry Research Collaboration Center (AIRCC)
摘要:Data cleaning, also called data cleansing or scrubbing, deals with detecting and removing errors and inconsistencies from data in order to improve the quality of data. Data quality problems are present in single data collections, such as files and databases,. When multiple data sources need to be integrated, e.g., in data warehouses, federated database systems or global web-based information systems, the need for data cleaning increases significantly. Data cleaning is the necessary condition of knowledge discovery and data warehouse building. In this paper two algorithms are designed using data mining technique to correct the attribute without external reference. One is Context-dependent attribute correction and another is Context-independent attribute correction.