期刊名称:Journal of Theoretical and Applied Information Technology
印刷版ISSN:1992-8645
电子版ISSN:1817-3195
出版年度:2016
卷号:92
期号:2
出版社:Journal of Theoretical and Applied
摘要:Without any doubt, XML data model considered the most dominant document type over the web with more than 60% of the total; nevertheless, their quality is not as expected. Data cleaning is equipped to overcome database�s quality issues. Integrity Constraint is a very important criterion for keeping data in a consistent manner, almost all previous XML dependencies are introduced to improve the schema and normalization, with a small effort toward improving data instance. This paper summarizes the most important XML integrity constraint notations and data cleaning approaches. In addition, to highlight the shortcoming of these constraints and proved it is limitation for increasing data quality. Finally, introduce the next generation of conditional integrity constraints, which will be held mainly for data cleaning issues.
关键词:XML; Data Quality; Data Cleaning; Integrity Constraints.