首页    期刊浏览 2024年09月18日 星期三
登录注册

文章基本信息

  • 标题:Data Quality in Big Data: A Review
  • 本地全文:下载
  • 作者:Noraini Abdullah ; Saiful Adli Ismail ; Siti Sophiayati
  • 期刊名称:International Journal of Advances in Soft Computing and Its Applications
  • 印刷版ISSN:2074-8523
  • 出版年度:2015
  • 卷号:7
  • 期号:3-Special
  • 出版社:International Center for Scientific Research and Studies
  • 摘要:The Data Warehousing Institute (TDWI) estimates that data quality problems cost U.S. businesses more than $600 billion a year. The problem with data is that its quality quickly degenerates over time. Experts say 2 percent of records in a customer file become obsolete in one month because customers die, divorce, marry, and move. In addition, data entry errors, system migrations, and changes in source systems, among other things, generate bucket loads of errors. More complex, as organizations fragment into different divisions and units, interpretations of data elements change to meet the local business needs. However, there are several ways that the Company should concern, such as to treat data as a strategic corporate resource; develop a program for managing data quality with a commitment from the top; and hire, train, or outsource experienced data quality professionals to oversee and carry out the program. The Organizations can sustain a commitment to managing data quality over time and adjust monitoring and cleansing processes to changes in the business and underlying systems by using the Commercial data quality tools. Data is a vital resource. Companies that invest proportionally to manage this resource will stand a stronger chance of succeeding in today's competitive global economy than those that squander this critical resource by neglecting to ensure adequate levels of quality. This paper reviews the characteristics of big data quality and the managing processes that are involved in it
  • 关键词:Big data; five v's; Data Quality; Big Value; Data Quality ; Attributes; Data Quality Methodology
国家哲学社会科学文献中心版权所有