首页    期刊浏览 2024年11月28日 星期四
登录注册

文章基本信息

  • 标题:TOKEN-BASED METHOD OF BLOCKING RECORDS FOR LARGE DATA WAREHOUSE
  • 本地全文:下载
  • 作者:Jebamalar Tamilselvi J., Saravanan V
  • 期刊名称:Advances in Information Mining
  • 印刷版ISSN:0975-3265
  • 电子版ISSN:0975-9093
  • 出版年度:2010
  • 期号:498
  • 页码:5-10
  • 出版社:Bioinfo Publications
  • 摘要:Record linkage is a critical problem in duplicate data elimination. It is used to detect and eliminate duplicate data. The elimination of duplicate data will increase the quality of data. Record Linkage problem will take high computational cost because of the large number of record comparisons. The comparison of records is inefficient in large data warehouses. Blocking methods are used to group the records to minimize the number of record comparisons. This paper explains the existing blocking methods and its comparison and discusses the selection of token-based blocking key for record comparisons.
国家哲学社会科学文献中心版权所有