首页    期刊浏览 2024年11月27日 星期三
登录注册

文章基本信息

  • 标题:A Brief Review Along With a New Proposed Approach of Data De Duplication
  • 本地全文:下载
  • 作者:Suprativ Saha ; Avik Samanta
  • 期刊名称:Computer Science & Information Technology
  • 电子版ISSN:2231-5403
  • 出版年度:2013
  • 卷号:3
  • 期号:2
  • 页码:223-231
  • DOI:10.5121/csit.2013.3220
  • 出版社:Academy & Industry Research Collaboration Center (AIRCC)
  • 摘要:Recently the incremental growth of the storage space and data is parallel. At any instant data may go beyond than storage capacity. A good RDBMS should try to reduce the redundancies as far as possible to maintain the consistencies and storage cost. Apart from that a huge database with replicated copies wastes essential spaces which can be utilized for other purposes. The first aim should be to apply some techniques of data deduplication in the field of RDBMS. It is obvious to check the accessing time complexity along with space complexity. Here different techniques of data de duplication approaches are discussed. Finally based on the drawback of those approaches a new approach involving row id, column id and domain-key constraint of RDBMS is theoretically illustrated. Though apparently this model seems to be very tedious and non-optimistic, but in reality for a large database with lot of tables containing lot of lengthy fields it can be proved that it reduces the space complexity vigorously with same accessing speed.
  • 关键词:DeDuplication; SQL; Chunk; Genetic Algorithm; Replicated copy; Domain Key Constraint
国家哲学社会科学文献中心版权所有