Article Information

Title: An Efficient Data Fingerprint Query Algorithm Based on Two-Leveled Bloom Filter
Authors: Zhou, Bin; Zhu, Rongbo; Zhang, Ying et al.
Journal: Journal of Multimedia
Print ISSN: 1796-2048
Year of publication: 2013
Volume: 8
Issue: 2
Pages: 73-81
DOI: 10.4304/jmm.8.2.73-81
Language: English
Publisher: Academy Publisher
Abstract: A decade ago, the role of the fingerprint comparison algorithm was to determine whether a newly partitioned data chunk already existed in a storage system. At present, in most de-duplication backup systems, the set of fingerprints for the big data chunks is huge and cannot be held entirely in memory. System performance inevitably suffers from the storage accesses that data chunks require at the query stage. Accordingly, a new query mechanism, the Two-stage Bloom Filter (TBF), is proposed. First, as a representation of the entirety of the first-grade Bloom filter, each bit of the second-grade Bloom filter in the TBF represents the chunks having identical fingerprints, which reduces the false positive rate. Second, a two-dimensional list corresponding to the two-grade Bloom filter is built to hold the absolute addresses of the data chunks with identical fingerprints. Finally, a new class of hash functions with strong global randomness is constructed according to the random characteristics of the data fingerprints. By greatly reducing the amount of data to be compared, TBF decreases the number of disk accesses, speeds up the detection of redundant data chunks, and reduces the false positive rate, which improves the overall performance of the system.
Keywords: data fingerprint; bloom filter; two-level bloom filter; de-duplication; hash
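
Note: The abstract does not spell out how TBF is constructed. As a rough illustration of the general idea only (checking two in-memory Bloom filters before touching an on-disk fingerprint index), the following Python sketch may help. The class names, the SHA-1 fingerprints, the salted double hashing, the filter sizes, and the dictionary standing in for the disk index are all assumptions of this sketch, not details taken from the paper.

```python
import hashlib


class BloomFilter:
    """Plain Bloom filter over byte-string fingerprints (hypothetical helper)."""

    def __init__(self, num_bits, num_hashes, salt):
        self.num_bits = num_bits
        self.num_hashes = num_hashes
        self.salt = salt  # lets each level use a different hash family
        self.bits = bytearray((num_bits + 7) // 8)

    def _positions(self, fingerprint):
        # Double hashing: derive k bit positions from two 64-bit values.
        digest = hashlib.sha256(self.salt + fingerprint).digest()
        h1 = int.from_bytes(digest[:8], "big")
        h2 = int.from_bytes(digest[8:16], "big") | 1  # force an odd step
        return [(h1 + i * h2) % self.num_bits for i in range(self.num_hashes)]

    def add(self, fingerprint):
        for pos in self._positions(fingerprint):
            self.bits[pos // 8] |= 1 << (pos % 8)

    def might_contain(self, fingerprint):
        # False means "definitely absent"; True means "possibly present".
        return all(self.bits[pos // 8] & (1 << (pos % 8))
                   for pos in self._positions(fingerprint))


class TwoLevelIndex:
    """Two Bloom filters in front of a (simulated) on-disk address table."""

    def __init__(self, num_bits=1 << 16, num_hashes=4):
        self.level1 = BloomFilter(num_bits, num_hashes, b"level-1")
        self.level2 = BloomFilter(num_bits, num_hashes, b"level-2")
        self.addresses = {}    # assumption: stands in for the disk-resident index
        self.disk_lookups = 0  # counts how often the "disk" is consulted

    def insert(self, chunk, address):
        fp = hashlib.sha1(chunk).digest()  # chunk fingerprint
        self.level1.add(fp)
        self.level2.add(fp)
        self.addresses[fp] = address

    def lookup(self, chunk):
        """Return the stored address of a duplicate chunk, or None if new."""
        fp = hashlib.sha1(chunk).digest()
        # A miss in either filter proves the chunk is new: no disk access.
        if not self.level1.might_contain(fp):
            return None
        if not self.level2.might_contain(fp):
            return None
        # Both filters say "possibly present": confirm against the index.
        self.disk_lookups += 1
        return self.addresses.get(fp)


index = TwoLevelIndex()
index.insert(b"chunk A", address=0)
print(index.lookup(b"chunk A"))  # address stored for the duplicate chunk
print(index.lookup(b"chunk B"))  # unseen chunk
```

In this sketch a false positive must occur in both filters before the slower index is consulted, which is one simple way a second level can lower the effective false positive rate; the paper's actual bit-level relationship between the two filters may differ.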