期刊名称:International Journal of Security and Its Applications
印刷版ISSN:1738-9976
出版年度:2012
卷号:6
期号:2
出版社:SERSC
摘要:In a data deduplication system, the performance of data deduplication algorithms are varying on the condition of file contents. For example, if a file is modified at the end of file region then Fixed-length Chunking algorithm superior to Variable-length Chunking in terms of computation time with similar space reduction result. Therefore, it is important to predict in which location of a file is modified in a deduplication system. In this paper, we discuss a new approach to one of the key methods that is invariably applied to data deduplication. The essential idea is to exploit an efficient file pattern checking scheme that can be used for data deduplication. The contribution of this paper is to find in which region of a file is modified using file similarity information. The file modification pattern can be used for elaborating data deduplication system for selecting deduplication algorithm. Experiment result shows that the proposed system can predict file modification region with high probability