期刊名称:International Journal of Innovative Research in Computer and Communication Engineering
印刷版ISSN:2320-9798
电子版ISSN:2320-9801
出版年度:2017
卷号:5
期号:3
页码:3918
DOI:10.15680/IJIRCCE.2017.0503039
出版社:S&S Publications
摘要:Cloud computing involves deploying groups of remote servers and software networks that allowcentralized data storage and online access to computer services or resources. With an increasing demand of cloudstorage, effective methods need to be placed to reduce hardware costs, meet the bandwidth requirements and to increasestorage efficiency. Deduplication is the process of storing duplicate data in a single instance and just replicating areference pointer. This process is intended and used to store more data logically and provide more free spacephysically. Many systems are stick to a limited number of file formats as they address the logical structure of the file ina certain format. This problem is solved by ingenious approach of hashing data in its binary form itself. Using thisapproach has an advantage of including all kinds of file formats because the proposed system has no relevance with theactual structure of the file in terms of a certain format. The data here is considered as a general entity and is dividedinto fixed size blocks over which hash is computed. This hash is matched whenever a new file arrives. This approachyields better results over existing approaches for deduplication.