首页    期刊浏览 2024年11月26日 星期二
登录注册

文章基本信息

  • 标题:Adaptive Replication Management using Predictor for HDFS
  • 本地全文:下载
  • 作者:I.Sugapriya ; K.Bhuvaneswari
  • 期刊名称:International Journal of Advanced Research in Computer Engineering & Technology (IJARCET)
  • 印刷版ISSN:2278-1323
  • 出版年度:2017
  • 卷号:6
  • 期号:5
  • 页码:568-573
  • 出版社:Shri Pannalal Research Institute of Technolgy
  • 摘要:The number of applications based on Apache Hadoop is dramatically increasing due to the robustness and dynamic features of this system. At the heart of Apache Hadoop, the Hadoop Distributed File System (HDFS) provides the reliability and high availability for computation by applying a static replication by default. However, because of the characteristics of parallel operations on the application layer, the access rate for each data file in HDFS is completely different. Consequently, maintaining the same replication mechanism for every data file leads to detrimental effects on the performance. By rigorously considering the drawbacks of the HDFS replication, this paper proposes an approach to dynamically replicate the data file based on the predictive analysis. With the help of probability theory, the utilization of each data file can be predicted to create a corresponding replication strategy. Eventually, the popular files can be subsequently replicated according to their own access potentials. For the remaining low potential files, an erasure code is applied to maintain the reliability. Hence, our approach simultaneously improves the availability while keeping the reliability in comparison to the default scheme. Furthermore, the complexity reduction is applied to enhance the effectiveness of the prediction when dealing with Big Data.
  • 关键词:Replication; HDFS; proactive prediction; optimization; Bayesian learning; Gaussian process
国家哲学社会科学文献中心版权所有