首页    期刊浏览 2025年05月24日 星期六
登录注册

文章基本信息

  • 标题:MHDFS: A Memory-Based Hadoop Framework for Large Data Storage
  • 本地全文:下载
  • 作者:Aibo Song ; Maoxian Zhao ; Yingying Xue
  • 期刊名称:Scientific Programming
  • 印刷版ISSN:1058-9244
  • 出版年度:2016
  • 卷号:2016
  • DOI:10.1155/2016/1808396
  • 出版社:Hindawi Publishing Corporation
  • 摘要:Hadoop distributed file system (HDFS) is undoubtedly the most popular framework for storing and processing large amount of data on clusters of machines. Although a plethora of practices have been proposed for improving the processing efficiency and resource utilization, traditional HDFS still suffers from the overhead of disk-based low throughput and I/O rate. In this paper, we attempt to address this problem by developing a memory-based Hadoop framework called MHDFS. Firstly, a strategy for allocating and configuring reasonable memory resources for MHDFS is designed and RAMFS is utilized to develop the framework. Then, we propose a new method to handle the data replacement to disk when memory resource is excessively occupied. An algorithm for estimating and updating the replacement is designed based on the metrics of file heat. Finally, substantial experiments are conducted which demonstrate the effectiveness of MHDFS and its advantage against conventional HDFS.
国家哲学社会科学文献中心版权所有