首页    期刊浏览 2024年07月09日 星期二
登录注册

文章基本信息

  • 标题:The Optimization and Improvement of MapReduce in Web Data Mining
  • 本地全文:下载
  • 作者:Jun Qu ; Chang-Qing Yin ; Shangwei Song
  • 期刊名称:Journal of Software Engineering and Applications
  • 印刷版ISSN:1945-3116
  • 电子版ISSN:1945-3124
  • 出版年度:2015
  • 卷号:08
  • 期号:08
  • 页码:395-406
  • DOI:10.4236/jsea.2015.88039
  • 语种:English
  • 出版社:Scientific Research Publishing
  • 摘要:Extracting and mining social networks information from massive Web data is of both theoretical and practical significance. However, one of definite features of this task was a large scale data processing, which remained to be a great challenge that would be addressed. MapReduce is a kind of distributed programming model. Just through the implementation of map and reduce those two functions, the distributed tasks can work well. Nevertheless, this model does not directly support heterogeneous datasets processing, while heterogeneous datasets are common in Web. This article proposes a new framework which improves original MapReduce framework into a new one called Map-Reduce-Merge. It adds merge phase that can efficiently solve the problems of heterogeneous data processing. At the same time, some works of optimization and improvement are done based on the features of Web data.
  • 关键词:Cloud Computing;Web Data;MapReduce;Map-Reduce-Merge
国家哲学社会科学文献中心版权所有