
Article Information

  • Title: RPK-table based efficient algorithm for join-aggregate query on MapReduce
  • Authors: Zhan Li; Qi Feng; Wei Chen
  • Journal: CAAI Transactions on Intelligence Technology
  • Electronic ISSN: 2468-2322
  • Year of publication: 2016
  • Volume: 1
  • Issue: 1
  • Pages: 79-89
  • DOI: 10.1016/j.trit.2016.03.008
  • Publisher: IET Digital Library
  • Abstract: Join-aggregate is an important and widely used operation in database systems. However, processing join-aggregate queries is time-consuming in big data environments, especially on the MapReduce framework. There are two main bottlenecks: heavy I/O caused by temporary data, and high communication overhead between data nodes during query processing. To overcome these drawbacks, we design a data structure called the Reference Primary Key table (RPK-table), which stores the primary key and foreign key relationships between tables. Based on this structure, we propose an improved algorithm for join-aggregate queries on the MapReduce framework. Experiments on the TPC-H dataset demonstrate that our algorithm outperforms existing methods in terms of communication cost and query response time.
  • Keywords: Join-aggregate query; MapReduce; Query optimization; RPK-table; Communication cost
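
The abstract does not specify the layout of the RPK-table or the steps of the proposed algorithm, so the following is only a minimal sketch of the general idea behind a join-aggregate query that uses a key-reference lookup. It assumes a toy fact table whose foreign key points to a dimension table's primary key; each simulated map task resolves the foreign key through the lookup and pre-aggregates locally, so that only small partial sums are sent to the reducer instead of joined intermediate rows. All names here (`customers`, `orders`, `build_key_reference`, `map_phase`, `reduce_phase`, `join_aggregate`) and the data are hypothetical, and the code runs in a single process rather than on MapReduce.

```python
from collections import defaultdict

# Hypothetical toy tables, not taken from the paper.
# Dimension table: primary key -> grouping attribute.
customers = {1: "ASIA", 2: "EUROPE", 3: "ASIA"}
# Fact table rows: (order_id, customer_foreign_key, amount).
orders = [(101, 1, 50.0), (102, 2, 20.0), (103, 1, 30.0), (104, 3, 10.0)]


def build_key_reference(dim_table):
    """Assumed stand-in for a key-reference table:
    maps each primary key value to the attribute we group by."""
    return dict(dim_table)


def map_phase(split, key_ref):
    """Simulated map task: resolve each foreign key through the lookup
    and pre-aggregate locally, so no joined rows are emitted."""
    partial = defaultdict(float)
    for _order_id, fk, amount in split:
        group = key_ref.get(fk)
        if group is not None:  # inner-join semantics: skip dangling keys
            partial[group] += amount
    return dict(partial)


def reduce_phase(partials):
    """Simulated reduce task: merge the per-split partial sums."""
    totals = defaultdict(float)
    for partial in partials:
        for group, value in partial.items():
            totals[group] += value
    return dict(totals)


def join_aggregate(fact_rows, dim_table, num_splits=2):
    """SUM(amount) GROUP BY dimension attribute, over simulated splits."""
    key_ref = build_key_reference(dim_table)
    splits = [fact_rows[i::num_splits] for i in range(num_splits)]
    return reduce_phase(map_phase(split, key_ref) for split in splits)


result = join_aggregate(orders, customers)
print(result)
```

In this sketch the data sent from the map side to the reduce side is one partial sum per group per split, which is the kind of saving in temporary data and communication that the abstract names as its goal; how the authors actually achieve it is not stated in this record.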
Copyright National Center for Philosophy and Social Sciences Documentation. All rights reserved.