Journal: International Journal of Grid and Distributed Computing
Print ISSN: 2005-4262
Year: 2015
Volume: 8
Issue: 2
Pages: 273-284
DOI: 10.14257/ijgdc.2015.8.2.24
Publisher: SERSC
Abstract: Building on the basic idea of the PageRank algorithm and on the MapReduce distributed programming model, the paper first proposes a parallel PageRank algorithm based on adjacency lists that is suitable for massive data processing. Then, after examining the iterative nature underlying PageRank, it presents an iteration acceleration model based on vector computation. Using this acceleration model, the paper next proposes a parallel PageRank algorithm with power iteration acceleration on MapReduce. Finally, experimental analysis shows that both proposed algorithms are suitable for massive data processing and that the second one significantly reduces the number of iterations and improves the efficiency of the PageRank algorithm.
Keywords: PageRank; Power Iteration Acceleration; Parallel; MapReduce
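
The abstract names an adjacency-list PageRank expressed in MapReduce but gives no details. The following is a minimal single-process sketch of that general idea, not the paper's algorithm: the function names, the damping factor of 0.85, and the assumption that every node has at least one out-link are all illustrative choices, and the power iteration acceleration model is not covered.

```python
from collections import defaultdict

DAMPING = 0.85  # assumed damping factor; the abstract does not state one


def map_phase(node, rank, neighbors):
    """Emit the node's adjacency list plus an equal rank share per out-link."""
    yield node, ("links", neighbors)
    for dest in neighbors:
        yield dest, ("share", rank / len(neighbors))


def reduce_phase(values, num_nodes):
    """Sum incoming shares and rebuild the (rank, neighbors) record."""
    neighbors, total = [], 0.0
    for kind, payload in values:
        if kind == "links":
            neighbors = payload
        else:
            total += payload
    rank = (1.0 - DAMPING) / num_nodes + DAMPING * total
    return rank, neighbors


def pagerank_iteration(graph):
    """Run one map/shuffle/reduce round over {node: (rank, neighbors)}."""
    grouped = defaultdict(list)  # stands in for the MapReduce shuffle step
    for node, (rank, neighbors) in graph.items():
        for key, value in map_phase(node, rank, neighbors):
            grouped[key].append(value)
    n = len(graph)
    return {node: reduce_phase(vals, n) for node, vals in grouped.items()}


def pagerank(adjacency, iterations=20):
    """Iterate from a uniform start; assumes every node is a key with out-links."""
    n = len(adjacency)
    graph = {node: (1.0 / n, list(nbrs)) for node, nbrs in adjacency.items()}
    for _ in range(iterations):
        graph = pagerank_iteration(graph)
    return {node: rank for node, (rank, _) in graph.items()}
```

For example, `pagerank({"a": ["b", "c"], "b": ["c"], "c": ["a"]})` returns one rank per node. Passing the adjacency list through the map output alongside the rank shares is what lets each round feed the next without a separate join against the graph.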