期刊名称:International Journal of Computer Science & Information Technology (IJCSIT)
印刷版ISSN:0975-4660
电子版ISSN:0975-3826
出版年度:2015
卷号:7
期号:4
页码:119
出版社:Academy & Industry Research Collaboration Center (AIRCC)
摘要:Map Reduce has gained remarkable significance as a prominent parallel data processing tool in theresearch community, academia and industry with the spurt in volume of data that is to be analyzed. MapReduce is used in different applications such as data mining, data analytics where massive data analysis isrequired, but still it is constantly being explored on different parameters such as performance andefficiency. This survey intends to explore large scale data processing using MapReduce and its variousimplementations to facilitate the database, researchers and other communities in developing the technicalunderstanding of the MapReduce framework. In this survey, different MapReduce implementations areexplored and their inherent features are compared on different parameters. It also addresses the openissues and challenges raised on fully functional DBMS/Data Warehouse on MapReduce. The comparisonof various Map Reduce implementations is done with the most popular implementation Hadoop and othersimilar implementations using other platforms.
关键词:MapReduce; Parallel Data Processing tools; MapReduceFrameworks; Hadoop; DBMS/DataWarehouse.