期刊名称:International Journal of Advanced Research in Computer Engineering & Technology (IJARCET)
印刷版ISSN:2278-1323
出版年度:2014
卷号:3
期号:5
页码:1571-1575
出版社:Shri Pannalal Research Institute of Technolgy
摘要:Big data has become a buzzword in the recent years. Big data is used to describe a massive volume of both structured and unstructured data that is so large that its difficult to process using traditional database and software techniques. Big data is a collection of massive and complex data sets that include the huge quantities of data, social media analytics, data management capabilities and real time data. Big data refers to various forms of large information sets that require special computational platforms in order to be analyzed. Big data is characterized by three Vs namely volume, variety and velocity. Hadoop framework supports the processing of large data sets in a distributed computing environment. Hadoop is the core of the Hadoop File System and MapReduce, well designed to handle huge volumes of data across a large number of nodes. This paper presents the processing functionality of Hadoop to distribute computational power in processing the massive data.