文章基本信息

标题：A Narrative Study on Big Data
本地全文：下载
作者：Meenakshi Jaiswal ; Rubal Jeet
期刊名称：International Journal of Innovative Research in Computer and Communication Engineering
印刷版ISSN：2320-9798
电子版ISSN：2320-9801
出版年度：2017
卷号：5
期号：4
页码：6812
DOI：10.15680/IJIRCCE.2017.0504033
出版社：S&S Publications
摘要：Big Data, the analysis of large quantities of data to gain new insight has become a ubiquitous phrase. Inrecent years, day by day the data is growing at a staggering rate. One of the efficient technologies that deal with the BigData is Hadoop and Apache Spark, which will be discussed in this paper. This paper includes many libraries, bindingsfor different popular languages etc. Objectives were to compare Hadoop and Spark .As a result Spark continues to beheavily developed and maintained, next generation software for big data processing are being released. Apache Sparkwas able to analyze streamed tweets with very minor latency of few seconds. This proves that, despite being big generalpurpose, Interactive and flexible big data processing engine [3]. The process of analyzing big data using spark, thecouple of improvement areas were identified as of importance should be persuaded as future work.
关键词：Big Data; Apache Spark; Hadoop YARN; HDFS; Hadoop MapReduce