Article Information

  • Title: A Survey on Big Data Analytics Using Hadoop Ecosystem Tools
  • Authors: Monika Yadav; Sonal Chaudhary
  • Journal: International Journal of Computer Science and Information Technologies
  • Electronic ISSN: 0975-9646
  • Year: 2016
  • Volume: 7
  • Issue: 4
  • Pages: 2100-2102
  • Publisher: TechScience Publications
  • Abstract: Big data is the term for any collection of data sets so large and complex that it becomes difficult to process using traditional data processing applications. The challenges include analysis, capture, curation, search, sharing, storage, transfer, visualization, and privacy violations. The trend toward larger data sets is due to the additional information that can be derived from analysis of a single large set of related data, as compared to separate smaller sets with the same total amount of data, allowing correlations to be found to "spot business trends, prevent diseases, combat crime and so on." Big data is difficult to work with using most database management systems and desktop statistics and visualization packages, requiring instead "massively parallel software running on tens, hundreds, or perhaps thousands of servers". Big data usually includes data sets with sizes beyond the ability of commonly used software tools to capture, curate, manage, and process data within a tolerable time period. Big data "size" is a constantly moving target, ranging from dozens of terabytes to several petabytes of data. Big data is also a set of techniques and technologies that require new forms of integration to uncover large hidden values from large datasets that are diverse, complex, and of an enormous scale. A big data environment is used to acquire, organize, and analyze the various types of data. One observation about the MapReduce framework is that it generates a large amount of intermediate data. Once the tasks finish, this rich data has to be discarded, because MapReduce is unable to make use of it.