期刊名称:International Journal of Computer Science and Information Technologies
电子版ISSN:0975-9646
出版年度:2016
卷号:7
期号:4
页码:2100-2102
出版社:TechScience Publications
摘要:Big data is the term for any assortment of data setsthus massive and complex that it becomes tough to processusing traditional processing applications. The challengesinclude analysis, capture, curation, search, sharing, storage,transfer, image, and privacy violations. The trend to largerdata sets is because of the extra info derived from analysis ofone massive set of connected data, as compared to separatesmaller sets with identical total quantity of data, allowingcorrelations to be found to "spot business trends, preventdiseases, combat crime so on." huge data is difficult to figurewith exploitation most electronic database managementsystems and desktop statistics and visualization packages,requiring instead "massively parallel software running ontens, hundreds, or perhaps thousands of servers". Big datasometimes includes data sets with sizes on the far side thepower of commonly used software tools to capture, curate,manage, and process data inside a tolerable time period. hugedata "size" is a perpetually moving target, as of its startingfrom many dozen terabytes to several petabytes of data. hugedata may be a set of techniques and technologies that neednew varieties of integration to uncover massive hidden valuesfrom massive datasets that are various, complex, and of anenormous scale. Big data environment is employed to amass,organize and analyze the various varieties of data. There is anobservation concerning Map Reduce framework thatframework generates great deal of intermediate data.Therefore, in addition because the tasks finishes there iswould like of throwing that rich data, because MapReduce isunable to utilize them.