Journal title: IMPACT: International Journal of Research in Applied, Natural and Social Sciences
Print ISSN: 2347-4580
Electronic ISSN: 2321-8851
Publication year: 2016
Volume: 4
Issue: 8
Pages: 101-106
Language: English
Publisher: IMPACT Journals
Abstract: This paper presents a review of a modern technology called Hadoop, which is used for managing very large amounts of data. Multi-petabyte data sets are a challenge for companies to process effectively and efficiently, and it is not possible to talk about Big Data for very long without running into the elephant in the room. Processing data that resides at distributed locations is complex. A solution is needed, and Hadoop, open-source software released under the Apache License, provides one. It stores enormous data sets across distributed clusters of servers and then runs “distributed” analysis applications in each cluster. Data applications continue to run even when individual servers or clusters fail. Hadoop is almost completely modular: owing to the flexibility of its architecture, almost any of its components can be swapped out for a different software tool.