Journal: International Journal of Computer Trends and Technology
Electronic ISSN: 2231-2803
Publication year: 2017
Volume: 48
Issue: 1
Pages: 19-23
DOI: 10.14445/22312803/IJCTT-V48P105
Publisher: Seventh Sense Research Group
Abstract: The term big data refers to data sets whose volume, variability, and velocity make them difficult to capture, manage, process, or analyze. Hadoop can be used to examine such large amounts of data. Hadoop is an open-source software project that enables the distributed processing of large data sets across a cluster of commodity servers. ETL tools extract important information from various data sources, apply transformations to the data during the transformation phase, and then load it into the big data store. HDFS (Hadoop Distributed File System) is a distributed file system designed to hold very large amounts of data (petabytes or even zettabytes) and to provide high-throughput access to this information. This paper studies the MapReduce method, which is required to implement big data analysis using HDFS. It reviews the related topics of big data analytics, Hadoop, ETL, and MapReduce.