期刊名称:International Journal of Computer Science and Information Technologies
电子版ISSN:0975-9646
出版年度:2016
卷号:7
期号:6
页码:2402-2404
出版社:TechScience Publications
摘要:Twitter, one of the largest and famous social mediasite receives millions of tweets every day on variety ofimportant topic. This large amount of raw data can be usedfor industrial , Social, Economic, Government policies orbusiness purpose by organizing according to our need andprocessing. Hadoop is one of the best tool options for twitterdata analysis and hadoop works for distributed Big data ,Streaming data , Time Stamped data , text data etc. Thispaper discuss how to use FLUME for extracting twitter dataand store it into HDFS for analysis, and after that we are usehadoop ecosystem for analysing these data.
关键词:Hadoop; twitter; Flume; social analysis; hadoop;ecosystem.