首页    期刊浏览 2024年11月29日 星期五
登录注册

文章基本信息

  • 标题:Trend Analysis through Hashtags Popularity Level Using Hadoop with Hive
  • 本地全文:下载
  • 作者:Deepak Ranjan ; Dr. Tripti Arjariya ; Dr. Mohit Gangwar
  • 期刊名称:International Journal of Innovative Research in Science, Engineering and Technology
  • 印刷版ISSN:2347-6710
  • 电子版ISSN:2319-8753
  • 出版年度:2017
  • 卷号:6
  • 期号:5
  • 页码:9766
  • DOI:10.15680/IJIRSET.2017.0605318
  • 出版社:S&S Publications
  • 摘要:Social media generates massive amounts of data every minute, which is caused by its mainstreamadoption over the past years. Innovations in the industry have enabled new ways of communications between peopleand created many business opportunities. Big Data in social media require effective and advanced processingtechnologies. Purpose of data mining analyses is to find valuable patterns and insights from Twitter data. Thus, analysisfor twitter data is meaningful for both individuals and organizations to make decisions. Due to the huge amount of datagenerated by twitter every day, a system which can store and process big data is becoming a problem. In this paper, wepresent a method to collect twitter data sets, and store and analyze the data sets on Hadoop platform. Here wecan fetch real time twitter data and store it into the HDFS and then we develop an trend analysis mechanism which ishive web interface to analyze the twitter hashtag keywords along with its frequency through which we can get the mosttrendy keywords right now on the twitter through hashtag popularity level [11][13].
  • 关键词:Hadoop; data mining; data analysis; social datas; flume; hive.
国家哲学社会科学文献中心版权所有