首页    期刊浏览 2024年10月05日 星期六
登录注册

文章基本信息

  • 标题:Effective Parallel Processing Social Media Analytics Framework
  • 本地全文:下载
  • 作者:Ravindra Kumar Singh ; Harsh Kumar Verma
  • 期刊名称:Journal of King Saud University @?C Computer and Information Sciences
  • 印刷版ISSN:1319-1578
  • 出版年度:2022
  • 卷号:34
  • 期号:6
  • 页码:2860-2870
  • DOI:10.1016/j.jksuci.2020.04.019
  • 语种:English
  • 出版社:Elsevier
  • 摘要:The widespread adoption of opinion mining and sentiment analysis in higher cognitive processes encourages the need for real-time processing of social media data to capture insights about user's sentiment polarity, user’s opinion, and current trends of the domain. In recent years lots of research were conducted and various machine learning algorithms were developed around the processing of data to achieve higher accuracy while reducing the processing time is still challenging. Big data technologies came unraveled these challenges but they have their own set of complexities along with having hardware deadweight on the system. The contribution of this research paper is to touch upon the mentioned challenges by presenting a climbable, instantaneous and fault-tolerant framework to process real-time data to extract hidden insights within it while not bearing any additional overhead of big data technologies. This framework is versatile enough to support batch processing along with real-time data streams in parallel and distributed environments. Experimental results concluded that 4-threaded parallel architecture of the framework performs at 2X speed compared to single-threaded architecture and shared URLs, embedded Images and Author's meta info it boosting the tweets prediction. Moreover, this research additionally provides a comparison of Support Vector Machines (SVM), Light GBM (LGBM) and Long Short Term Memory (LSTM) supervised machine learning techniques for sentiment analysis and concluded LGBM is the most effective model.
  • 关键词:Social media analytics;Real time analytics;MongoDB;Redis;Python-dash;Visualization;Message broker;Cache management;Pre-processing block;Parallel processing;Data processing framework
国家哲学社会科学文献中心版权所有