期刊名称:Journal of King Saud University @?C Computer and Information Sciences
印刷版ISSN:1319-1578
出版年度:2022
卷号:34
期号:6
页码:2860-2870
DOI:10.1016/j.jksuci.2020.04.019
语种:English
出版社:Elsevier
摘要:The widespread adoption of opinion mining and sentiment analysis in higher cognitive processes encourages the need for real-time processing of social media data to capture insights about user's sentiment polarity, user’s opinion, and current trends of the domain. In recent years lots of research were conducted and various machine learning algorithms were developed around the processing of data to achieve higher accuracy while reducing the processing time is still challenging. Big data technologies came unraveled these challenges but they have their own set of complexities along with having hardware deadweight on the system. The contribution of this research paper is to touch upon the mentioned challenges by presenting a climbable, instantaneous and fault-tolerant framework to process real-time data to extract hidden insights within it while not bearing any additional overhead of big data technologies. This framework is versatile enough to support batch processing along with real-time data streams in parallel and distributed environments. Experimental results concluded that 4-threaded parallel architecture of the framework performs at 2X speed compared to single-threaded architecture and shared URLs, embedded Images and Author's meta info it boosting the tweets prediction. Moreover, this research additionally provides a comparison of Support Vector Machines (SVM), Light GBM (LGBM) and Long Short Term Memory (LSTM) supervised machine learning techniques for sentiment analysis and concluded LGBM is the most effective model.
关键词:Social media analytics;Real time analytics;MongoDB;Redis;Python-dash;Visualization;Message broker;Cache management;Pre-processing block;Parallel processing;Data processing framework