期刊名称:International Journal of Advanced Computer Research
印刷版ISSN:2249-7277
电子版ISSN:2277-7970
出版年度:2019
卷号:9
期号:44
页码:283-292
DOI:10.19101/IJACR.PID90
出版社:Association of Computer Communication Education for National Triumph (ACCENT)
摘要:There is a continued interest in understanding people’s interest through the contents they share online. However, the data generated is massive, characterized by textual jargons and tokens that contain no sentiment or opinion value. One way of reducing the data dimension and pruning of irrelevant features is feature selection. However, the existing approaches of feature selection are still inefficient. Two prominent feature selection methods in sentiment analysis are information gain and ontology-based methods. Information gain has the disadvantage of not considering redundancy between features while ontology-based approach requires a lot of human intervention. The aim of this paper is to review these two methods. The review of these two methods shows that using the two methods in a two-step approach can overcome their limitations and provide an optimal feature set for sentiment analysis.
关键词:Sentiment analysis; Feature selection; Information gain; Ontology.