首页    期刊浏览 2024年11月07日 星期四
登录注册

文章基本信息

  • 标题:Automatic Hate Speech Detection using Machine Learning: A Comparative Study
  • 本地全文:下载
  • 作者:Sindhu Abro ; Sarang Shaikh ; Zahid Hussain Khand
  • 期刊名称:International Journal of Advanced Computer Science and Applications(IJACSA)
  • 印刷版ISSN:2158-107X
  • 电子版ISSN:2156-5570
  • 出版年度:2020
  • 卷号:11
  • 期号:8
  • DOI:10.14569/IJACSA.2020.0110861
  • 出版社:Science and Information Society (SAI)
  • 摘要:The increasing use of social media and information sharing has given major benefits to humanity. However, this has also given rise to a variety of challenges including the spreading and sharing of hate speech messages. Thus, to solve this emerging issue in social media sites, recent studies employed a variety of feature engineering techniques and machine learning algorithms to automatically detect the hate speech messages on different datasets. However, to the best of our knowledge, there is no study to compare the variety of feature engineering techniques and machine learning algorithms to evaluate which feature engineering technique and machine learning algorithm outperform on a standard publicly available dataset. Hence, the aim of this paper is to compare the performance of three feature engineering techniques and eight machine learning algorithms to evaluate their performance on a publicly available dataset having three distinct classes. The experimental results showed that the bigram features when used with the support vector machine algorithm best performed with 79% off overall accuracy. Our study holds practical implication and can be used as a baseline study in the area of detecting automatic hate speech messages. Moreover, the output of different comparisons will be used as state-of-art techniques to compare future researches for existing automated text classification techniques.
  • 关键词:Hate speech; online social networks; natural language processing; text classification; machine learning
国家哲学社会科学文献中心版权所有