首页    期刊浏览 2025年09月21日 星期日
登录注册

文章基本信息

  • 标题:Arabic Text Classification using Feature-Reduction Techniques for Detecting Violence on Social Media
  • 本地全文:下载
  • 作者:Hissah ALSaif ; Taghreed Alotaibi
  • 期刊名称:International Journal of Advanced Computer Science and Applications(IJACSA)
  • 印刷版ISSN:2158-107X
  • 电子版ISSN:2156-5570
  • 出版年度:2019
  • 卷号:10
  • 期号:4
  • 页码:77-87
  • DOI:10.14569/IJACSA.2019.0100409
  • 出版社:Science and Information Society (SAI)
  • 摘要:With the current increase in the number of online users, there has been a concomitant increase in the amount of data shared online. Techniques for discovering knowledge from these data can provide us with valuable information when it comes to detecting different problems, including violence. Violence is one of the significant problems humanity has faced in recent years all over the world, and this is especially a problem in Arabic countries. To address this issue, this research focuses on detecting violence-related tweets to help in solving this problem. Text mining is an important technique that can be used to find and predict information from text. In this study, a text classification model is built for detecting violence in Arabic dialects on Twitter using different feature-reduction approaches. The experiment comprises bagging, K-nearest neighbors (KNN), and Bayesian boosting using different extraction features, namely, root-based stemming, light stemming, and n-grams. In addition, the study used the following feature-reduction techniques: support vector machine (SVM), Chi-squared (CHI), the Gini index, correlation, rules, information gain (IG), deviation, symmetrical uncertainty, and the IG ratio. The experiment showed that the bagging with tri-gram approach has the highest accuracy at 86.61%, and a combination of IG with SVM from reduction features registers an accuracy of 90.59%.
  • 关键词:Violence; text mining; classification; feature-reduction techniques; Arabic; Twitter posts
国家哲学社会科学文献中心版权所有