首页    期刊浏览 2025年07月05日 星期六
登录注册

文章基本信息

  • 标题:Arabic Tweets Categorization Based on Rough Set Theory
  • 本地全文:下载
  • 作者:Mohammed Bekkali ; Abdelmonaime Lachkar
  • 期刊名称:Computer Science & Information Technology
  • 电子版ISSN:2231-5403
  • 出版年度:2014
  • 卷号:4
  • 期号:11
  • 页码:83-96
  • DOI:10.5121/csit.2014.41109
  • 出版社:Academy & Industry Research Collaboration Center (AIRCC)
  • 摘要:Twitter is a popular microblogging service where users create status messages (called搕weets?. These tweets sometimes express opinions about different topics; and are presented tothe user in a chronological order. This format of presentation is useful to the user since thelatest tweets from are rich on recent news which is generally more interesting than tweets aboutan event that occurred long time back. Merely, presenting tweets in a chronological order maybe too embarrassing to the user, especially if he has many followers. Therefore, there is a needto separate the tweets into different categories and then present the categories to the user.Nowadays Text Categorization (TC) becomes more significant especially for the Arabiclanguage which is one of the most complex languages.In this paper, in order to improve the accuracy of tweets categorization a system based onRough Set Theory is proposed for enrichment the document抯 representation. The effectivenessof our system was evaluated and compared in term of the F-measure of the Na飗e Bayesianclassifier and the Support Vector Machine classifier.
  • 关键词:Arabic Language; Text Categorization; Rough Set Theory; Twitter; Tweets.
国家哲学社会科学文献中心版权所有