首页    期刊浏览 2024年11月08日 星期五
登录注册

文章基本信息

  • 标题:The Role of Rare Terms in Enhancing the Performance of Polynomial Networks Based Text Categorization
  • 本地全文:下载
  • 作者:Mayy M. Al-Tahrawi
  • 期刊名称:Journal of Intelligent Learning Systems and Applications
  • 印刷版ISSN:2150-8402
  • 电子版ISSN:2150-8410
  • 出版年度:2013
  • 卷号:5
  • 期号:2
  • 页码:84-89
  • DOI:10.4236/jilsa.2013.52009
  • 出版社:Scientific Research Publishing
  • 摘要:In this paper, the role of rare or infrequent terms in enhancing the accuracy of English Text Categorization using Polynomial Networks (PNs) is investigated. To study the impact of rare terms in enhancing the accuracy of PNs-based text categorization, different term reduction criteria as well as different term weighting schemes were experimented on the Reuters Corpus using PNs. Each term weighting scheme on each reduced term set was tested once keeping the rare terms and another time removing them. All the experiments conducted in this research show that keeping rare terms substantially improves the performance of Polynomial Networks in Text Categorization, regardless of the term reduction method, the number of terms used in classification, or the term weighting scheme adopted.
  • 关键词:Polynomial Networks; Text Categorization; Document Classification; Infrequent Terms; Rare Terms
国家哲学社会科学文献中心版权所有