首页    期刊浏览 2024年11月08日 星期五
登录注册

文章基本信息

  • 标题:Arabic Text Categorization Algorithm Using Vector Evaluation Method
  • 本地全文:下载
  • 作者:Ashraf Odeh ; Aymen Abu-Errub ; Qusai Shambour
  • 期刊名称:International Journal of Computer Science & Information Technology (IJCSIT)
  • 印刷版ISSN:0975-4660
  • 电子版ISSN:0975-3826
  • 出版年度:2014
  • 卷号:6
  • 期号:6
  • 页码:83
  • 出版社:Academy & Industry Research Collaboration Center (AIRCC)
  • 摘要:Text categorization is the process of grouping documents into categories based on their contents. Thisprocess is important to make information retrieval easier, and it became more important due to the hugetextual information available online. The main problem in text categorization is how to improve theclassification accuracy. Although Arabic text categorization is a new promising field, there are a fewresearches in this field. This paper proposes a new method for Arabic text categorization using vectorevaluation. The proposed method uses a categorized Arabic documents corpus, and then the weights of thetested document's words are calculated to determine the document keywords which will be compared withthe keywords of the corpus categorizes to determine the tested document's best category.
  • 关键词:Text Categorization; Arabic Text Classification; Information Retrieval; Data Mining; Machine Learning
国家哲学社会科学文献中心版权所有