首页    期刊浏览 2024年10月05日 星期六
登录注册

文章基本信息

  • 标题:Developing Two Different Novel Techniques for Arabic Text Stemming
  • 本地全文:下载
  • 作者:Mohammad Mustafa ; Afag Salah Aldeen ; Mohammed E. Zidan
  • 期刊名称:Intelligent Information Management
  • 印刷版ISSN:2150-8194
  • 电子版ISSN:2150-8208
  • 出版年度:2019
  • 卷号:11
  • 期号:1
  • 页码:1-23
  • DOI:10.4236/iim.2019.111001
  • 出版社:Scientific Research Publishing
  • 摘要:Stemming is used to produce stem or root of words. The process is vital to different research fields such as text mining, sentiment analysis, and text categorization, etc. Several techniques have been proposed to stemming Arabic text and among them, Khoja and light-10 stemmers are the most widely used. In this paper, we propose and evaluate two different stemming techniques to Arabic that are based on light stemming techniques. The new stemmers are compared to best reported light stemmer, which is light-10. Results and experiments, which were conducted using standard collections, reveal that The proposed stemmers yield 5.13% and 13.1% improvement in retrieval performance over light 10 with 0.369 average precision and 0.397, respectively and the improvement is statistically significant.
  • 关键词:Arabic Language;Arabic Information Retrieval;Light Stemming;Light 10;Extended Light-Stemmer;Linguistic-Based Stemmer
国家哲学社会科学文献中心版权所有