首页    期刊浏览 2025年07月03日 星期四
登录注册

文章基本信息

  • 标题:Effective Arabic Stemmer Based Hybrid Approach for Arabic Text Categorization
  • 本地全文:下载
  • 作者:Meryeme Hadni ; Said Alaoui Ouatik ; Abdelmonaime Lachkar
  • 期刊名称:International Journal of Data Mining & Knowledge Management Process
  • 印刷版ISSN:2231-007X
  • 电子版ISSN:2230-9608
  • 出版年度:2013
  • 卷号:3
  • 期号:4
  • 出版社:Academy & Industry Research Collaboration Center (AIRCC)
  • 摘要:Text pre-processing of Arabic Language is a challenge and crucial stage in Text Categorization (TC) particularly and Text Mining (TM) generally. Stemming algorithms can be employed in Arabic text pre-processing to reduces words to their stems/or root. Arabic stemming algorithms can be ranked, according to three category, as root-based approach (ex. Khoja); stem-based approach (ex. Larkey); and statistical approach (ex. N-Garm).However, no stemming of this language is perfect: The existing stemmers have a small efficiency. In this paper, in order to improve the accuracy of stemming and therefore the accuracy of our proposed TC system, an efficient hybrid method is proposed for stemming Arabic text. The effectiveness of the aforementioned four methods was evaluated and compared in term of the F-measure of the Na.ve Bayesian classifier and the Support Vector Machine classifier used in our TC system. The proposed stemming algorithm was found to supersede the other stemming ones: The obtained results illustrate that using the proposed stemmer enhances greatly the performance of Arabic Text Categorization
  • 关键词:Arabic Language; Stemming Approaches; Text Categorization
国家哲学社会科学文献中心版权所有