首页    期刊浏览 2024年11月07日 星期四
登录注册

文章基本信息

  • 标题:A Hybrid Method N-Grams-TFIDF with radial basis for indexing and classification of Arabic documents
  • 本地全文:下载
  • 作者:Taher Zaki ; Youssef Es-saady ; Driss Mammass
  • 期刊名称:International Journal of Software Engineering and Its Applications
  • 印刷版ISSN:1738-9984
  • 出版年度:2014
  • 卷号:8
  • 期号:2
  • 页码:127-144
  • DOI:10.14257/ijseia.2014.8.2.13
  • 出版社:SERSC
  • 摘要:In this paper, we propose a hybrid system for contextual and semantic indexing of Arabic documents, bringing an improvement to classical models based on n-grams and the TFIDF model. This new approach takes into account the concept of the semantic vicinity of terms. We proceed in fact by the calculation of similarity between words using an hybridization of NGRAMs-TFIDF statistical measures and a kernel function in order to identify relevant descriptors. Terminological resources such as graphs and semantic dictionaries are integrated into the system to improve the indexing and the classification processes.
  • 关键词:Arabic documents; classification; indexing; radial basis function; n-grams; ; tfidf
国家哲学社会科学文献中心版权所有