Article information

  • Title: Toward a Complex System for Context Discovery to Index Arabic Documents
  • Authors: Mohamed Salim El Bazzi; Driss Mammass; Abdelatif Ennaji
  • Journal: Journal of Computers
  • Print ISSN: 1796-203X
  • Publication year: 2018
  • Volume: 13
  • Issue: 8
  • Pages: 955-962
  • DOI: 10.17706/jcp.13.8.955-962
  • Language: English
  • Publisher: Academy Publisher
  • Abstract: Text indexing aims to take full advantage of textual data to help intelligent programs make relevant decisions. Exploring a large collection of text documents and disclosing the semantic information hidden in unstructured text requires an effective indexing system. In this paper, we propose a new approach for indexing Arabic texts. Because it is based on semantic proximity and takes into account the contexts contained in each document, we call our method contextual indexing. Several algorithms are used for keyword extraction, each emphasizing a different criterion; however, we target the most descriptive keywords for each document. We also propose a new approach for document modeling. We compared the results obtained using our method with those obtained by an indexing system based on a standard statistical method. The experimental results demonstrate the effectiveness of our approach.
  • Keywords: Contextual indexation; semantic proximity; clustering; Arabic documents.