文章基本信息

标题：Toward a Complex System for Context Discovery to Index Arabic Documents
本地全文：下载
作者：Mohamed Salim El Bazzi ; Driss Mammass ; Abdelatif Ennaji 等
期刊名称：Journal of Computers
印刷版ISSN：1796-203X
出版年度：2018
卷号：13
期号：8
页码：955-962
DOI：10.17706/jcp.13.8.955-962
语种：English
出版社：Academy Publisher
摘要：Text indexing aims to take the full advantage of textual data to help intelligent programs to make relevant decisions. In order to explore a large amount of textual documents, and to disclose semantic information hidden in unstructured documents, like texts, an effective indexation system is required. In this paper, we propose a new approach for indexing Arabic texts. Based on the semantic proximity and taking into account the contexts contained in each document, our method is denoted contextual indexing. Several algorithms are used for keywords extraction, each of them emphasizes some criterion. However, we target the most descriptive keywords for each document. We also propose a new approach for document modeling. We compared the results obtained using our method with those obtained by an indexation system based on a standard statistical method. The experimental results demonstrate the performance of our approach.
关键词：Contextual indexation; semantic proximity; clustering; Arabic documents.