首页    期刊浏览 2024年11月30日 星期六
登录注册

文章基本信息

  • 标题:Arabic ontology extraction model from unstructured text
  • 本地全文:下载
  • 作者:Yasser Mohamed Saber ; Hala Abdel-Galil ; Mohamed Abd El-Fatah Belal
  • 期刊名称:Journal of King Saud University @?C Computer and Information Sciences
  • 印刷版ISSN:1319-1578
  • 出版年度:2022
  • 卷号:34
  • 期号:8
  • 页码:6066-6076
  • 语种:English
  • 出版社:Elsevier
  • 摘要:Ontologies enable the sharing and reusing of knowledge, allowing the interoperation and combination of information systems. Automatic ontological relation extraction from text is an important factor for representing documents and their contents in a useful computerized manner. This paper proposes a model for automatic ontology extraction from Arabic text by parsing sentences and extracting part of speech (POS). Then, the proposed rule-based model is applied to extract the triple attributes of a sentence (subject, predicate, and object) from the parsing tree. Finally, the semantic relations can be used to extract new triples inferred from the initially extracted triples. The results were evaluated using two methods: the first method was based on a comparison with manual extraction because there is no standard method for measuring Arabic triple extraction; the second method is based on translating the dataset into English and comparing the output results using the Stanford dependencies extraction web tool. The proposed model achieved an accuracy of 73.6% for Arabic triple extraction and a 35% increase in overall triples owing to new inferred triples. However, when the dataset was translated into English and tested by the Stanford dependencies extraction web tool, the accuracy was 71.8% without inferred triples.
国家哲学社会科学文献中心版权所有