Article Information

  • Title: The Combination of YAKE and Language Processing for Unsupervised Term Extraction Ontology Learning
  • Authors: Rajif Agung Yunmar ; Andika Setiawan ; Hartanto Tantriawan
  • Journal: IOP Conference Series: Earth and Environmental Science
  • Print ISSN: 1755-1307
  • Electronic ISSN: 1755-1315
  • Publication year: 2020
  • Volume: 537
  • Issue: 1
  • DOI:10.1088/1755-1315/537/1/012023
  • Language: English
  • Publisher: IOP Publishing
  • Abstract: Information spread on the internet is available in the form of unstructured text that humans can understand but that is difficult for machines to understand. Ontology learning is a method that transforms unstructured information into a form that machines can understand, namely an ontology. Term extraction is one of the required stages of ontology learning. This stage produces the important terms related to a topic, which are later grouped into classes. In this study, the term extraction method used is YAKE. The contribution of this research is to investigate the effects of language processing, such as stemming and stopword removal, when combined with the YAKE method at the term extraction stage. The language processing techniques are applied to the test corpus, and the result is used as input to YAKE term extraction. Testing is conducted under several scenarios: plain YAKE, stemming+YAKE, stopword removal+YAKE, and a combination of all three. The extraction scenarios are evaluated by an expert to measure term correctness. The research shows that the combination of stopword removal+YAKE provides the best accuracy, at 48%.
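
The four test scenarios named in the abstract can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the stopword list, the suffix-stripping stemmer, the `extract` callable, and the accuracy definition (expert-approved terms divided by extracted terms) are all assumptions, since the abstract does not specify them.

```python
import re
from typing import Callable, Dict, List, Set

# Toy stopword list (assumption; the paper's actual list is not given).
STOPWORDS = {"the", "is", "a", "an", "of", "in", "to", "and", "that", "are", "be", "by"}


def tokenize(text: str) -> List[str]:
    """Lowercase the text and keep alphabetic tokens only."""
    return re.findall(r"[a-z]+", text.lower())


def naive_stem(token: str) -> str:
    """Crude suffix stripping, a stand-in for a real stemmer."""
    for suffix in ("ing", "es", "s"):
        if token.endswith(suffix) and len(token) - len(suffix) >= 3:
            return token[: -len(suffix)]
    return token


def preprocess(text: str, use_stemming: bool, use_stopword_removal: bool) -> str:
    """Apply the selected language-processing steps before term extraction."""
    if not (use_stemming or use_stopword_removal):
        return text  # "plain" scenario: the extractor sees the raw text
    # Note: tokenizing drops casing and punctuation, a simplification of this sketch.
    tokens = tokenize(text)
    if use_stopword_removal:
        tokens = [t for t in tokens if t not in STOPWORDS]
    if use_stemming:
        tokens = [naive_stem(t) for t in tokens]
    return " ".join(tokens)


# Scenario name -> (use_stemming, use_stopword_removal)
SCENARIOS = {
    "plain": (False, False),
    "stemming": (True, False),
    "stopword_removal": (False, True),
    "stemming+stopword_removal": (True, True),
}


def run_scenarios(text: str, extract: Callable[[str], List[str]]) -> Dict[str, List[str]]:
    """Run a term extractor (hypothetical callable) on each preprocessed variant."""
    return {
        name: extract(preprocess(text, stem, stop))
        for name, (stem, stop) in SCENARIOS.items()
    }


def accuracy(extracted: List[str], expert_approved: Set[str]) -> float:
    """Share of extracted terms judged correct (assumed metric definition)."""
    if not extracted:
        return 0.0
    return sum(1 for term in extracted if term in expert_approved) / len(extracted)
```

Here `extract` is any function that maps a text to a list of terms. In practice it could wrap a YAKE implementation, and the set passed to `accuracy` would come from the expert's judgments.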