首页    期刊浏览 2024年07月05日 星期五
登录注册

文章基本信息

  • 标题:Acquiring Domain-Specific Knowledge for WordNet from a Terminological Database
  • 本地全文:下载
  • 作者:Alberto Sim{~o}es ; Xavier G{'o}mez Guinovart
  • 期刊名称:OASIcs : OpenAccess Series in Informatics
  • 电子版ISSN:2190-6807
  • 出版年度:2019
  • 卷号:74
  • 页码:1-13
  • DOI:10.4230/OASIcs.SLATE.2019.6
  • 出版社:Schloss Dagstuhl -- Leibniz-Zentrum fuer Informatik
  • 摘要:In this research we explore a terminological database (Termoteca) in order to expand the Portuguese and Galician wordnets (PULO and Galnet) with the addition of new synset variants (word forms for a concept), usage examples for the variants, and synset glosses or definitions. The methodology applied in this experiment is based on the alignment between concepts of WordNet (synsets) and concepts described in Termoteca (terminological records), taking into account the lexical forms in both resources, their morphological category and their knowledge domains, using the information provided by the WordNet Domains Hierarchy and the Termoteca field domains to reduce the incidence of polysemy and homography in the results of the experiment. The results obtained confirm our hypothesis that the combined use of the semantic domain information included in both resources makes it possible to minimise the problem of lexical ambiguity and to obtain a very acceptable index of precision in terminological information extraction tasks, attaining a precision above 89% when there are two or more different languages sharing at least one lexical form between the synset in Galnet and the Termoteca record.
  • 关键词:WordNet; Terminology; Lexical Resources; Natural Language Processing
国家哲学社会科学文献中心版权所有