首页    期刊浏览 2024年12月01日 星期日
登录注册

文章基本信息

  • 标题:Learning Domain Ontologies from Document Warehouses and Dedicated Web Sites
  • 本地全文:下载
  • 作者:Roberto Navigli ; Paola Velardi
  • 期刊名称:Computational Linguistics
  • 印刷版ISSN:0891-2017
  • 电子版ISSN:1530-9312
  • 出版年度:2004
  • 卷号:30
  • 期号:2
  • 页码:151-179
  • DOI:10.1162/089120104323093276
  • 语种:English
  • 出版社:MIT Press
  • 摘要:We present a method and a tool, OntoLearn, aimed at the extraction of domain ontologies from Web sites, and more generally from documents shared among the members of virtual organizations. OntoLearn first extracts a domain terminology from available documents. Then, complex domain terms are semantically interpreted and arranged in a hierarchical fashion. Finally, a general-purpose ontology, WordNet, is trimmed and enriched with the detected domain concepts. The major novel aspect of this approach is semantic interpretation, that is, the association of a complex concept with a complex term. This involves finding the appropriate WordNet concept for each word of a terminological string and the appropriate conceptual relations that hold among the concept components. Semantic interpretation is based on a new word sense disambiguation algorithm, called structural semantic interconnections.
国家哲学社会科学文献中心版权所有