Article Information

  • Title: OntoLearn Reloaded: A Graph-Based Algorithm for Taxonomy Induction
  • Authors: Paola Velardi; Stefano Faralli; Roberto Navigli
  • Journal: Computational Linguistics
  • Print ISSN: 0891-2017
  • Electronic ISSN: 1530-9312
  • Publication year: 2013
  • Volume: 39
  • Issue: 3
  • Pages: 665-707
  • DOI: 10.1162/COLI_a_00146
  • Language: English
  • Publisher: MIT Press
  • Abstract: In 2004 we published in this journal an article describing OntoLearn, one of the first systems to automatically induce a taxonomy from documents and Web sites. Since then, OntoLearn has continued to be an active area of research in our group and has become a reference work within the community. In this paper we describe our next-generation taxonomy learning methodology, which we name OntoLearn Reloaded. Unlike many taxonomy learning approaches in the literature, our novel algorithm learns both concepts and relations entirely from scratch via the automated extraction of terms, definitions, and hypernyms. This results in a very dense, cyclic and potentially disconnected hypernym graph. The algorithm then induces a taxonomy from this graph via optimal branching and a novel weighting policy. Our experiments show that we obtain high-quality results, both when building brand-new taxonomies and when reconstructing sub-hierarchies of existing taxonomies.