首页    期刊浏览 2024年07月19日 星期五
登录注册

文章基本信息

  • 标题:Enriching Documents with Context Terms from Cross-Domain Ontologies
  • 本地全文:下载
  • 作者:Benjamin KÖHNCKE ; Wolf-Tilo BALKE
  • 期刊名称:Information and Media Technologies
  • 电子版ISSN:1881-0896
  • 出版年度:2015
  • 卷号:10
  • 期号:2
  • 页码:294-304
  • DOI:10.11185/imt.10.294
  • 出版社:Information and Media Technologies Editorial Board
  • 摘要:Entity-centric search has become a demanding problem for many domains on the Web. In particular, the suitable contextualization of result documents poses challenges in terms of selecting most adequate indexing terms for later retrieval. This holds even more, if no generally recognized ontologies for the respective domain are available. In this paper, we show that cross-domain ontology terms are actually more useful for indexing, than salient keywords taken from the documents. Moreover, learning typical contexts for groups of entities from collections indexed by strong cross-domain ontologies can considerably improve retrieval effectiveness. Our extensive experiments prove these results on real world document collections from the area of chemistry and computer science. In fact, our evaluation in different document retrieval scenarios show a vital increase of retrieval precision of up to 87% using documents annotated with cross-domain ontology terms as compared to 53% for BM25 searches and 43% for documents annotated with Wikipedia categories.
国家哲学社会科学文献中心版权所有