首页    期刊浏览 2024年11月24日 星期日
登录注册

文章基本信息

  • 标题:Graph-structured Conditional Random Fields for Named Entity Categorization in Wikipedia
  • 本地全文:下载
  • 作者:Yotaro Watanabe ; Masayuki Asahara ; Yuji Matsumoto
  • 期刊名称:人工知能学会論文誌
  • 印刷版ISSN:1346-0714
  • 电子版ISSN:1346-8030
  • 出版年度:2008
  • 卷号:23
  • 期号:4
  • 页码:245-254
  • DOI:10.1527/tjsai.23.245
  • 出版社:The Japanese Society for Artificial Intelligence
  • 摘要:This paper presents a method for categorizing named entities in Wikipedia. In Wikipedia, an anchor text is glossed in a linked HTML text. We formalize named entity categorization as a task of categorizing anchor texts with linked HTML texts which glosses a named entity. Using this representation, we introduce a graph structure in which anchor texts are regarded as nodes. In order to incorporate HTML structure on the graph, three types of cliques are defined based on the HTML tree structure. We propose a method with Conditional Random Fields (CRFs) to categorize the nodes on the graph. Since the defined graph may include cycles, the exact inference of CRFs is computationally expensive. We introduce an approximate inference method using Tree-based Reparameterization (TRP) to reduce computational cost. In experiments, our proposed model obtained significant improvements compare to baseline models that use Support Vector Machines.
  • 关键词:named entity acquisition ; collective classification ; conditional random fields
国家哲学社会科学文献中心版权所有