首页    期刊浏览 2024年11月06日 星期三
登录注册

文章基本信息

  • 标题:Measuring Word Meaning in Context
  • 本地全文:下载
  • 作者:Katrin Erk ; Diana McCarthy ; Nicholas Gaylord
  • 期刊名称:Computational Linguistics
  • 印刷版ISSN:0891-2017
  • 电子版ISSN:1530-9312
  • 出版年度:2013
  • 卷号:39
  • 期号:3
  • 页码:511-554
  • DOI:10.1162/COLI_a_00142
  • 语种:English
  • 出版社:MIT Press
  • 摘要:Word sense disambiguation (WSD) is an old and important task in computational linguistics that still remains challenging, to machines as well as to human annotators. Recently there have been several proposals for representing word meaning in context that diverge from the traditional use of a single best sense for each occurrence. They represent word meaning in context through multiple paraphrases, as points in vector space, or as distributions over latent senses. New methods of evaluating and comparing these different representations are needed. In this paper we propose two novel annotation schemes that characterize word meaning in context in a graded fashion. In WS sim annotation, the applicability of each dictionary sense is rated on an ordinal scale. U sim annotation directly rates the similarity of pairs of usages of the same lemma, again on a scale. We find that the novel annotation schemes show good inter-annotator agreement, as well as a strong correlation with traditional single-sense annotation and with annotation of multiple lexical paraphrases. Annotators make use of the whole ordinal scale, and give very fine-grained judgments that “mix and match” senses for each individual usage. We also find that the U sim ratings obey the triangle inequality, justifying models that treat usage similarity as metric. There has recently been much work on grouping senses into coarse-grained groups. We demonstrate that graded WS sim and U sim ratings can be used to analyze existing coarse-grained sense groupings to identify sense groups that may not match intuitions of untrained native speakers. In the course of the comparison, we also show that the WS sim ratings are not subsumed by any static sense grouping.
国家哲学社会科学文献中心版权所有