首页    期刊浏览 2025年06月12日 星期四
登录注册

文章基本信息

  • 标题:Unsupervised Word Polysemy Quantification with Multiresolution Grids of Contextual Embeddings
  • 本地全文:下载
  • 作者:Christos Xypolopoulos ; Antoine Tixier ; Michalis Vazirgiannis
  • 期刊名称:Conference on European Chapter of the Association for Computational Linguistics (EACL)
  • 出版年度:2021
  • 卷号:2021
  • 页码:3391-3401
  • DOI:10.18653/v1/2021.eacl-main.297
  • 语种:English
  • 出版社:ACL Anthology
  • 摘要:The number of senses of a given word, or polysemy, is a very subjective notion, which varies widely across annotators and resources. We propose a novel method to estimate polysemy based on simple geometry in the contextual embedding space. Our approach is fully unsupervised and purely data-driven. Through rigorous experiments, we show that our rankings are well correlated, with strong statistical significance, with 6 different rankings derived from famous human-constructed resources such as WordNet, OntoNotes, Oxford, Wikipedia, etc., for 6 different standard metrics. We also visualize and analyze the correlation between the human rankings and make interesting observations. A valuable by-product of our method is the ability to sample, at no extra cost, sentences containing different senses of a given word. Finally, the fully unsupervised nature of our approach makes it applicable to any language. Code and data are publicly available https://github.com/ksipos/polysemy-assessment .
国家哲学社会科学文献中心版权所有