首页    期刊浏览 2025年06月14日 星期六
登录注册

文章基本信息

  • 标题:Automatically Identifying the Source Words of Lexical Blends in English
  • 本地全文:下载
  • 作者:Paul Cook ; Suzanne Stevenson
  • 期刊名称:Computational Linguistics
  • 印刷版ISSN:0891-2017
  • 电子版ISSN:1530-9312
  • 出版年度:2010
  • 卷号:36
  • 期号:1
  • 页码:129-149
  • DOI:10.1162/coli.2010.36.1.36104
  • 语种:English
  • 出版社:MIT Press
  • 摘要:Newly coined words pose problems for natural language processing systems because they are not in a system's lexicon, and therefore no lexical information is available for such words. A common way to form new words is lexical blending, as in cosmeceutical, a blend of cosmetic and pharmaceutical . We propose a statistical model for inferring a blend's source words drawing on observed linguistic properties of blends; these properties are largely based on the recognizability of the source words in a blend. We annotate a set of 1,186 recently coined expressions which includes 515 blends, and evaluate our methods on a 324-item subset. In this first study of novel blends we achieve an accuracy of 40% on the task of inferring a blend's source words, which corresponds to a reduction in error rate of 39% over an informed baseline. We also give preliminary results showing that our features for source word identification can be used to distinguish blends from other kinds of novel words.
国家哲学社会科学文献中心版权所有