首页    期刊浏览 2025年06月14日 星期六
登录注册

文章基本信息

  • 标题:Unsupervised Type and Token Identification of Idiomatic Expressions
  • 本地全文:下载
  • 作者:Afsaneh Fazly ; Paul Cook ; Suzanne Stevenson
  • 期刊名称:Computational Linguistics
  • 印刷版ISSN:0891-2017
  • 电子版ISSN:1530-9312
  • 出版年度:2009
  • 卷号:35
  • 期号:1
  • 页码:61-103
  • DOI:10.1162/coli.08-010-R1-07-048
  • 语种:English
  • 出版社:MIT Press
  • 摘要:Idiomatic expressions are plentiful in everyday language, yet they remain mysterious, as it is not clear exactly how people learn and understand them. They are of special interest to linguists, psycholinguists, and lexicographers, mainly because of their syntactic and semantic idiosyncrasies as well as their unclear lexical status. Despite a great deal of research on the properties of idioms in the linguistics literature, there is not much agreement on which properties are characteristic of these expressions. Because of their peculiarities, idiomatic expressions have mostly been overlooked by researchers in computational linguistics. In this article, we look into the usefulness of some of the identified linguistic properties of idioms for their automatic recognition. Specifically, we develop statistical measures that each model a specific property of idiomatic expressions by looking at their actual usage patterns in text. We use these statistical measures in a type-based classification task where we automatically separate idiomatic expressions (expressions with a possible idiomatic interpretation) from similar-on-the-surface literal phrases (for which no idiomatic interpretation is possible). In addition, we use some of the measures in a token identification task where we distinguish idiomatic and literal usages of potentially idiomatic expressions in context.
国家哲学社会科学文献中心版权所有