首页    期刊浏览 2025年07月07日 星期一
登录注册

文章基本信息

  • 标题:Explaining and Generalizing Skip-Gram through Exponential Family Principal Component Analysis
  • 本地全文:下载
  • 作者:Ryan Cotterell ; Adam Poliak ; Benjamin Van Durme
  • 期刊名称:Conference on European Chapter of the Association for Computational Linguistics (EACL)
  • 出版年度:2017
  • 卷号:2017
  • 页码:175-181
  • 语种:English
  • 出版社:ACL Anthology
  • 摘要:The popular skip-gram model induces word embeddings by exploiting the signal from word-context coocurrence. We offer a new interpretation of skip-gram based on exponential family PCA-a form of matrix factorization to generalize the skip-gram model to tensor factorization. In turn, this lets us train embeddings through richer higher-order coocurrences, e.g., triples that include positional information (to incorporate syntax) or morphological information (to share parameters across related words). We experiment on 40 languages and show our model improves upon skip-gram.
国家哲学社会科学文献中心版权所有