首页    期刊浏览 2024年11月07日 星期四
登录注册

文章基本信息

  • 标题:Using Word Embeddings to Enforce Document-Level Lexical Consistency in Machine Translation
  • 本地全文:下载
  • 作者:Eva Martínez Garcia ; Carles Creus ; Cristina España-Bonet
  • 期刊名称:The Prague Bulletin of Mathematical Linguistics
  • 印刷版ISSN:0032-6585
  • 电子版ISSN:1804-0462
  • 出版年度:2017
  • 卷号:108
  • 期号:1
  • 页码:85-96
  • DOI:10.1515/pralin-2017-0011
  • 语种:English
  • 出版社:Walter de Gruyter GmbH
  • 摘要:We integrate new mechanisms in a document-level machine translation decoder to improve the lexical consistency of document translations. First, we develop a document-level feature designed to score the lexical consistency of a translation. This feature, which applies to words that have been translated into different forms within the document, uses word embeddings to measure the adequacy of each word translation given its context. Second, we extend the decoder with a new stochastic mechanism that, at translation time, allows to introduce changes in the translation oriented to improve its lexical consistency. We evaluate our system on English–Spanish document translation, and we conduct automatic and manual assessments of its quality. The automatic evaluation metrics, applied mainly at sentence level, do not reflect significant variations. On the contrary, the manual evaluation shows that the system dealing with lexical consistency is preferred over both a standard sentence-level and a standard document-level phrase-based MT systems.
国家哲学社会科学文献中心版权所有