首页    期刊浏览 2024年11月25日 星期一
登录注册

文章基本信息

  • 标题:Multiple Translation-Engine-based Hypotheses and Edit-Distance-based Rescoring for a Greedy Decoder for Statistical Machine Translation
  • 本地全文:下载
  • 作者:Michael Paul ; Eiichiro Sumita ; Seiichi Yamamoto
  • 期刊名称:Information and Media Technologies
  • 电子版ISSN:1881-0896
  • 出版年度:2006
  • 卷号:1
  • 期号:1
  • 页码:446-460
  • DOI:10.11185/imt.1.446
  • 出版社:Information and Media Technologies Editorial Board
  • 摘要:This paper extends a greedy decoder for statistical machine translation (SMT), which searches for an optimal translation by using SMT models starting from a decoder seed, i.e., the source language input paired with an initial translation hypothesis. First, the outputs generated by multiple translation engines are utilized as the initial translation hypotheses, whereby their variations reduce local optima problems inherent in the search. Second, a rescoring method based on the edit-distance between the initial translation hypothesis and the outputs of the decoder is used to compensate for problems of conventional greedy decoding solely based on statistical models. Our approach is evaluated for the translation of dialogues in the travel domain, and the results show that it drastically improves translation quality.
国家哲学社会科学文献中心版权所有