Article information

  • Title: Word-Order Issues in English-to-Urdu Statistical Machine Translation
  • Authors: Bushra Jawaid; Daniel Zeman
  • Journal: The Prague Bulletin of Mathematical Linguistics
  • Print ISSN: 0032-6585
  • Electronic ISSN: 1804-0462
  • Publication year: 2011
  • Volume: 95
  • Issue: 1
  • Pages: 87-106
  • DOI: 10.2478/v10108-011-0007-0
  • Language: English
  • Publisher: Walter de Gruyter GmbH
  • Abstract: We investigate phrase-based statistical machine translation between English and Urdu, two Indo-European languages that differ significantly in their word-order preferences. Reordering of words and phrases is thus a necessary part of the translation process. While local reordering is modeled nicely by phrase-based systems, long-distance reordering is known to be a hard problem. We perform experiments using the Moses SMT system and discuss the reordering models available in Moses. We then present our novel, Urdu-aware, yet generalizable approach based on reordering phrases in the syntactic parse tree of the source English sentence. Our technique significantly improves the quality of English-Urdu translation with Moses, both in terms of BLEU score and of subjective human judgments.