Article Information

  • Title: Ncode: an Open Source Bilingual N-gram SMT Toolkit
  • Authors: Josep Crego; François Yvon; José Mariño
  • Journal: The Prague Bulletin of Mathematical Linguistics
  • Print ISSN: 0032-6585
  • Electronic ISSN: 1804-0462
  • Year of publication: 2011
  • Volume: 96
  • Issue: 1
  • Pages: 49-58
  • DOI: 10.2478/v10108-011-0010-5
  • Language: English
  • Publisher: Walter de Gruyter GmbH
  • Abstract: This paper describes Ncode, an open source statistical machine translation (SMT) toolkit for translation models estimated as n-gram language models of bilingual units (tuples). This toolkit includes tools for extracting tuples, estimating models and performing translation. It can be easily coupled to several other open source toolkits to yield a complete SMT pipeline. In this article, we review the main features of the toolkit and explain how to build a translation engine with Ncode. We also report a short comparison with the widely known Moses system. Results show that Ncode outperforms Moses in terms of memory requirements and translation speed. Ncode also achieves slightly higher accuracy results.
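
The abstract's central idea, a translation model estimated as an n-gram language model over bilingual tuples, can be illustrated with a minimal sketch. This is not Ncode's implementation: the toy corpus, the function names `train_bigram` and `log_prob`, the bigram order, and the add-one smoothing are all assumptions chosen for brevity. Each tuple pairs a source-side segment with its target-side segment, and a sentence pair is scored as a sequence of such tuples.

```python
import math
from collections import Counter

# Hypothetical toy data: each sentence pair is already segmented into
# bilingual tuples of the form (source segment, target segment).
corpus = [
    [("we", "nous"), ("want", "voulons"), ("translations", "des traductions")],
    [("we", "nous"), ("want", "voulons"), ("perfect", "parfaites")],
]

# Sentence boundary markers, represented as tuples for uniformity.
BOS = ("<s>", "<s>")
EOS = ("</s>", "</s>")


def train_bigram(sequences):
    """Count tuple histories and tuple bigrams over padded sequences."""
    history_counts = Counter()
    bigram_counts = Counter()
    vocab = set()
    for seq in sequences:
        padded = [BOS] + list(seq) + [EOS]
        vocab.update(padded)
        for prev, cur in zip(padded, padded[1:]):
            history_counts[prev] += 1
            bigram_counts[(prev, cur)] += 1
    return history_counts, bigram_counts, vocab


def log_prob(seq, history_counts, bigram_counts, vocab):
    """Add-one smoothed bigram log-probability of a tuple sequence."""
    padded = [BOS] + list(seq) + [EOS]
    v = len(vocab)
    total = 0.0
    for prev, cur in zip(padded, padded[1:]):
        numerator = bigram_counts[(prev, cur)] + 1
        denominator = history_counts[prev] + v
        total += math.log(numerator / denominator)
    return total


history_counts, bigram_counts, vocab = train_bigram(corpus)
seen = corpus[0]
reordered = list(reversed(corpus[0]))
seen_score = log_prob(seen, history_counts, bigram_counts, vocab)
reordered_score = log_prob(reordered, history_counts, bigram_counts, vocab)
```

Under these assumptions, a tuple sequence observed in training (`seen_score`) scores higher than the same tuples in reversed order (`reordered_score`), which is how a tuple n-gram model captures both the translation choice and its local context in a single probability.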