期刊名称:The Prague Bulletin of Mathematical Linguistics
印刷版ISSN:0032-6585
电子版ISSN:1804-0462
出版年度:2011
卷号:96
期号:1
页码:49-58
DOI:10.2478/v10108-011-0010-5
语种:English
出版社:Walter de Gruyter GmbH
摘要:This paper describes Ncode, an open source statistical machine translation (SMT) toolkit for translation models estimated as n -gram language models of bilingual units ( tuples ). This toolkit includes tools for extracting tuples, estimating models and performing translation. It can be easily coupled to several other open source toolkits to yield a complete SMT pipeline. In this article, we review the main features of the toolkit and explain how to build a translation engine with Ncode. We also report a short comparison with the widely known Moses system. Results show that Ncode outperforms Moses in terms of memory requirements and translation speed. Ncode also achieves slightly higher accuracy results.