Article information

  • Title: Mixed-Level Neural Machine Translation
  • Authors: Thien Nguyen; Huu Nguyen; Phuoc Tran
  • Journal: Computational Intelligence and Neuroscience
  • Print ISSN: 1687-5265
  • Electronic ISSN: 1687-5273
  • Publication year: 2020
  • Volume: 2020
  • Pages: 1-7
  • DOI: 10.1155/2020/8859452
  • Publisher: Hindawi Publishing Corporation
  • Abstract: In building the first Russian-Vietnamese neural machine translation system, we faced the problem of choosing the translation unit system on which the source and target embeddings are based. Existing homogeneous translation unit systems, which use the same translation unit on the source and target sides, do not suit this language pair well. To address this, we propose a novel heterogeneous translation unit system that takes into account the linguistic characteristics of Russian, a synthetic language, and Vietnamese, an analytic language. Specifically, we lower the embedding level on the source side by splitting tokens into subtokens, and raise the embedding level on the target side by merging neighboring tokens into supertokens. Experimental results show that the proposed heterogeneous system improves over the best existing homogeneous Russian-Vietnamese translation system by 1.17 BLEU. Our approach could be applied to building translation bots for other language pairs with differing linguistic characteristics.
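
The two preprocessing directions described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not say which segmentation tools were used, so the greedy longest-match splitter, the lexicon-based merger, the `@@` continuation marker, the `_` joiner, and the tiny vocabulary and lexicon below are all assumptions chosen for illustration.

```python
def split_into_subtokens(token, subword_vocab):
    """Source side: split one token into subtokens (hypothetical greedy
    longest-match; falls back to single characters when nothing matches).
    Non-final pieces get an '@@' suffix so the split can be undone."""
    pieces = []
    start = 0
    while start < len(token):
        end = len(token)
        # Shrink the candidate until it is in the vocabulary or one char long.
        while end > start + 1 and token[start:end] not in subword_vocab:
            end -= 1
        pieces.append(token[start:end])
        start = end
    return [p + "@@" for p in pieces[:-1]] + pieces[-1:]


def merge_into_supertokens(tokens, multiword_lexicon, max_len=3):
    """Target side: merge neighboring tokens into one supertoken when the
    sequence appears in a (hypothetical) lexicon of multi-syllable words."""
    merged = []
    i = 0
    while i < len(tokens):
        # Try the longest window first; a single token always succeeds.
        for n in range(min(max_len, len(tokens) - i), 0, -1):
            candidate = tuple(tokens[i:i + n])
            if n == 1 or candidate in multiword_lexicon:
                merged.append("_".join(candidate))
                i += n
                break
    return merged


# Toy resources, assumed for illustration only.
russian_subwords = {"перевод", "чик"}
vietnamese_lexicon = {("sinh", "viên")}

source_units = split_into_subtokens("переводчик", russian_subwords)
target_units = merge_into_supertokens(["tôi", "là", "sinh", "viên"],
                                      vietnamese_lexicon)
print(source_units)
print(target_units)
```

In this sketch the Russian word is broken into smaller units while the Vietnamese syllables "sinh" and "viên" are joined into a single unit, so the two sides of the model would use embeddings at different levels, which is the heterogeneous setup the abstract describes.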