首页    期刊浏览 2024年09月18日 星期三
登录注册

文章基本信息

  • 标题:Empirical Investigation of Optimization Algorithms in Neural Machine Translation
  • 本地全文:下载
  • 作者:Parnia Bahar ; Tamer Alkhouli ; Jan-Thorsten Peter
  • 期刊名称:The Prague Bulletin of Mathematical Linguistics
  • 印刷版ISSN:0032-6585
  • 电子版ISSN:1804-0462
  • 出版年度:2017
  • 卷号:108
  • 期号:1
  • 页码:13-25
  • DOI:10.1515/pralin-2017-0005
  • 语种:English
  • 出版社:Walter de Gruyter GmbH
  • 摘要:Training neural networks is a non-convex and a high-dimensional optimization problem. In this paper, we provide a comparative study of the most popular stochastic optimization techniques used to train neural networks. We evaluate the methods in terms of convergence speed, translation quality, and training stability. In addition, we investigate combinations that seek to improve optimization in terms of these aspects. We train state-of-the-art attention-based models and apply them to perform neural machine translation. We demonstrate our results on two tasks: WMT 2016 En→Ro and WMT 2015 De→En.
国家哲学社会科学文献中心版权所有