Article Information

  • Title: Better Neural Machine Translation by Extracting Linguistic Information from BERT
  • Authors: Hassan S. Shavarani; Anoop Sarkar
  • Journal: Conference on European Chapter of the Association for Computational Linguistics (EACL)
  • Year: 2021
  • Volume: 2021
  • Pages: 2772-2783
  • DOI: 10.18653/v1/2021.eacl-main.241
  • Language: English
  • Publisher: ACL Anthology
  • Abstract: Adding linguistic information (syntax or semantics) to neural machine translation (NMT) has mostly focused on using point estimates from pre-trained models. Directly using the capacity of massive pre-trained contextual word embedding models such as BERT (Devlin et al., 2019) has been marginally useful in NMT because effective fine-tuning is difficult to obtain for NMT without making training brittle and unreliable. We augment NMT by extracting dense fine-tuned vector-based linguistic information from BERT instead of using point estimates. Experimental results show that our method of incorporating linguistic information helps NMT to generalize better in a variety of training contexts and is no more difficult to train than conventional Transformer-based NMT.