Article information

  • Title: A Parallel Corpus for Evaluating Machine Translation between Arabic and European Languages
  • Authors: Nizar Habash; Nasser Zalmout; Dima Taji
  • Venue: Conference of the European Chapter of the Association for Computational Linguistics (EACL)
  • Publication year: 2017
  • Volume: 2017
  • Pages: 235-241
  • Language: English
  • Publisher: ACL Anthology
  • Abstract: We present Arab-Acquis, a large publicly available dataset for evaluating machine translation between 22 European languages and Arabic. Arab-Acquis consists of over 12,000 sentences from the JRC-Acquis (Acquis Communautaire) corpus translated twice by professional translators, once from English and once from French, and totaling over 600,000 words. The corpus follows previous data splits in the literature for tuning, development, and testing. We describe the corpus and how it was created. We also present the first benchmarking results on translating to and from Arabic for 22 European languages.