首页    期刊浏览 2024年09月18日 星期三
登录注册

文章基本信息

  • 标题:Classification-Based Approach for Hybridizing Statistical and Rule-Based Machine Translation
  • 本地全文:下载
  • 作者:Park, Eun-Jin ; Kwon, Oh-Woog ; Kim, Kangil
  • 期刊名称:ETRI Journal
  • 印刷版ISSN:1225-6463
  • 电子版ISSN:2233-7326
  • 出版年度:2015
  • 卷号:37
  • 期号:3
  • 页码:541-550
  • DOI:10.4218/etrij.15.0114.1017
  • 语种:English
  • 出版社:Electronics and Telecommunications Research Institute
  • 摘要:In this paper, we propose a classification-based approach for hybridizing statistical machine translation and rulebased machine translation. Both the training dataset used in the learning of our proposed classifier and our feature extraction method affect the hybridization quality. To create one such training dataset, a previous approach used auto-evaluation metrics to determine from a set of component machine translation (MT) systems which gave the more accurate translation (by a comparative method). Once this had been determined, the most accurate translation was then labelled in such a way so as to indicate the MT system from which it came. In this previous approach, when the metric evaluation scores were low, there existed a high level of uncertainty as to which of the component MT systems was actually producing the better translation. To relax such uncertainty or error in classification, we propose an alternative approach to such labeling; that is, a cut-off method. In our experiments, using the aforementioned cut-off method in our proposed classifier, we managed to achieve a translation accuracy of 81.5% - a 5.0% improvement over existing methods.
  • 关键词:Machine translation;hybrid machine translation;automatic labeling;rule-based machine translation;statistical machine translation
国家哲学社会科学文献中心版权所有