
Article information

  • Title: Statistical Machine Translation of Croatian Weather Forecasts: How Much Data Do We Need?
  • Authors: Ljubešić, Nikola; Bago, Petra; Boras, Damir
  • Journal: Journal of Computing and Information Technology
  • Print ISSN: 1330-1136
  • Online ISSN: 1846-3908
  • Year of publication: 2010
  • Volume: 18
  • Issue: 4
  • Pages: 303-308
  • DOI: 10.2498/cit.1001917
  • Publisher: SRCE - Sveučilišni računski centar
  • Abstract: This research is the first step towards developing a system for translating Croatian weather forecasts into multiple languages. This step deals with the Croatian-English language pair. The parallel corpus is a one-year sample of weather forecasts for the Adriatic, comprising 7,893 sentence pairs. Evaluation is performed with the automatic evaluation measures BLEU, NIST and METEOR, as well as by manually evaluating a sample of 200 translations. We show that with a small training set and the state-of-the-art Moses system, decoding can be done with 96% accuracy with respect to adequacy and fluency. Additional improvement is expected from increasing the training set size. Finally, the correlation between the recorded evaluation measures is explored.
  • Keywords: statistical machine translation; automatic evaluation; manual evaluation; correlation between evaluation measures