首页    期刊浏览 2024年07月07日 星期日
登录注册

文章基本信息

  • 标题:Language Model Adaptation Using Machine-Translated Text for Resource-Deficient Languages
  • 本地全文:下载
  • 作者:Arnar Thor Jensson ; Koji Iwano ; Sadaoki Furui
  • 期刊名称:EURASIP Journal on Audio, Speech, and Music Processing
  • 印刷版ISSN:1687-4714
  • 电子版ISSN:1687-4722
  • 出版年度:2008
  • 卷号:2008
  • DOI:10.1155/2008/573832
  • 出版社:Hindawi Publishing Corporation
  • 摘要:

    Text corpus size is an important issue when building a language model (LM). This is a particularly important issue for languages where little data is available. This paper introduces an LM adaptation technique to improve an LM built using a small amount of task-dependent text with the help of a machine-translated text corpus. Icelandic speech recognition experiments were performed using data, machine translated (MT) from English to Icelandic on a word-by-word and sentence-by-sentence basis. LM interpolation using the baseline LM and an LM built from either word-by-word or sentence-by-sentence translated text reduced the word error rate significantly when manually obtained utterances used as a baseline were very sparse.

国家哲学社会科学文献中心版权所有