文章基本信息

标题：LITHUANIAN CONTINUOUS SPEECH CORPUS LRN 1: AN IMPROVEMENT
本地全文：下载
作者：Sigita Laurinčiukaitė ; Mark Filipovič ; Laimutis Telksnys 等
期刊名称：Public Policy And Administration
印刷版ISSN：2029-2872
出版年度：2015
卷号：38
期号：3
DOI：10.5755/j01.itc.38.3.12122
语种：English
出版社：Kaunas University of Technology
摘要：This paper presents the development of Lithuanian continuous speech corpus LRN 1 (Lithuanian Radio News, version 1). The corpus was developed from speech corpus LRN 0.1 by increasing the duration of speech corpus (it lasts 20 hours 50 minutes). The major improvement of speech corpus LRN 1 was a development of time-aligned word level annotations of speech signals. Time-aligned word level annotations of speech signals were obtained after a two-stage process: automatic realignment of acoustic models of phonemes and subsequent manual correction of annotations. The improvement of the corpus is useful for constructing and evaluating speaker-independent continuous speech recognition systems and for linguistic research.