文章基本信息

标题：LITHUANIAN CONTINUOUS SPEECH CORPUS LRN 0.1: DESIGN AND POTENTIAL APPLICATIONS
本地全文：下载
作者：Sigita Laurinčiukaitė ; Darius Šilingas ; Mantas Skripkauskas 等
期刊名称：European Integration Studies
印刷版ISSN：2335-8831
出版年度：2015
卷号：35
期号：4
DOI：10.5755/j01.itc.35.4.11785
语种：English
出版社：Kaunas University of Technology
摘要：This paper presents design, development and contents of Lithuanian continuous speech corpus LRN 0.1 (Lithuanian Radio News, prototype-version 0.1). The corpus contains 17 hours 23 minutes of records from radio broad-cast news read by 31 speakers. The recorded material is segmented into sentence-length records that are divided into training, development, and evaluation sets. Speech recordings are accompanied by word level transcriptions and auto-matically generated word-to-phone lexicon. The corpus is designed for the constructing and evaluating speaker-inde-pendent continuous speech recognition systems, and may also be used for linguistic research.