文章基本信息

标题：韻律に寄与する音響特徴量を用いた聞きやすい高速話速変換技術
本地全文：下载
作者：今井篤 ; 田澤直幸 ; 岩鼻幸男等
期刊名称：映像情報メディア学会誌
印刷版ISSN：1342-6907
电子版ISSN：1881-6908
出版年度：2012
卷号：66
期号：7
页码：J214-J220
DOI：10.3169/itej.66.J214
出版社：The Institute of Image Information and Television Engineers
摘要：We have developed an intelligible high-speed speech rate conversion technology using the acoustic feature quantities that contribute to prosody. In contrast to the conventional method, which plays back accelerated speech at the same uniform rate from the beginning to end, our proposed approach varies the playback rate adaptively on the basis of acoustic detection of the position of an utterance and any fluctuations in a speaker's fundamental frequency (F0) and power. In so doing, we hope to make high-speed playback easier to listen to by providing the listener with a "slowed-down" playback effect. Since this approach converts speech rate using just the acoustic features of audio data, it can be applied to not only Japanese but other languages as well. While the algorithm we developed in this study is optimized for the Japanese language, we aim to implement the proposed approach in a wider array of commercial devices and customize the technology to various languages.
关键词：高速音声;話速変換;速聴;視覚障害者;DAISY;オーディオブック