首页    期刊浏览 2024年11月27日 星期三
登录注册

文章基本信息

  • 标题:Selection of Suitable Features for Modeling the Durations of Syllables
  • 本地全文:下载
  • 作者:Krothapalli S. Rao ; Shashidhar G. Koolagudi
  • 期刊名称:Journal of Software Engineering and Applications
  • 印刷版ISSN:1945-3116
  • 电子版ISSN:1945-3124
  • 出版年度:2010
  • 卷号:3
  • 期号:12
  • 页码:1107-1117
  • DOI:10.4236/jsea.2010.312129
  • 出版社:Scientific Research Publishing
  • 摘要:Acoustic analysis and synthesis experiments have shown that duration and intonation patterns are the two most important prosodic features responsible for the quality of synthesized speech. In this paper a set of features are proposed which will influence the duration patterns of the sequence of the sound units. These features are derived from the results of the duration analysis. Duration analysis provides a rough estimate of features, which affect the duration patterns of the sequence of the sound units. But, the prediction of durations from these features using either linear models or with a fixed rulebase is not accurate. From the analysis it is observed that there exists a gross trend in durations of syllables with respect to syllable position in the phrase, syllable position in the word, word position in the phrase, syllable identity and the context of the syllable (preceding and the following syllables). These features can be further used to predict the durations of the syllables more accurately by exploring various nonlinear models. For analying the durations of sound units, broadcast news data in Telugu is used as the speech corpus. The prediction accuracy of the duration models developed using rulebases and neural networks is evaluated using the objective measures such as percentage of syllables predicted within the specified deviation, average prediction error (µ), standard deviation (σ) and correlation coefficient (γ).
  • 关键词:Prosody; Syllable Duration; Syllable Position; Syllable Context; Syllable Identity; Feed Forward Neural Network
国家哲学社会科学文献中心版权所有