
Article Information

  • Title: Significance of Joint Features Derived from the Modified Group Delay Function in Speech Processing
  • Authors: Rajesh M. Hegde; Hema A. Murthy; V. R. R. Gadde
  • Journal: EURASIP Journal on Audio, Speech, and Music Processing
  • Print ISSN: 1687-4714
  • Electronic ISSN: 1687-4722
  • Publication year: 2007
  • Volume: 2007
  • DOI: 10.1155/2007/79032
  • Publisher: Hindawi Publishing Corporation
  • Abstract:

    This paper investigates the significance of combining cepstral features derived from the modified group delay function with features derived from the short-time spectral magnitude, such as the MFCC. The conventional group delay function fails to capture the resonant structure and the dynamic range of the speech spectrum, primarily because pitch periodicity effects introduce spikes. The group delay function is modified to suppress these spikes and to restore the dynamic range of the speech spectrum. Cepstral features derived from the modified group delay function are called the modified group delay feature (MODGDF). The complementarity and robustness of the MODGDF relative to the MFCC are also analyzed using spectral reconstruction techniques. The combination of several spectral magnitude-based features with the MODGDF, using feature fusion and likelihood combination, is described. These features are then used for three speech processing tasks, namely syllable, speaker, and language recognition. Results indicate that combining the MODGDF with the MFCC at the feature level gives significant improvements for speech recognition tasks in noise. Combining the MODGDF with the spectral magnitude-based features increases recognition performance significantly, by up to 11%, while combining any two features derived from the spectral magnitude does not give any significant improvement.
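As a rough illustration of the conventional group delay function mentioned in the abstract, the sketch below computes it directly from a signal using the standard identity τ(k) = (X_R·Y_R + X_I·Y_I) / |X(k)|², where X is the DFT of x[n] and Y is the DFT of n·x[n]. The abstract does not give the authors' modification, so none is attempted here. The function names `dft` and `group_delay` and the `eps` guard are assumptions made for this example and are not taken from the paper.

```python
import cmath
import math


def dft(x):
    """Naive O(N^2) discrete Fourier transform (fine for short illustrative frames)."""
    n_points = len(x)
    return [
        sum(x[n] * cmath.exp(-2j * math.pi * k * n / n_points) for n in range(n_points))
        for k in range(n_points)
    ]


def group_delay(x, eps=1e-12):
    """Conventional group delay, in samples, at each DFT bin.

    tau(k) = (X_R*Y_R + X_I*Y_I) / |X(k)|^2, with X = DFT(x[n]) and Y = DFT(n*x[n]).
    `eps` is a hypothetical guard against division by zero, not part of the identity.
    """
    X = dft(x)
    Y = dft([n * v for n, v in enumerate(x)])
    return [
        (Xk.real * Yk.real + Xk.imag * Yk.imag) / max(abs(Xk) ** 2, eps)
        for Xk, Yk in zip(X, Y)
    ]


# Sanity check: a unit impulse delayed by 3 samples has a flat group delay of 3.
delayed_impulse = [0.0] * 8
delayed_impulse[3] = 1.0
tau = group_delay(delayed_impulse)
```

The division by |X(k)|² is the point at which very large values can appear whenever the spectral magnitude is close to zero, which is one way to see how a group delay curve can become spiky.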
