
Article Information

  • Title: Robust Automatic Speech Recognition Features using Complex Wavelet Packet Transform Coefficients
  • Authors: Tjong Wan Sen; Bambang Riyanto Trilaksono; Arry Akhmad Arman
  • Journal: Journal of ICT Research and Applications
  • Print ISSN: 2337-5787
  • Electronic ISSN: 2338-5499
  • Publication year: 2009
  • Volume: 3
  • Issue: 2
  • Pages: 123-134
  • Language: English
  • Publisher: Institut Teknologi Bandung
  • Abstract: To improve the performance of phoneme-based Automatic Speech Recognition (ASR) in noisy environments, we developed a new technique that adds robustness to clean phoneme features. These robust features are obtained from Complex Wavelet Packet Transform (CWPT) coefficients. Since the CWPT coefficients represent all the different frequency bands of the input signal, decomposing the input signal into a complete CWPT tree also covers all frequencies involved in the recognition process. For time-overlapping signals with different frequency content, e.g. a phoneme signal with noise, the CWPT coefficients are the combination of the CWPT coefficients of the phoneme signal and the CWPT coefficients of the noise. The CWPT coefficients of the phoneme signal therefore change according to the frequency components contained in the noise. Since the number of phonemes in every language is relatively small (limited) and already well known, one can easily derive principal component vectors from a clean training dataset using Principal Component Analysis (PCA). These principal component vectors can then be used to add robustness and minimize noise effects in the testing phase. Simulation results, using Alpha Numeric 4 (AN4) from Carnegie Mellon University and NOISEX-92 examples from Rice University, showed that this new technique can be used as a feature extractor that improves the robustness of phoneme-based ASR systems in various adverse noisy conditions while still preserving performance in clean environments.
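
The PCA step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the CWPT feature extraction is not shown, the feature matrices are synthetic stand-ins generated at random, and the function names `fit_principal_components` and `project_onto_components`, the feature dimension (16), and the number of components (3) are all assumptions chosen for illustration. The sketch only shows the general idea of deriving principal component vectors from clean training features and projecting noisy test features onto them.

```python
import numpy as np


def fit_principal_components(clean_features, n_components):
    """Return (mean, components) estimated from clean training features.

    clean_features: array of shape (n_frames, n_dims), one feature vector per row.
    """
    mean = clean_features.mean(axis=0)
    centered = clean_features - mean
    # Rows of vt are the principal directions, ordered by decreasing variance.
    _, _, vt = np.linalg.svd(centered, full_matrices=False)
    return mean, vt[:n_components]


def project_onto_components(features, mean, components):
    """Project features onto the clean subspace and map them back."""
    scores = (features - mean) @ components.T
    return scores @ components + mean


rng = np.random.default_rng(0)

# Hypothetical stand-in for clean features: 200 frames of 16-dimensional
# vectors that lie in a 3-dimensional subspace (NOT real CWPT coefficients).
basis = rng.normal(size=(3, 16))
clean = rng.normal(size=(200, 3)) @ basis

# Hypothetical noisy test features: the clean features plus additive noise.
noisy = clean + 0.1 * rng.normal(size=clean.shape)

mean, components = fit_principal_components(clean, n_components=3)
denoised = project_onto_components(noisy, mean, components)

err_before = np.linalg.norm(noisy - clean)
err_after = np.linalg.norm(denoised - clean)
print(f"error before projection: {err_before:.3f}")
print(f"error after projection:  {err_after:.3f}")
```

In this toy setting the projection discards the noise components that fall outside the subspace spanned by the clean data, which is one plausible reading of how principal component vectors could "minimize noise effects"; how the paper actually combines them with CWPT coefficients is not specified in the abstract.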