
Article Information

  • Title: Auditory Sparse Representation for Robust Speaker Recognition Based on Tensor Structure
  • Authors: Qiang Wu; Liqing Zhang
  • Journal: EURASIP Journal on Audio, Speech, and Music Processing
  • Print ISSN: 1687-4714
  • Electronic ISSN: 1687-4722
  • Publication year: 2008
  • Volume: 2008
  • DOI:10.1155/2008/578612
  • Publisher: Hindawi Publishing Corporation
  • Abstract:

    This paper investigates the problem of speaker recognition in noisy conditions. A new approach called nonnegative tensor principal component analysis (NTPCA) with a sparse constraint is proposed for speech feature extraction. We encode speech as a general higher-order tensor in order to extract discriminative features in the spectrotemporal domain. First, speech signals are represented by cochlear features based on the frequency selectivity characteristics of the basilar membrane and inner hair cells; then, low-dimensional sparse features are extracted by NTPCA for robust speaker modeling. The useful information of each subspace in the higher-order tensor can be preserved. An alternating projection algorithm is used to obtain a stable solution. Experimental results demonstrate that our method can increase recognition accuracy, particularly in noisy environments.
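
The abstract names an alternating projection scheme over a higher-order tensor but gives no update rules. The following is only a rough sketch of the general idea, not the authors' NTPCA: it alternates over the modes of a tensor, refits one projection matrix at a time from a truncated SVD of the partially projected tensor, and then imposes nonnegativity and sparsity by simple clipping and thresholding. The function names (`unfold`, `project`, `alternating_projection`), the `sparsity` threshold, and the clipping heuristic are all assumptions introduced here for illustration.

```python
import numpy as np


def unfold(tensor, mode):
    """Mode-n unfolding: bring `mode` to the front and flatten the other axes."""
    return np.moveaxis(tensor, mode, 0).reshape(tensor.shape[mode], -1)


def project(tensor, factors, skip=None):
    """Multiply the tensor by U_n^T along every mode except `skip`."""
    out = tensor
    for mode, U in enumerate(factors):
        if mode == skip:
            continue
        # Contract axis `mode` with the rows of U, then put the new axis back.
        out = np.moveaxis(np.tensordot(U.T, out, axes=(1, mode)), 0, mode)
    return out


def alternating_projection(tensor, ranks, n_iter=20, sparsity=0.0, seed=0):
    """Heuristic nonnegative, sparse multilinear projection (illustrative only)."""
    rng = np.random.default_rng(seed)
    factors = []
    for dim, r in zip(tensor.shape, ranks):
        U = np.abs(rng.standard_normal((dim, r)))
        factors.append(U / np.linalg.norm(U, axis=0))

    for _ in range(n_iter):
        for mode in range(tensor.ndim):
            # Hold the other modes fixed and refit this mode's projection.
            partial = unfold(project(tensor, factors, skip=mode), mode)
            U, _, _ = np.linalg.svd(partial, full_matrices=False)
            U = U[:, :ranks[mode]]
            # Singular vectors have arbitrary sign; orient each column so
            # that its entries sum to a nonnegative value.
            signs = np.sign(U.sum(axis=0))
            signs[signs == 0] = 1.0
            U = U * signs
            # Assumed heuristic: subtract the threshold and clip at zero,
            # which gives nonnegative and (for sparsity > 0) sparse columns.
            U = np.maximum(U - sparsity, 0.0)
            norms = np.linalg.norm(U, axis=0)
            norms[norms == 0] = 1.0  # leave all-zero columns as they are
            factors[mode] = U / norms

    core = project(tensor, factors)  # low-dimensional feature tensor
    return factors, core
```

As a usage example under the same assumptions, a nonnegative array of shape `(6, 5, 4)` with `ranks=(3, 3, 2)` yields three nonnegative projection matrices and a `(3, 3, 2)` core tensor, which would play the role of the low-dimensional feature. In a speech setting the input axes might be frequency channel, time frame, and utterance, but that layout is a guess rather than something the abstract states.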
