首页    期刊浏览 2024年10月06日 星期日
登录注册

文章基本信息

  • 标题:On Usable Speech Detection by Linear Multi-Scale Decomposition for Speaker Identification
  • 其他标题:On Usable Speech Detection by Linear Multi-Scale Decomposition for Speaker Identification
  • 本地全文:下载
  • 作者:Wajdi Ghezaiel ; Amel Ben Slimane ; Ezzedine Ben Braiek
  • 期刊名称:International Journal of Electrical and Computer Engineering
  • 电子版ISSN:2088-8708
  • 出版年度:2016
  • 卷号:6
  • 期号:6
  • 页码:2766-2772
  • DOI:10.11591/ijece.v6i6.pp2766-2772
  • 语种:English
  • 出版社:Institute of Advanced Engineering and Science (IAES)
  • 摘要:Usable speech is a novel concept of processing co-channel speech data. It is proposed to extract minimally corrupted speech that is considered useful for various speech processing systems. In this paper, we are interested for co-channel speaker identification (SID). We employ a new proposed usable speech extraction method based on the pitch information obtained from linear multi-scale decomposition by discrete wavelet transform. The idea is to retain the speech segments that have only one pitch detected and remove the others. Detected Usable speech was used as input for speaker identification system. The system is evaluated on co-channel speech and results show a significant improvement across various Target to Interferer Ratio (TIR) for speaker identification system.
  • 其他摘要:Usable speech is a novel concept of processing co-channel speech data. It is proposed to extract minimally corrupted speech that is considered useful for various speech processing systems. In this paper, we are interested for co-channel speaker identification (SID). We employ a new proposed usable speech extraction method based on the pitch information obtained from linear multi-scale decomposition by discrete wavelet transform. The idea is to retain the speech segments that have only one pitch detected and remove the others. Detected Usable speech was used as input for speaker identification system. The system is evaluated on co-channel speech and results show a significant improvement across various Target to Interferer Ratio (TIR) for speaker identification system.
  • 关键词:Signal processing; Speech processing;co-channel speech; Usable speech; Multi-scale decomposition; Discrete wavelet transform; Speaker identification
国家哲学社会科学文献中心版权所有