Article Information

  • Title: Performance Analysis of Statistical Approaches and NMF Approaches for Speech Enhancement
  • Authors: Ravi Kumar Kandagatla; P V Subbaiah
  • Journal: International Journal of Image, Graphics and Signal Processing
  • Print ISSN: 2074-9074
  • Online ISSN: 2074-9082
  • Year: 2019
  • Volume: 11
  • Issue: 7
  • Pages: 9-38
  • DOI: 10.5815/ijigsp.2019.07.02
  • Publisher: MECS Publisher
  • Abstract: Super-Gaussian based Bayesian estimators play a significant role in noise reduction. However, traditional Bayesian estimators process only the DFT spectral amplitude of noisy speech and leave the phase unprocessed. Taking phase information into account when deriving Bayesian estimators improves the results. The main objective of this paper is twofold. First, Super-Gaussian based Bayesian estimators of Complex speech coefficients given Uncertain Phase (CUP) are compared under different noise conditions: white noise, babble noise, pink noise, modulated pink noise, factory noise, car noise, street noise, F16 noise and M109 noise. Second, a novel speech enhancement method is proposed that combines CUP estimators with different NMF approaches and an online bases update. Statistical estimators are less effective when the noise is completely non-stationary, whereas Non-negative Matrix Factorization (NMF) based algorithms perform better for non-stationary noise. The drawback of NMF is that it requires training and/or clean speech and noise signals. This drawback can be overcome by combining the advantages of statistical approaches and NMF approaches. Such approaches, namely Posteriori Regularized NMF (PR-NMF), Weibull Rayleigh NMF (WR-NMF), Nakagami Rayleigh NMF (NR-NMF), the CUP estimator with Gamma and Generalized Gamma distributions + NMF + Online bases Update (CUP-GG + NMF + OU), and CUP-GG + WR-NMF / NR-NMF + OU, are considered for comparison. Overall, the paper analyzes the performance of speech enhancement methods based on Bayesian estimators, NMF approaches, and combinations of statistical and NMF approaches. The objective performance measures used for comparison are Perceptual Evaluation of Speech Quality (PESQ), Short-Time Objective Intelligibility (STOI), Signal to Noise Ratio (SNR), Signal to Distortion Ratio (SDR) and Segmental SNR (Seg SNR).
  • Keywords: Non-Negative Matrix Factorization; CUP Estimator; Noise Reduction; PESQ