文章基本信息

标题：A Novel Approach to Speech Enhancement Based on Deep Neural Networks
本地全文：下载
作者：SALEHI, M. ; MIRZAKUCHAKI, S.
期刊名称：Advances in Electrical and Computer Engineering
印刷版ISSN：1582-7445
电子版ISSN：1844-7600
出版年度：2022
卷号：22
期号：2
页码：71-78
DOI：10.4316/AECE.2022.02009
语种：English
出版社：Universitatea "Stefan cel Mare" Suceava
摘要：Minimum mean-square error (MMSE) approaches have been shown to achieve state-of-the-art performance on the task of speech enhancement. However, MMSE approaches lack the ability to accurately estimate non-stationary noise sources. In this paper, a long short-term memory fully convolutional network (LSTM-FCN) is utilized to accurately estimate a priori signal-to-noise ratio (SNR) since the speech enhancement performance of an MMSE approach improves with the accuracy of the used a priori SNR estimator. The proposed MMSE approach makes no assumptions about the characteristics of the noise or the speech. MMSE approaches that utilize the LSTM-FCN estimator are evaluated using the mean opinion score of the perceptual evaluation of speech quality (PESQ) and the short-time objective intelligibility (STOI) measures of speech. The experimental investigation shows that the speech enhancement performance of an MMSE approach that utilizes LSTM-FCN estimator significantly increases.
关键词：Index Terms—long short-term memory;machine learning; mean square error methods;recurrent neural networks;speech enhancement.