文章基本信息

标题：Robust Speech Detection using SEM and SFN
本地全文：下载
作者：In-Sung Han ; Chan-Shik Ahn
期刊名称：International Journal of Multimedia and Ubiquitous Engineering
印刷版ISSN：1975-0080
出版年度：2014
卷号：9
期号：9
页码：61-68
DOI：10.14257/ijmue.2014.9.9.07
出版社：SERSC
摘要：Speech recognition, the problem of performance degradation is the difference between the model training and recognition environments. Silence features normalized using the method as a way to reduce the inconsistency of such an environment. Silence features normalized way of existing in the low signal-to-noise ratio. Increase the energy level of the silence interval for speech and non-speech classification accuracy due to the falling. There is a problem in the recognition performance is degraded. This paper proposed a robust speech detection method in noisy environments using a SFN (silence feature normalization) and SEM (speech energy maximize). In the high signal-to-noise ratio for the proposed method was used to maximize the characteristics receive less characterized the effects of noise by the speech energy. Cepstral feature distribution of speech and non-speech characteristics in the low signal-to- noise ratio and improves the recognition performance. Result of the recognition experiment, recognition performance improved compared to the conventional method.
关键词：Speech Recognition; Voice Detection; Noise Reduction; Speech Energy ; Maximization; Silence Feature Normalization