Journal: International Journal of Image, Graphics and Signal Processing
Print ISSN: 2074-9074
Electronic ISSN: 2074-9082
Publication year: 2019
Volume: 11
Issue: 11
Pages: 36-42
DOI: 10.5815/ijigsp.2019.11.05
Publisher: MECS Publisher
Abstract: In this work, a 5-state left-to-right HMM-based Bangla isolated-word speech recognizer has been developed. To train and test the recognizer, a small corpus was recorded at various sampling frequencies in both noisy and noiseless environments. The number of filter banks is varied during the feature extraction phase for both MFCC and PLP. The effects of the 2nd and 3rd differential coefficients have also been observed. Experimental results show that MFCC-based feature extraction performs better in the CLASSROOM environment, whereas PLP-based feature extraction performs better both in a noiseless environment and when AC or FAN noise is present. We have also noticed that a higher sampling frequency and a higher filter order do not always improve performance.
Keywords: MFCC; PLP; Clean and Noisy Environment; Different Sampling Rate; Different number of filter banks; HMM; Bangla ASR
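
Illustrative note: the abstract mentions a 5-state left-to-right HMM topology. The following is a minimal sketch of what such a transition matrix can look like, not the authors' configuration. The function name `left_to_right_transitions`, the `stay_prob` value of 0.5, and the absence of skip transitions are all assumptions made for illustration; the record does not state how the model was initialized.

```python
def left_to_right_transitions(n_states=5, stay_prob=0.5):
    """Build a row-stochastic transition matrix for a left-to-right HMM.

    Assumption (not from the paper): each state either stays where it is
    with probability stay_prob or advances to the next state; no skips.
    """
    if n_states < 1:
        raise ValueError("n_states must be at least 1")
    if not 0.0 <= stay_prob <= 1.0:
        raise ValueError("stay_prob must lie in [0, 1]")

    matrix = [[0.0] * n_states for _ in range(n_states)]
    for i in range(n_states - 1):
        matrix[i][i] = stay_prob            # self-loop
        matrix[i][i + 1] = 1.0 - stay_prob  # advance to the next state
    matrix[-1][-1] = 1.0                    # final state is absorbing
    return matrix


A = left_to_right_transitions()
for row in A:
    print(row)
```

Every entry below the diagonal is zero, which is what makes the model left-to-right: once the recognizer leaves a state, it cannot return to it.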