Journal title: BVICAM's International Journal of Information Technology
Print ISSN: 0973-5658
Publication year: 2013
Volume: 5
Issue: 1
Language: English
Publisher: Bharati Vidyapeeth's Institute of Computer Applications and Management
Abstract: Speech recognition is the ability of a machine or program to convert spoken words into their equivalent text form. Most current recognition systems use Hidden Markov Models (HMMs) to model spoken utterances. In this paper, we implement two speaker-independent speech recognition systems that cover all the words required for dialing a phone. Each system has a vocabulary of 42 words, including the digits zero to nine and the names of 20 persons, and each was trained on a total of 16,800 utterances. Both systems recognize continuous speech and were implemented in HTK using monophone and triphone models. Experimental results show an accuracy of 74.11% for the monophone-based models and 93.77% for the triphone-based models.
Keywords: HMM; HTK; Monophones; Triphones; Mel Frequency Cepstral Coefficient (MFCC).