首页    期刊浏览 2024年09月15日 星期日
登录注册

文章基本信息

  • 标题:One Solution of Extension of Mel-Frequency Cepstral Coefficients Feature Vector for Automatic Speaker Recognition
  • 其他标题:One Solution of Extension of Mel-Frequency Cepstral Coefficients Feature Vector for Automatic Speaker Recognition
  • 本地全文:下载
  • 作者:Ivan Jokić ; Stevan Jokić ; Vlado Delić
  • 期刊名称:European Integration Studies
  • 印刷版ISSN:2335-8831
  • 出版年度:2020
  • 卷号:49
  • 期号:2
  • 页码:224-236
  • DOI:10.5755/j01.itc.49.2.22258
  • 出版社:Kaunas University of Technology
  • 摘要:One extension of feature vector for automatic speaker recognition is considered in this paper. The starting feature vector consisted of 18 mel-frequency cepstral coefficients (MFCCs). Extension was done with two additional features derived from the spectrum of the speech signal. The main idea that generated this research is that it is possible to increase the efficiency of automatic speaker recognition by constructing a feature vector which tracks a real perceived spectrum in the observed speech. Additional features are based on the energy maximums in the appropriate frequency ranges of observed speech frames. In experiments, accuracy and equal error rate (EER) are compared in the case when feature vectors contain only 18 MFCCs and in cases when additional features are used. Recognition accuracy increased by around 3%. Values of EER show smaller differentiation but the results show that adding proposed additional features produced a lower decision threshold. These results indicate that tracking of real occurrences in the spectrum of the speech signal leads to more efficient automatic speaker recognizer. Determining features which track real occurrences in the speech spectrum will improve the procedure of automatic speaker recognition and enable avoiding complex models.
  • 关键词:Speaker recognition, spectrum, mel-frequency cepstral coefficients, energy, maximum.
国家哲学社会科学文献中心版权所有