Article information

  • Title: Neural architectures for gender detection and speaker identification
  • Authors: Orken Mamyrbayev; Alymzhan Toleu; Gulmira Tolegen
  • Journal: Cogent Engineering
  • Electronic ISSN: 2331-1916
  • Publication year: 2020
  • Volume: 7
  • Issue: 1
  • Pages: 1-13
  • DOI: 10.1080/23311916.2020.1727168
  • Publisher: Taylor and Francis Ltd
  • Abstract: In this paper, we investigate two neural architectures for gender detection and speaker identification tasks by utilizing Mel-frequency cepstral coefficient (MFCC) features, which do not cover the voice-related characteristics. One of our goals is to compare different neural architectures, multi-layer perceptrons (MLPs) and convolutional neural networks (CNNs), for both tasks with various settings and to learn the gender/speaker-specific features automatically. The experimental results reveal that the models using z-score and Gramian matrix transformation obtain better results than the models that only use max-min normalization of MFCC. In terms of training time, MLP requires more training epochs to converge than CNN. Other experimental results show that MLPs outperform CNNs for both tasks in terms of generalization errors.
  • Keywords: MLP; CNN; gender detection; speaker identification
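
The abstract contrasts max-min normalization of MFCC features with z-score normalization and a Gramian matrix transformation. The sketch below illustrates those three operations on a feature matrix, under several assumptions that are not taken from the paper: the MFCCs are stored as a NumPy array of shape (frames, coefficients), random numbers stand in for real MFCCs, statistics are computed per coefficient, and the "Gramian matrix" is taken to be the plain Gram matrix of the coefficient columns. The function names are hypothetical, and the authors' actual preprocessing may differ.

```python
import numpy as np


def min_max_normalize(x, eps=1e-8):
    """Rescale each coefficient (column) to roughly the range [0, 1]."""
    lo = x.min(axis=0, keepdims=True)
    hi = x.max(axis=0, keepdims=True)
    return (x - lo) / (hi - lo + eps)


def z_score_normalize(x, eps=1e-8):
    """Shift each coefficient to zero mean and scale it to unit variance."""
    mean = x.mean(axis=0, keepdims=True)
    std = x.std(axis=0, keepdims=True)
    return (x - mean) / (std + eps)


def gram_matrix(x):
    """Gram matrix of the coefficient columns, shape (n_coeffs, n_coeffs).

    Assumption: this is one plausible reading of "Gramian matrix
    transformation"; the paper may define it differently.
    """
    return x.T @ x


# Placeholder data: 200 frames of 13 coefficients (not real MFCCs).
rng = np.random.default_rng(0)
mfcc = rng.normal(size=(200, 13))

mm = min_max_normalize(mfcc)   # max-min normalized features
z = z_score_normalize(mfcc)    # z-score normalized features
g = gram_matrix(z)             # fixed-size 13 x 13 representation
```

One property of this reading of the Gram matrix is that its size depends only on the number of coefficients, not on the number of frames, so utterances of different lengths map to inputs of the same shape.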