首页    期刊浏览 2024年09月22日 星期日
登录注册

文章基本信息

  • 标题:The convolutional neural networks for Amazigh speech recognition syste
  • 本地全文:下载
  • 作者:Meryam Telmem ; Youssef Ghanou
  • 期刊名称:TELKOMNIKA (Telecommunication Computing Electronics and Control)
  • 印刷版ISSN:2302-9293
  • 出版年度:2021
  • 卷号:19
  • 期号:2
  • DOI:10.12928/telkomnika.v19i2.16793
  • 语种:English
  • 出版社:Universitas Ahmad Dahlan
  • 摘要:In this paper, we present an approach based on convolutional neural networks to build an automatic speech recognition system for the Amazigh language. This system is built with TensorFlow and uses mel frequency cepstral coefficient (MFCC) to extract features. In order to test the effect of the speaker's gender and age on the accuracy of the model, the system was trained and tested on several datasets. The first experiment the dataset consists of 9240 audio files. The second experiment the dataset consists of 9240 audio files distributed between females and males’ speakers. The last experiment 3 the dataset consists of 13860 audio files distributed between age 9-15, age 16-30, and age 30+. The result shows that the model trained on a dataset of adult speaker’s age +30 categories generates the best accuracy with 93.9%.
  • 关键词:Amazigh language;convolutional neural network;deep learning;mel frequency cepstral coefficient;spectrogram;speech recognition
国家哲学社会科学文献中心版权所有