首页    期刊浏览 2024年11月07日 星期四
登录注册

文章基本信息

  • 标题:Gender identification for Egyptian Arabic dialect in twitter using deep learning models
  • 本地全文:下载
  • 作者:Shereen ElSayed ; Mona Farouk
  • 期刊名称:Egyptian Informatics Journal
  • 印刷版ISSN:1110-8665
  • 出版年度:2020
  • 卷号:21
  • 期号:3
  • 页码:159-167
  • DOI:10.1016/j.eij.2020.04.001
  • 出版社:Elsevier
  • 摘要:Although the number of Arabic language writers in social media is increasing, the research work targeting Author Profiling (AP) is at the initial development phase. This paper investigates Gender Identification (GI) (male or female) of authors posting Egyptian dialect tweets using Neural Networks (NN) models. Various architectures of NN are explored with extensive parameters’ selection such as simple Artificial Neural Network (ANN), Convolutional Neural Network (CNN), Long–Short Term Memory (LSTM), Convolutional Bidirectional Long-Short Term Memory (C-Bi-LSTM) and Convolutional Bidirectional Gated Recurrent Units (C-Bi-GRU) NN which is tuned for the GI problem at hand. The best acquired GI accuracy using C-Bi-GRU multichannel model is 91.37%. It is worth noting that the presence of the bidirectional layer as well as the convolutional layer in the NN models has significantly enhanced the GI accuracy.
  • 关键词:Gender identification ; Egyptian Arabic text classification ; Deep learning ; Natural language processing ; Social Media analysis and mining
国家哲学社会科学文献中心版权所有