首页    期刊浏览 2025年04月15日 星期二
登录注册

文章基本信息

  • 标题:Part-of-Speech Tagging via Deep Neural Networks for Northern-Ethiopic Languages
  • 本地全文:下载
  • 作者:Jurgita Kapočiūtė-Dzikienė ; Senait Gebremichael Tesfagergish
  • 期刊名称:Engineering Economics
  • 印刷版ISSN:2029-5839
  • 出版年度:2020
  • 卷号:49
  • 期号:4
  • 页码:482-494
  • DOI:10.5755/j01.itc.49.4.26808
  • 出版社:Kaunas University of Technology
  • 摘要:Deep Neural Networks (DNNs) have proven to be especially successful in the area of Natural Language Processing (NLP) and Part-Of-Speech (POS) tagging—which is the process of mapping words to their corresponding POS labels depending on the context. Despite recent development of language technologies, low-resourced languages (such as an East African Tigrinya language), have received too little attention. We investigate the effectiveness of Deep Learning (DL) solutions for the low-resourced Tigrinya language of the Northern-Ethiopic branch. We have selected Tigrinya as the testbed example and have tested state-of-the-art DL approaches seeking to build the most accurate POS tagger. We have evaluated DNN classifiers (Feed Forward Neural Network – FFNN, Long Short-Term Memory method – LSTM, Bidirectional LSTM, and Convolutional Neural Network – CNN) on a top of neural word2vec word embeddings with a small training corpus known as Nagaoka Tigrinya Corpus. To determine the best DNN classifier type, its architecture and hyper-parameter set both manual and automatic hyper-parameter tuning has been performed. BiLSTM method was proved to be the most suitable for our solving task: it achieved the highest accuracy equal to 92% that is 65% above the random baseline.
  • 关键词:Deep Learning;word2vec embeddings;part-of-speech tagging;natural language processing;computational linguistics;Tigrinya language.
国家哲学社会科学文献中心版权所有