首页    期刊浏览 2024年10月06日 星期日
登录注册

文章基本信息

  • 标题:POS Tagging for Amharic: A Machine Learning Approach
  • 本地全文:下载
  • 作者:Sintayehu Hirpassa Kefena ; Gurpreet Singh Lehal
  • 期刊名称:INFOCOMP
  • 印刷版ISSN:1807-4545
  • 出版年度:2020
  • 卷号:19
  • 期号:1
  • 出版社:Federal University of Lavras
  • 其他摘要:In this paper, our focus is the problem of automatic prediction of Parts of Speech of words in Amharic language sentence. We present an experiment that involves the study and implementation of POS tagging model. Four statistical taggers, i.e. Trigrams’n’Tags (TnT) Tagger, Conditional Random Field taggers (CRF), Naive Bays (NB) and Decision Tree (DT) classifier is applying for a morphologically rich language: Amharic. We compare the performances of all taggers with the same size of training and testing Dataset. Various types of language-dependent and independent feature set have formed, and for each algorithm, a combination of them is applied. Based on such inputs CRF based model has achieved outperformed accuracy. The best accuracy obtained from our experiment is 94.08%. Finally, our study shows that linguistic features play a decisive part in overcoming the limitations of the baseline statistical model for Amharic languages.
国家哲学社会科学文献中心版权所有