首页    期刊浏览 2025年04月21日 星期一
登录注册

文章基本信息

  • 标题:Towards Kikamba Computational Grammar
  • 本地全文:下载
  • 作者:Benson Kituku ; Wanjiku Nganga ; Lawrence Muchemi
  • 期刊名称:Journal of Data Analysis and Information Processing
  • 印刷版ISSN:2327-7211
  • 电子版ISSN:2327-7203
  • 出版年度:2019
  • 卷号:7
  • 期号:4
  • 页码:250-275
  • DOI:10.4236/jdaip.2019.74015
  • 语种:English
  • 出版社:Scientific Research Publishing
  • 摘要:The under-resourced Kikamba language has few language technology tools since the more efficient and popular data driven approaches for developing them suffer from data sparseness due to lack of digitized corpora. To address this challenge, we have developed a computational grammar for the Kikamba language within the multilingual Grammatical Framework (GF) toolkit. GF uses the Interlingua rule-based translation approach. To develop the grammar, we used the morphology driven strategy. Therefore, we first developed regular expressions for morphology inflection and thereafter developed the syntax rules. Evaluation of the grammar was done using one hundred sentences in both English and Kikamba languages. The results were an encouraging four n-gram BLEU score of 83.05% and the Position independent error rate (PER) of 10.96%. Finally, we have made a contribution to the language technology resources for Kikamba including multilingual machine translation, a morphology analyzer, a computational grammar which provides a platform for development of multilingual applications and the ability to generate a variety of bilingual corpora for Kikamba for all languages currently defined in GF, making it easier to experiment with data driven approaches.
  • 关键词:Grammar;Morphology;Syntax;Grammatical Framework;Under-Resourced language;Concord;Multilingual;Agglutination;Kikamba
国家哲学社会科学文献中心版权所有