
Article Information

  • Title: Implementation of Kadazan Tagger Based on Brill's Method
  • Authors: Marylyn Alex ; Lailatul Qadri Zakaria
  • Journal: Journal of ICT Research and Applications
  • Print ISSN: 2337-5787
  • Electronic ISSN: 2338-5499
  • Year of publication: 2013
  • Volume: 7
  • Issue: 3
  • Pages: 177-190
  • Language: English
  • Publisher: Institut Teknologi Bandung
  • Abstract: We present and evaluate an implementation of Part of Speech (POS) tagging for the Kadazan language using the transformation-based approach. The main purpose of this study is to develop an automatic POS tagger for the Kadazan language, which has not been developed before. POS tagging can tag a Kadazan corpus automatically and can help reduce the ambiguity problem of this language. The approach is implemented with the aim of achieving accuracy higher than, or at least similar to, that of other tagging approaches such as the statistical and the original rule-based approaches. The approach transforms tags according to a prescribed set of rules. Three objectives were set in order to achieve the main purpose of this study: first, to apply lexical and contextual rules to this language; second, to implement Brill's algorithm based on this set of rules; and finally, to determine the effectiveness of Kadazan POS tagging using this approach. The tagging system was trained using four Kadazan corpora containing 5663 words in all. Based on the evaluation results, the tagging system achieved around 93% accuracy.
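
The transformation-based idea summarized in the abstract (assign an initial tag to each word, then rewrite tags with contextual rules, then measure accuracy against a reference tagging) can be illustrated with a minimal sketch. This is not the authors' implementation: the lexicon, the tag names, the rule format, and the English toy words are all hypothetical, chosen only for illustration, and rule learning from a training corpus is omitted.

```python
from typing import Dict, List, NamedTuple


class ContextRule(NamedTuple):
    """Hypothetical rule: change from_tag to to_tag when the previous tag is prev_tag."""
    from_tag: str
    to_tag: str
    prev_tag: str


def initial_tags(words: List[str], lexicon: Dict[str, str], default: str = "NOUN") -> List[str]:
    # Initial-state tagger: look each word up in the lexicon,
    # falling back to a default tag for unknown words.
    return [lexicon.get(word.lower(), default) for word in words]


def apply_rules(tags: List[str], rules: List[ContextRule]) -> List[str]:
    # Apply each contextual rule in order, scanning left to right.
    # Changes take effect immediately (a simplification for this sketch).
    tags = list(tags)
    for rule in rules:
        for i in range(1, len(tags)):
            if tags[i] == rule.from_tag and tags[i - 1] == rule.prev_tag:
                tags[i] = rule.to_tag
    return tags


def accuracy(predicted: List[str], gold: List[str]) -> float:
    # Fraction of positions where the predicted tag matches the reference tag.
    correct = sum(p == g for p, g in zip(predicted, gold))
    return correct / len(gold)


# Toy English data (assumed for illustration; not taken from the Kadazan corpora).
LEXICON = {"the": "DET", "can": "MODAL", "rusts": "VERB", "to": "TO", "run": "NOUN"}
RULES = [
    ContextRule("MODAL", "NOUN", "DET"),  # "the can": MODAL becomes NOUN after DET
    ContextRule("NOUN", "VERB", "TO"),    # "to run": NOUN becomes VERB after TO
]

words = ["The", "can", "rusts"]
tags = apply_rules(initial_tags(words, LEXICON), RULES)
print(list(zip(words, tags)))
```

In a full transformation-based tagger the rule list is learned by repeatedly picking the rule that most reduces errors on a tagged training corpus; here the rules are written by hand to keep the sketch short.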