首页    期刊浏览 2024年10月06日 星期日
登录注册

文章基本信息

  • 标题:Urdu to Punjabi Machine Translation: An Incremental Training Approach
  • 本地全文:下载
  • 作者:Umrinderpal Singh ; Vishal Goyal ; Gurpreet Singh Lehal
  • 期刊名称:International Journal of Advanced Computer Science and Applications(IJACSA)
  • 印刷版ISSN:2158-107X
  • 电子版ISSN:2156-5570
  • 出版年度:2016
  • 卷号:7
  • 期号:4
  • DOI:10.14569/IJACSA.2016.070428
  • 出版社:Science and Information Society (SAI)
  • 摘要:The statistical machine translation approach is highly popular in automatic translation research area and promising approach to yield good accuracy. Efforts have been made to develop Urdu to Punjabi statistical machine translation system. The system is based on an incremental training approach to train the statistical model. In place of the parallel sentences corpus has manually mapped phrases which were used to train the model. In preprocessing phase, various rules were used for tokenization and segmentation processes. Along with these rules, text classification system was implemented to classify input text to predefined classes and decoder translates given text according to selected domain by the text classifier. The system used Hidden Markov Model(HMM) for the learning process and Viterbi algorithm has been used for decoding. Experiment and evaluation have shown that simple statistical model like HMM yields good accuracy for a closely related language pair like Urdu-Punjabi. The system has achieved 0.86 BLEU score and in manual testing and got more than 85% accuracy.
  • 关键词:thesai; IJACSA; thesai.org; journal; IJACSA papers; Machine Translation; Urdu to Punjabi Machine Translation; NLP; Urdu; Punjabi; Indo-Aryan Languages
国家哲学社会科学文献中心版权所有