首页    期刊浏览 2024年11月28日 星期四
登录注册

文章基本信息

  • 标题:A SIMPLIFIED APPROACH TO WORD ALIGNMENT ALGORITHM FOR ENGLISH-TAMIL TRANSLATION
  • 本地全文:下载
  • 作者:R. Harshawardhan ; Mridula Sara Augustine ; Dr K. P. Soman
  • 期刊名称:Indian Journal of Computer Science and Engineering
  • 印刷版ISSN:2231-3850
  • 电子版ISSN:0976-5166
  • 出版年度:2011
  • 卷号:2
  • 期号:1
  • 页码:94-100
  • 出版社:Engg Journals Publications
  • 摘要:In this paper, a recently proposed word alignment algorithm is simplified for easy understanding and tested for an Indian language. The word alignment problem is viewed as a simple assignment problem and is formulated as an Integer Linear Programming problem. The new objective function defined is tested for obtaining optimal alignment for English-Tamil translation pair. This alignment is necessary for creating the probabilistic bilingual dictionary and is also required for automatic machine translation. We have used this objective function to align words in 25 sentences of English-Tamil parallel corpora. The formulation is solved using the open source LP-Solver. Result obtained indicates that the methodology is applicable for all Indian languages. The work implemented is useful for pedagogical purposes, as it is a standard problem in computational linguistics. Accuracy of modern statistical machine translation depends on good word alignment. The document of the formulated model is available on request.
  • 关键词:Word alignment; assignment problem; integer linear programming; alignment variables and dictionary variables.
国家哲学社会科学文献中心版权所有