首页    期刊浏览 2024年11月28日 星期四
登录注册

文章基本信息

  • 标题:Bangla Academic Dictionaries (BAD) Corpus with some Applications and Statistical Analysis
  • 本地全文:下载
  • 作者:MD. Abdul Awal Ansary ; Mohammad Reza Selim ; Muhammad Zafar Iqbal
  • 期刊名称:Journal of Emerging Trends in Computing and Information Sciences
  • 电子版ISSN:2079-8407
  • 出版年度:2012
  • 卷号:3
  • 期号:11
  • 页码:1562-1566
  • 出版社:ARPN Publishers
  • 摘要:The present era is a globally digital era and information/knowledge sharing is a common scenario now a days. But language difference is a great problem for sharing knowledge among different communities. To overcome this problem, different countries in the world have developed machine translators (MTs) such as English to French, English to Japanese, English to Spanish, English to Chinese, etc. However, there is no sound system that can translate English text into Bengali text. To develop a machine translator for English to Bengali translation and other linguistic purposes, first of all one needs a bilingual corpus between English and Bengali text. In this article, we have described the architecture to develop an English-Bengali bilingual corpus which is based on Bangla Academic Dictionaries (BAD) - English to Bengali (E2B) and Bengali to English (B2E) Dictionaries. We have used Bangla Academic dictionaries (BAD) because vocabularies are available in dictionaries from different genres. We have also mentioned different applications and statistical analysis of this BAD corpus.
  • 关键词:Parallel Corpus; POS tagging; Statistical Analysis; Corpus alignment; Machine Translation; NLP.
国家哲学社会科学文献中心版权所有