首页    期刊浏览 2024年10月04日 星期五
登录注册

文章基本信息

  • 标题:Transliterating Nôm Scripts into Vietnamese National Scripts using Statistical Machine Translation
  • 本地全文:下载
  • 作者:Dien Dinh ; Phuong Nguyen ; Long H. B. Nguyen
  • 期刊名称:International Journal of Advanced Computer Science and Applications(IJACSA)
  • 印刷版ISSN:2158-107X
  • 电子版ISSN:2156-5570
  • 出版年度:2021
  • 卷号:12
  • 期号:2
  • 页码:37-45
  • DOI:10.14569/IJACSA.2021.0120205
  • 出版社:Science and Information Society (SAI)
  • 摘要:Nôm scripts were used as the Vietnamese writing system from the 10th century to the early 20th century. During this period, Nôm scripts were the means to record a broad range of historical events, literary works, medical knowledge, as well as wisdom of many other domains. Unfortunately, since hardly any native Vietnamese speaker can read Nôm scripts nowadays, these valuable documents have not been fully harnessed. To address this gap, it is necessary to build an automatic transliteration system that can support us in decoding the ancient scripts and gaining knowledge of our Vietnamese ancestors. This study focuses on categorizing and reviewing the current progress on the Statistical Machine Translation (SMT) approaches to transliterate Nôm scripts into Vietnamese national scripts. In this paper, we discuss the differences between Nôm scripts and Vietnamese national scripts, systematically compare SMT models in transliterating Nôm scripts into Vietnamese national scripts, as well as having a thorough outlook on several promising research directions.
  • 关键词:Statistical machine translation; automatic translit-eration; Nôm Script (chú Nôm); vietnamese national script (chú Quèc ngú)
国家哲学社会科学文献中心版权所有