Basic Article Information

Title: Improving a Japanese-Spanish Machine Translation System Using Wikipedia Medical Articles
Authors: Jessica C. Ramírez; Yuji Matsumoto; Darwin Muñoz et al.
Journal: Computer Science & Information Technology
Electronic ISSN: 2231-5403
Publication year: 2015
Volume: 5
Issue: 4
Pages: 111-116
DOI: 10.5121/csit.2015.50411
Publisher: Academy & Industry Research Collaboration Center (AIRCC)
Abstract: The quality, length and coverage of a parallel corpus are fundamental features in the performance of a Statistical Machine Translation (SMT) system. For some language pairs there is a considerable lack of resources suitable for Natural Language Processing tasks. This paper introduces a technique for extracting medical information from Wikipedia pages using a medical ontological dictionary, and then evaluates it on a Japanese-Spanish SMT system. The study shows an increase in the BLEU score.
Keywords: Comparable Corpora; Dictionary; Ontology; Machine Translation