首页    期刊浏览 2024年11月28日 星期四
登录注册

文章基本信息

  • 标题:Extraction and Presentation of Bilingual Correspondences from Slovak-Bulgarian Parallel Corpus
  • 本地全文:下载
  • 作者:RADOVAN GARABÍK ; RADOVAN GARABÍK ; LUDMILA DIMITROVA
  • 期刊名称:Cognitive Studies | Études cognitives
  • 印刷版ISSN:2080-7147
  • 电子版ISSN:2392-2397
  • 出版年度:2015
  • 期号:15
  • 页码:327-334
  • DOI:10.11649/cs.2015.022
  • 语种:English
  • 出版社:Institute of Slavic Studies, Polish Academy of Sciences
  • 摘要:In this paper the results of the automatic extraction and presentation of bilingual correspondences from Slovak-Bulgarian Parallel corpus are described. The equivalent phrases are extracted from sentence and word level automatically aligned corpus, filtered, indexed and presented in a dictionary-like interface. The bilingual dictionary database contains 80 thousand phrase pairs consisting of approximately 350 thousand words (per each language). Counting unique word forms, the size is 31 thousand in the Slovak part of the dictionary, 26 thousand in the Bulgarian part.
  • 关键词:translation equivalents; GIZA++; parallel corpora; aligned text; Slovak; Bulgarian
国家哲学社会科学文献中心版权所有