Article information

  • Title: Creation of a High-quality, Register-diversified Parallel (English-Spanish) Corpus for Linguistic and Computational Investigations
  • Authors: Julia Lavid; Jorge Arús
  • Journal: Procedia - Social and Behavioral Sciences
  • Print ISSN: 1877-0428
  • Year of publication: 2015
  • Volume: 198
  • Pages: 249-256
  • DOI: 10.1016/j.sbspro.2015.07.443
  • Language: English
  • Publisher: Elsevier
  • Abstract: This paper outlines current work on the construction of a high-quality, richly annotated and register-diversified parallel corpus for the English-Spanish language pair, carried out within the framework of the MULTINOT project. The corpus consists of original and translated texts in both directions and is designed as a multifunctional resource for use in a number of disciplines, such as corpus-based contrastive linguistics and translation studies, machine translation, computer-assisted translation, computer-assisted language learning and terminology extraction. The paper describes the structure of the corpus, which includes four subcorpora: English originals (EO), Spanish originals (SO), English translations (Etrans) and Spanish translations (Strans); the registers selected for inclusion in the corpus; and the methodology used to guarantee the quality of the processing steps that enrich the corpus with linguistic information at different levels.
  • Keywords: Corpus creation; corpus annotation; English; Spanish