期刊名称:Chimera: Romance Corpora and Linguistic Studies
电子版ISSN:2386-2629
出版年度:2018
卷号:5
期号:1
页码:111-116
DOI:10.15366/chimera2018.5.1.009
出版社:Universidad Autónoma de Madrid
摘要:The term Corpus Reengineering is proposed to refer to the process of reconfiguration of speech samples for reuse in various scopes (Guirado 2014 y 2015). The process was developed with two corpora of Spanish spoken in Caracas, aiming at the construction a special purpose corpus for diachronic studies: Corpus del habla culta de Caracas 1968-77 (cf. Rosenblat & Bentivoglio 1979) y el Corpus sociolingüístico de Caracas PRESEEA 2004-10 (cf. Bentivoglio & Malaver 2014). The methodology included the following steps: i. describe the structuring of the original corpus; ii. evaluate the inoperative and stable aspects of each architecture; and, iii. create the new design and estimate its representativeness. Such methodology produced a new corpus for the real time study of linguistic phenomena in a specific speech community (cultured speakers): Habla culta de Caracas 1973-2011. Corpus diacrónico . It is concluded that Corpus Reengineering is a rewarding practice for Corpus Linguistics that provides useful and interesting products for the linguistic community.
关键词:Corpus Reengineering; special purpose corpus; corpus linguistics; Spanish
spoken in Caracas; diachronic study.