文章基本信息

标题：Forty years of working with corpora: from Ibsen to Twitter, and beyond
本地全文：下载
作者：Knut Hofland ; Paul Meurer ; Andrew Salway 等
期刊名称：Bergen Language and Linguistics Studies
印刷版ISSN：1892-2449
出版年度：2013
卷号：3
期号：1
页码：9-22
DOI：10.15845/bells.v3i1.371
出版社：The University of Bergen
摘要：We provide an overview of forty years of work with language corpora by the research group that started in 1972 as the Norwegian Computing Centre for the Humanities.A brief history highlights major corpora and tools that have been developed in numerous collaborations, including corpora of literature, dialect recordings, learner language, parallel texts, newspaper articles, blog posts and tweets.Current activities are also described, with a focus on corpus analysis tools, treebanks and social media analysis.
关键词：corpus!building;!corpus!analysis!tools;!treebanks;!social!media!analysis