Basic article information

  • Title: Corpus-based Lexicography for Lesser-resourced Languages — Maximizing the Limited Corpus
  • Author: DJ Prinsloo
  • Journal: Lexikos
  • Print ISSN: 1684-4904
  • Electronic ISSN: 2224-0039
  • Publication year: 2015
  • Volume: 25
  • Issue: 0
  • Pages: 285-300
  • Publisher: Bureau of the WAT
  • Abstract: This article focuses on lesser-resourced languages for which only very limited corpora are available and how such relatively small and often unbalanced, raw corpora could be maximally utilized for lexicographic purposes to obtain similar results as for bigger corpora. Sepedi and Afrikaans will be studied in this regard. The aim is to determine to what extent enlarging a corpus from e.g. one to 10 million, and from 10 million to 100 million words enhances its potential for (a) macrostructure compilation, (b) sourcing information on the most important microstructural aspects and (c) the creation of lexicographic tools. It will be argued that valuable and even sufficient data for the compilation of a specific dictionary can be extracted from a relatively small corpus of approximately one million words but that "bigger" in some instances indeed means "better".
  • Keywords: Corpus-based lexicography, lesser-resourced languages, limited corpora, corpus tools, lexicographic tools