首页    期刊浏览 2024年09月19日 星期四
登录注册

文章基本信息

  • 标题:Dictionary Writing System (DWS) + Corpus Query Package (CQP): The Case of "TshwaneLex"
  • 本地全文:下载
  • 作者:G-M de Schryver ; G De Pauw
  • 期刊名称:Lexikos
  • 印刷版ISSN:1684-4904
  • 电子版ISSN:2224-0039
  • 出版年度:2007
  • 卷号:17
  • 期号:1
  • DOI:10.4314/lex.v17i1.51535
  • 语种:English
  • 出版社:Bureau of the WAT
  • 摘要:In this article the integrated corpus query functionality of the dictionary compilation software TshwaneLex is analysed. Attention is given to the handling of both raw corpus data and annotated corpus data. With regard to the latter it is shown how, with a minimum of human effort, machine learning techniques can be employed to obtain part-of-speech tagged corpora that can be used for lexicographic purposes. All points are illustrated with data drawn from English and Northern Sotho. The tools and techniques themselves, however, are language-independent, and as such the encouraging outcomes of this study are far-reaching. Keywords: lexicography, dictionary, software, dictionary writing sys-tem (dws), corpus query package (cqp), tshwanelex, corpus, corpus anno-tation, part-of-speech tagger (pos-tagger), machine learning, northern sotho (sesotho sa leboa)
国家哲学社会科学文献中心版权所有