首页    期刊浏览 2025年05月01日 星期四
登录注册

文章基本信息

  • 标题:Lemmatization for Ancient Greek
  • 本地全文:下载
  • 作者:Alessandro Vatri ; Barbara McGillivray
  • 期刊名称:Journal of Greek Linguistics
  • 印刷版ISSN:1566-5844
  • 电子版ISSN:1569-9846
  • 出版年度:2020
  • 卷号:20
  • 期号:2
  • 页码:179-196
  • DOI:10.1163/15699846-02002001
  • 出版社:BRILL
  • 摘要:This article presents the result of accuracy tests for currently available Ancient Greek lemmatizers and recently published lemmatized corpora. We ran a blinded experiment in which three highly proficient readers of Ancient Greek evaluated the output of the CLTK lemmatizer, of the CLTK backoff lemmatizer, and of GLEM , together with the lemmatizations offered by the Diorisis corpus and the Lemmatized Ancient Greek Texts repository. The texts chosen for this experiment are Homer, Iliad 1.1–279 and Lysias 7. The results suggest that lemmatization methods using large lexica as well as part-of-speech tagging—such as those employed by the Diorisis corpus and the CLTK backoff lemmatizer—are more reliable than methods that rely more heavily on machine learning and use smaller lexica.
  • 关键词:Ancient Greek digital corpora;Ancient Greek morphology;lemmatization;lemmatizer;usability of digital resources;digital lexicography
国家哲学社会科学文献中心版权所有