首页    期刊浏览 2025年02月23日 星期日
登录注册

文章基本信息

  • 标题:The Index Thomisticus Treebank Project : Annotation, Parsing and Valency Lexicon
  • 本地全文:下载
  • 作者:Barbara McGillivray ; Marco Passarotti ; Paolo Ruffolo
  • 期刊名称:Traitement Automatique des Langues
  • 印刷版ISSN:1248-9433
  • 电子版ISSN:1965-0906
  • 出版年度:2009
  • 卷号:50
  • 期号:2
  • 出版社:ATALA - Assoc Traitement Automatique Langues
  • 摘要:We present an overview of the Index Thomisticus Treebank project (IT-TB). The IT-TB consists of around 60,000 tokens from the Index Thomisticus by Roberto Busa SJ, an 11-million-token Latin corpus of the texts by Thomas Aquinas. We briefly describe the annotation guidelines, shared with the Latin Dependency Treebank (LDT). The application of data-driven dependency parsers on IT-TB and LDT data is reported on. We present training and parsing results on several datasets and provide evaluation of learning algorithms and techniques. Furthermore, we introduce the IT-TB valency lexicon extracted from the treebank. We report on quantitative data of the lexicon and provide some statistical measures on subcategorisation structures.
国家哲学社会科学文献中心版权所有