首页    期刊浏览 2025年04月23日 星期三
登录注册

文章基本信息

  • 标题:LegalNERCwith ontologies,Wikipedia and curriculum learning
  • 本地全文:下载
  • 作者:Cristian Cardellino ; Milagro Teruel ; Laura Alonso Alemany
  • 期刊名称:Conference on European Chapter of the Association for Computational Linguistics (EACL)
  • 出版年度:2017
  • 卷号:2017
  • 页码:254-259
  • 语种:English
  • 出版社:ACL Anthology
  • 摘要:In this paper, we present a Wikipedia-based approach to develop resources for the legal domain. We establish a mapping between a legal domain ontology, LKIF (Hoekstra et al. 2007), and a Wikipedia-based ontology, YAGO (Suchanek et al. 2007), and through that we populate LKIF. Moreover, we use the mentions of those entities in Wikipedia text to train a specific Named Entity Recognizer and Classifier. We find that this classifier works well in the Wikipedia, but, as could be expected, performance decreases in a corpus of judgments of the European Court of Human Rights. However, this tool will be used as a preprocess for human annotation. We resort to a technique called “curriculum learning” aimed to overcome problems of overfitting by learning increasingly more complex concepts. However, we find that in this particular setting, the method works best by learning from most specific to most general concepts, not the other way round.
国家哲学社会科学文献中心版权所有