首页    期刊浏览 2024年09月19日 星期四
登录注册

文章基本信息

  • 标题:A Survey on Large Scale Corpora and Emotion Corpora
  • 本地全文:下载
  • 作者:Michal Ptaszynski ; Rafal Rzepka ; Satoshi Oyama
  • 期刊名称:Information and Media Technologies
  • 电子版ISSN:1881-0896
  • 出版年度:2014
  • 卷号:9
  • 期号:4
  • 页码:429-445
  • DOI:10.11185/imt.9.429
  • 出版社:Information and Media Technologies Editorial Board
  • 摘要:In this paper we present a survey on natural language corpora, with particular focus on corpora of large scale and those applicable to sentiment analysis. Natural language corpora are crucial for training various Software Engineering applications, from part-of-speech taggers and dependency parsers to dialog systems or sentiment analysis software. We compare several natural language corpora created for different languages, analyze their distinctive features and the amount of additional annotations provided by the developers of those corpora.
国家哲学社会科学文献中心版权所有