首页    期刊浏览 2024年07月08日 星期一
登录注册

文章基本信息

  • 标题:Sentiment detection in micro-blogs using unsupervised chunk extraction
  • 本地全文:下载
  • 作者:Pierre Magistry ; Pierre Magistry ; Shu-Kai Hsieh
  • 期刊名称:Lingua Sinica
  • 电子版ISSN:2197-6678
  • 出版年度:2016
  • 卷号:2
  • 期号:1
  • 页码:1-10
  • DOI:10.1186/s40655-015-0010-8
  • 语种:English
  • 出版社:Springer
  • 摘要:Abstract In this paper, we present a proposed system designed for sentiment detection for micro-blog data in Chinese. Our system surprisingly benefits from the lack of word boundary in Chinese writing system and shifts the focus directly to larger and more relevant chunks. We use an unsupervised Chinese word segmentation system and binomial test to extract specific and endogenous lexicon chunks from the training corpus. We combine the lexicon chunks with other external resources to train a maximum entropy model for document classification. With this method, we obtained an averaged F1 score of 87.2 which outperforms the state-of-the-art approach based on the released data in the second SocialNLP shared task.
  • 关键词:Sentiment analysis;Emotion lexicon;Unsupervised learning
国家哲学社会科学文献中心版权所有