
Article Information

  • Title: Noise or music? Investigating the usefulness of normalisation for robust sentiment analysis on social media data
  • Authors: Cynthia Van Hee ; Marjan Van de Kauter ; Orphée De Clercq
  • Journal: Traitement Automatique des Langues
  • Print ISSN: 1248-9433
  • Electronic ISSN: 1965-0906
  • Publication year: 2017
  • Volume: 58
  • Issue: 1
  • Pages: 1-25
  • Language: French
  • Publisher: ATALA - Assoc Traitement Automatique Langues
  • Abstract: In the past decade, sentiment analysis research has thrived, especially on social media. While this data genre is suitable to extract opinions and sentiment, it is known to be noisy. Complex normalisation methods have been developed to transform noisy text into its standard form, but their effect on tasks like sentiment analysis remains underinvestigated. Sentiment analysis approaches mostly include spell checking or rule-based normalisation as preprocessing and rarely investigate its impact on the task performance. We present an optimised sentiment classifier and investigate to what extent its performance can be enhanced by integrating SMT-based normalisation as preprocessing. Experiments on a test set comprising a variety of user-generated content genres revealed that normalisation improves sentiment classification performance on tweets and blog posts, showing the model’s ability to generalise to other data genres.