首页    期刊浏览 2024年11月23日 星期六
登录注册

文章基本信息

  • 标题:Sentiment analysis with covariate-assisted word embeddings
  • 本地全文:下载
  • 作者:Shirong Xu ; Ben Dai ; Junhui Wang
  • 期刊名称:Electronic Journal of Statistics
  • 印刷版ISSN:1935-7524
  • 出版年度:2021
  • 卷号:15
  • 期号:1
  • 页码:3015-3039
  • DOI:10.1214/21-EJS1854
  • 语种:English
  • 出版社:Institute of Mathematical Statistics
  • 摘要:Sentiment analysis measures inclination of textual documents, aiming to extract and quantify their subjective sentiment polarity. In literature, most sentiment analysis methods first numericalize textual documents through certain word embeddings framework, and then formulate sentiment analysis as an ordinal regression or classification task. Yet it is often ignored that different people may have different preference of wording, and thus a uniform word embeddings often leads to suboptimal performance. In this article, to accommodate the heterogeneity among individual persons, we propose a covariate-assisted word embeddings in a margin-based ordinal regression framework, where covariates are incorporated through scaling factors to adjust the word embeddings. Moreover, we employ a block-wise coordinate descent scheme to tackle the resultant large-scale optimization task, and establish theoretical results to quantify the asymptotic behavior of the proposed method, guaranteeing its fast convergence rate in terms of prediction accuracy. Finally, we demonstrate the advantages of the proposed method over its competitors in both the Yelp Challenge dataset and synthetic datasets.
  • 关键词:62H30; ordinal regression; Personalized prediction; sentiment analysis; unstructured data; word embeddings
国家哲学社会科学文献中心版权所有