文章基本信息

标题：Sentiment analysis with covariate-assisted word embeddings
本地全文：下载
作者：Shirong Xu ; Ben Dai ; Junhui Wang 等
期刊名称：Electronic Journal of Statistics
印刷版ISSN：1935-7524
出版年度：2021
卷号：15
期号：1
页码：3015-3039
DOI：10.1214/21-EJS1854
语种：English
出版社：Institute of Mathematical Statistics
摘要：Sentiment analysis measures inclination of textual documents, aiming to extract and quantify their subjective sentiment polarity. In literature, most sentiment analysis methods first numericalize textual documents through certain word embeddings framework, and then formulate sentiment analysis as an ordinal regression or classification task. Yet it is often ignored that different people may have different preference of wording, and thus a uniform word embeddings often leads to suboptimal performance. In this article, to accommodate the heterogeneity among individual persons, we propose a covariate-assisted word embeddings in a margin-based ordinal regression framework, where covariates are incorporated through scaling factors to adjust the word embeddings. Moreover, we employ a block-wise coordinate descent scheme to tackle the resultant large-scale optimization task, and establish theoretical results to quantify the asymptotic behavior of the proposed method, guaranteeing its fast convergence rate in terms of prediction accuracy. Finally, we demonstrate the advantages of the proposed method over its competitors in both the Yelp Challenge dataset and synthetic datasets.
关键词：62H30; ordinal regression; Personalized prediction; sentiment analysis; unstructured data; word embeddings