摘要:In recent years, Internet provides a unique opportunity to express and spread public sentiment, which makes the web contents becoming the largest information source of public sentiment. Since web public sentiment reflects people’s attitude to society and politics, the public opinion’s orientation is significant to decision-makers. In this paper, we utilize VSM (vector space model) to present the text orientation of web information and offer data-mining approaches to analyze public opinion’s orientation, which can assist decision-makers to steer social information and guide the web public sentiment. To achieve the goal of text orientation analysis, two ways are proposed. Firstly, a novel text orientation analysis method is described to analyze the orientation of original web postings and their replies. Secondly, an improved single-pass clustering algorithm is introduced to cluster the subject of web discussion and discover the hot topics.We also construct a prototype system, named WPSAS (web public sentiment analysis system), as experimental platform to validate the presented methodology. The experimental results show that our methods are effective and efficient.
关键词:web public sentiment;orientation analysis;opinion mining;clustering;VSM