
Article Information

  • Title: Sentiment Analysis of Lithuanian Texts Using Traditional and Deep Learning Approaches
  • Authors: Jurgita Kapočiūtė-Dzikienė ; Robertas Damaševičius
  • Journal: Computers
  • Electronic ISSN: 2073-431X
  • Publication year: 2019
  • Volume: 8
  • Issue: 1
  • Pages: 4
  • DOI: 10.3390/computers8010004
  • Language: English
  • Publisher: MDPI Publishing
  • Abstract: We describe sentiment analysis experiments performed on a Lithuanian Internet comment dataset using traditional machine learning approaches, Naïve Bayes Multinomial (NBM) and Support Vector Machine (SVM), and deep learning approaches, Long Short-Term Memory (LSTM) and Convolutional Neural Network (CNN). The traditional machine learning techniques were used with features based on lexical, morphological, and character information. The deep learning approaches were applied on top of two types of word embeddings (Word2Vec continuous bag-of-words with negative sampling, and FastText). Both traditional and deep learning approaches had to solve the positive/negative/neutral sentiment classification task on the balanced and full versions of the dataset. The best deep learning results (an accuracy of 0.706) were achieved on the full dataset with CNN applied on top of the FastText embeddings, with emoticons replaced and diacritics eliminated. The traditional machine learning approaches demonstrated the best performance (an accuracy of 0.735) on the full dataset with the NBM method, with emoticons replaced, diacritics restored, and lemma unigrams as features. Although the traditional machine learning approaches were superior to the deep learning methods, deep learning demonstrated good results when applied to the small datasets.
  • Keywords: sentiment analysis; machine learning; deep learning; neural word embeddings; Internet comments; Lithuanian language
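
The abstract mentions two preprocessing steps, replacing emoticons and eliminating diacritics. A minimal sketch of what such steps might look like is given here, using only the Python standard library. The emoticon map, the replacement tokens, and the function names are hypothetical illustrations; the record does not give the authors' actual replacement list or pipeline.

```python
import unicodedata

# Hypothetical emoticon-to-token map (an assumption for illustration only;
# the paper's actual replacement list is not given in this record).
EMOTICONS = {
    ":-)": "emo_positive",
    ":)": "emo_positive",
    ":-(": "emo_negative",
    ":(": "emo_negative",
}


def strip_diacritics(text: str) -> str:
    """Remove combining marks, e.g. 'ačiū' becomes 'aciu'."""
    # NFD splits each accented letter into a base letter plus combining marks.
    decomposed = unicodedata.normalize("NFD", text)
    return "".join(ch for ch in decomposed if not unicodedata.combining(ch))


def replace_emoticons(text: str) -> str:
    """Replace each known emoticon with a word-like token."""
    # Longer emoticons are handled first so a shorter one never splits them.
    for emoticon in sorted(EMOTICONS, key=len, reverse=True):
        text = text.replace(emoticon, f" {EMOTICONS[emoticon]} ")
    # Collapse the extra whitespace introduced by the padding above.
    return " ".join(text.split())


def preprocess(text: str) -> str:
    """Replace emoticons, eliminate diacritics, and lowercase the comment."""
    return strip_diacritics(replace_emoticons(text)).lower()
```

For example, `preprocess("Ačiū :)")` yields `"aciu emo_positive"`. Restoring diacritics, which the abstract reports for the best traditional configuration, is a harder task that needs a language model or dictionary and is not covered by this sketch.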