首页    期刊浏览 2024年11月25日 星期一
登录注册

文章基本信息

  • 标题:Optimizing the Hyperparameter of Feature Extraction and Machine Learning Classification Algorithms
  • 本地全文:下载
  • 作者:Sani Muhammad Isa ; Rizaldi Suwandi ; Yosefina Pricilia Andrean
  • 期刊名称:International Journal of Advanced Computer Science and Applications(IJACSA)
  • 印刷版ISSN:2158-107X
  • 电子版ISSN:2156-5570
  • 出版年度:2019
  • 卷号:10
  • 期号:3
  • 页码:69-76
  • DOI:10.14569/IJACSA.2019.0100309
  • 出版社:Science and Information Society (SAI)
  • 摘要:The process of assigning a quantitative value to a piece of text expressing a mood or effect is called Sentiment analysis. Comparison of several machine learning, feature extraction approaches, and parameter optimization was done to achieve the best accuracy. This paper proposes an approach to extracting comparison value of sentiment review using three features extraction: Word2vec, Doc2vec, Terms Frequency-Inverse Document Frequency (TF-IDF) with machine learning classification algorithms, such as Support Vector Machine (SVM), Naive Bayes and Decision Tree. Grid search algorithm is used to optimize the feature extraction and classifier parameter. The performance of these classification algorithms is evaluated based on accuracy. The approach that is used in this research succeeded to increase the classification accuracy for all feature extractions and classifiers using grid search hyperparameter optimization on varied pre-processed data.
  • 关键词:Sentiment analysis; word2vec; TF-IDF (terms frequency-inverse document frequency); Doc2vec; grid search
国家哲学社会科学文献中心版权所有