首页    期刊浏览 2024年11月27日 星期三
登录注册

文章基本信息

  • 标题:Performance Analysis of Statistical and Supervised Learning Techniques in Stock Data Mining
  • 本地全文:下载
  • 作者:Manik Sharma ; Samriti Sharma ; Gurvinder Singh
  • 期刊名称:Data
  • 印刷版ISSN:2306-5729
  • 出版年度:2018
  • 卷号:3
  • 期号:4
  • 页码:54-69
  • DOI:10.3390/data3040054
  • 出版社:MDPI Publishing
  • 摘要:Nowadays, overwhelming stock data is available, which areonly of use if it is properly examined and mined. In this paper, the last twelve years of ICICI Bank’s stock data have been extensively examined using statistical and supervised learning techniques. This study may be of great interest for those who wish to mine or study the stock data of banks or any financial organization. Different statistical measures have been computed to explore the nature, range, distribution, and deviation of data. The different descriptive statistical measures assist in finding different valuable metrics such as mean, variance, skewness, kurtosis, p-value, a-squared, and 95% confidence mean interval level of ICICI Bank’s stock data. Moreover, daily percentage changes occurring over the last 12 years have also been recorded and examined. Additionally, the intraday stock status has been mined using ten different classifiers. The performance of different classifiers has been evaluated on the basis of various parameters such as accuracy, misclassification rate, precision, recall, specificity, and sensitivity. Based upon different parameters, the predictive results obtained using logistic regression are more acceptable than the outcomes of other classifiers, whereas naïve Bayes, C4.5, random forest, linear discriminant, and cubic support vector machine (SVM) merely act as a random guessing machine. The outstanding performance of logistic regression has been validated using TOPSIS (technique for order preference by similarity to ideal solution) and WSA (weighted sum approach).
  • 关键词:stock forecasting; naïve Bayes; C4.5; random forest; logistic regression; support vector machine stock forecasting ; naïve Bayes ; C4.5 ; random forest ; logistic regression ; support vector machine
国家哲学社会科学文献中心版权所有