首页    期刊浏览 2024年09月21日 星期六
登录注册

文章基本信息

  • 标题:A Comparison of Machine Learning Methods in a High-Dimensional Classification Problem
  • 本地全文:下载
  • 作者:Zekić-Sušac, Marijana ; Pfeifer, Sanja ; Šarlija, Nataša
  • 期刊名称:Business Systems Research
  • 印刷版ISSN:1847-8344
  • 电子版ISSN:1847-9375
  • 出版年度:2014
  • 卷号:5
  • 期号:3
  • 页码:82-96
  • DOI:10.2478/bsrj-2014-0021
  • 语种:English
  • 出版社:Udruga za promicanje poslovne informatike
  • 摘要:Background: Large-dimensional data modelling often relies on variable reduction methods in the pre-processing and in the post-processing stage. However, such a reduction usually provides less information and yields a lower accuracy of the model. Objectives: The aim of this paper is to assess the high-dimensional classification problem of recognizing entrepreneurial intentions of students by machine learning methods. Methods/Approach: Four methods were tested: artificial neural networks, CART classification trees, support vector machines, and k-nearest neighbour on the same dataset in order to compare their efficiency in the sense of classification accuracy. The performance of each method was compared on ten subsamples in a 10-fold cross-validation procedure in order to assess computing sensitivity and specificity of each model. Results: The artificial neural network model based on multilayer perceptron yielded a higher classification rate than the models produced by other methods. The pairwise t-test showed a statistical significance between the artificial neural network and the k-nearest neighbour model, while the difference among other methods was not statistically significant. Conclusions: Tested machine learning methods are able to learn fast and achieve high classification accuracy. However, further advancement can be assured by testing a few additional methodological refinements in machine learning methods.
  • 关键词:machine learning; support vector machines; artificial neural networks; CART classification trees; k-nearest neighbour; large-dimensional data; cross-validation
国家哲学社会科学文献中心版权所有