首页    期刊浏览 2024年10月07日 星期一
登录注册

文章基本信息

  • 标题:Modified balanced random forest for improving imbalanced data prediction
  • 本地全文:下载
  • 作者:Zahra Putri Agusta ; Adiwijaya Adiwijaya
  • 期刊名称:IJAIN (International Journal of Advances in Intelligent Informatics)
  • 印刷版ISSN:2442-6571
  • 电子版ISSN:2548-3161
  • 出版年度:2019
  • 卷号:5
  • 期号:1
  • 页码:58-65
  • DOI:10.26555/ijain.v5i1.255
  • 语种:English
  • 出版社:Universitas Ahmad Dahlan
  • 摘要:This paper proposes a Modified Balanced Random Forest (MBRF) algorithm as a classification technique to address imbalanced data. The MBRF process changes the process in a Balanced Random Forest by applying an under-sampling strategy based on clustering techniques for each data bootstrap decision tree in the Random Forest algorithm. To find the optimal performance of our proposed method compared with four clustering techniques, like: K-MEANS, Spectral Clustering, Agglomerative Clustering, and Ward Hierarchical Clustering. The experimental result show the Ward Hierarchical Clustering Technique achieved optimal performance, also the proposed MBRF method yielded better performance compared to the Balanced Random Forest (BRF) and Random Forest (RF) algorithms, with a sensitivity value or true positive rate (TPR) of 93.42%, a specificity or true negative rate (TNR) of 93.60%, and the best AUC accuracy value of 93.51%. Moreover, MBRF also reduced process running time.
  • 关键词:Imbalanced data;Random forest algorithm;Balanced random forest;Customer churn;Classification technique
国家哲学社会科学文献中心版权所有