Article Information

  • Title: A Novel Method for Disease Prediction: Hybrid of Random Forest and Multivariate Adaptive Regression Splines
  • Authors: Yao, Dengju; Yang, Jing; Zhan, Xiaojuan
  • Journal: Journal of Computers
  • Print ISSN: 1796-203X
  • Publication year: 2013
  • Volume: 8
  • Issue: 1
  • Pages: 170-177
  • DOI: 10.4304/jcp.8.1.170-177
  • Language: English
  • Publisher: Academy Publisher
  • Abstract: Data mining for disease prediction and diagnosis has become a focus of attention. It provides an important means of extracting valuable medical rules hidden in medical data and plays an important role in disease prediction and clinical diagnosis. This paper surveys several popular data mining techniques for disease prediction and diagnosis, such as decision trees, association rule analysis, and clustering analysis. It then proposes a novel hybrid of random forest and multivariate adaptive regression splines (MARS) for building disease prediction models. First, the random forest algorithm performs a preliminary screening of the variables and produces an importance ranking. Then, the dataset restricted to the top-k most important predictors is passed to the MARS procedure, which builds interpretable models for predicting disease survivability. The combined method is evaluated using basic performance measures (accuracy, sensitivity, and specificity) with 10-fold cross-validation. Experimental results show that the proposed method achieves higher accuracy with a relatively simple model.
  • Keywords: data mining; medical data; random forest; multivariate adaptive regression splines
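
The building blocks named in the abstract can be illustrated with a minimal standard-library Python sketch. It is not the authors' implementation: it does not train a random forest or fit a MARS model, and the importance scores and feature names used with it are hypothetical. It only shows top-k selection from an importance ranking, the mirrored hinge functions that MARS models are built from, a plain 10-fold index split, and the three reported measures.

```python
from typing import Dict, List, Sequence, Tuple


def top_k_features(importance: Dict[str, float], k: int) -> List[str]:
    """Return the k feature names with the largest importance scores.

    The scores are assumed to come from some earlier ranking step
    (for example a random forest); they are not computed here.
    """
    ranked = sorted(importance, key=lambda name: importance[name], reverse=True)
    return ranked[:k]


def hinge_pair(x: float, knot: float) -> Tuple[float, float]:
    """Mirrored hinge basis functions max(0, x - t) and max(0, t - x)."""
    return max(0.0, x - knot), max(0.0, knot - x)


def k_fold_indices(n_samples: int, n_folds: int = 10) -> List[List[int]]:
    """Split indices 0..n_samples-1 into n_folds contiguous test folds.

    Fold sizes differ by at most one. No shuffling or stratification is
    done; a real evaluation would usually shuffle the data first.
    """
    base, extra = divmod(n_samples, n_folds)
    folds: List[List[int]] = []
    start = 0
    for i in range(n_folds):
        size = base + (1 if i < extra else 0)
        folds.append(list(range(start, start + size)))
        start += size
    return folds


def binary_metrics(y_true: Sequence[int], y_pred: Sequence[int]) -> Dict[str, float]:
    """Accuracy, sensitivity and specificity for 0/1 labels (1 = positive)."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    return {
        "accuracy": (tp + tn) / len(pairs),
        # Sensitivity: share of true positives among actual positives.
        "sensitivity": tp / (tp + fn) if (tp + fn) else 0.0,
        # Specificity: share of true negatives among actual negatives.
        "specificity": tn / (tn + fp) if (tn + fp) else 0.0,
    }
```

In a full pipeline, each fold from `k_fold_indices` would serve once as the test set while the ranking and the spline model are fitted on the remaining folds, and the metrics from `binary_metrics` would be averaged over the ten folds.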