首页    期刊浏览 2025年02月21日 星期五
登录注册

文章基本信息

  • 标题:wsrf: An R Package for Classification with Scalable Weighted Subspace Random Forests
  • 本地全文:下载
  • 作者:He Zhao ; Graham J. Williams ; Joshua Zhexue Huang
  • 期刊名称:Journal of Statistical Software
  • 印刷版ISSN:1548-7660
  • 电子版ISSN:1548-7660
  • 出版年度:2017
  • 卷号:77
  • 期号:1
  • 页码:1-30
  • 语种:English
  • 出版社:University of California, Los Angeles
  • 摘要:We describe a parallel implementation in R of the weighted subspace random forest algorithm (Xu, Huang, Williams, Wang, and Ye 2012) available as the wsrf package. A novel variable weighting method is used for variable subspace selection in place of the traditional approach of random variable sampling. This new approach is particularly useful in building models for high dimensional data - often consisting of thousands of variables. Parallel computation is used to take advantage of multi-core machines and clusters of machines to build random forest models from high dimensional data in considerably shorter times. A series of experiments presented in this paper demonstrates that wsrf is faster than existing packages whilst retaining and often improving on the classification performance, particularly for high dimensional data.
  • 关键词:wsrf;weighted random forests;scalable;parallel computation;big data
国家哲学社会科学文献中心版权所有