Article Information

  • Title: Kernel-Based Ensemble Learning in Python
  • Authors: Benjamin Guedj; Bhargav Srinivasa Desikan
  • Journal: Information
  • Electronic ISSN: 2078-2489
  • Year of publication: 2020
  • Volume: 11
  • Issue: 2
  • Pages: 63-74
  • DOI: 10.3390/info11020063
  • Publisher: MDPI Publishing
  • Abstract: We propose a new supervised learning algorithm for classification and regression problems where two or more preliminary predictors are available. We introduce KernelCobra, a non-linear learning strategy for combining an arbitrary number of initial predictors. KernelCobra builds on the COBRA algorithm introduced by Biau et al. (2016), which combined estimators based on a notion of proximity of predictions on the training data. While the COBRA algorithm used a binary threshold to declare which training data were close and to be used, we generalise this idea by using a kernel to better encapsulate the proximity information. Such a smoothing kernel provides more representative weights to each of the training points which are used to build the aggregate and final predictor, and KernelCobra systematically outperforms the COBRA algorithm. While COBRA is intended for regression, KernelCobra deals with classification and regression. KernelCobra is included as part of the open source Python package Pycobra (0.2.4 and onward), introduced by Srinivasa Desikan (2018). Numerical experiments were undertaken to assess the performance (in terms of pure prediction and computational complexity) of KernelCobra on real-life and synthetic datasets.
  • Keywords: machine learning; python; ensemble learning; kernels; open source software
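
The contrast the abstract draws between a binary threshold and a smoothing kernel can be illustrated with a minimal sketch. It is not the Pycobra API and not necessarily the exact weighting used in the paper: the function names (`cobra_weights`, `kernel_weights`, `aggregate`), the Gaussian kernel, the `bandwidth` parameter, and the toy data are all assumptions made for illustration. Each training point is represented by the predictions of the preliminary predictors at that point, and the aggregate is a weighted average of the training targets.

```python
import math


def cobra_weights(train_preds, query_preds, eps):
    """Binary weights: 1 if every predictor's output at the training point
    is within eps of its output at the query point, else 0."""
    return [
        1.0 if all(abs(p - q) <= eps for p, q in zip(row, query_preds)) else 0.0
        for row in train_preds
    ]


def kernel_weights(train_preds, query_preds, bandwidth):
    """Smooth weights from a Gaussian kernel (an assumed choice) applied to
    the squared distance between prediction vectors."""
    weights = []
    for row in train_preds:
        sq_dist = sum((p - q) ** 2 for p, q in zip(row, query_preds))
        weights.append(math.exp(-sq_dist / (2.0 * bandwidth ** 2)))
    return weights


def aggregate(weights, train_targets):
    """Weighted average of training targets; None if all weights are zero."""
    total = sum(weights)
    if total == 0.0:
        return None
    return sum(w * y for w, y in zip(weights, train_targets)) / total


# Toy data (hypothetical): two preliminary predictors, three training points.
train_preds = [[1.0, 1.1], [2.0, 2.2], [3.0, 2.9]]
train_targets = [1.0, 2.0, 3.0]
query_preds = [2.1, 2.0]  # the two predictors' outputs at the query point

hard = aggregate(cobra_weights(train_preds, query_preds, eps=0.25), train_targets)
soft = aggregate(kernel_weights(train_preds, query_preds, bandwidth=0.5), train_targets)
print(hard, soft)
```

With the threshold, only the second training point contributes to the prediction. With the kernel, every training point contributes, and its weight decays with the distance between its prediction vector and the query's.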