首页    期刊浏览 2024年10月07日 星期一
登录注册

文章基本信息

  • 标题:Regularization and Estimation in Regression with Cluster Variables
  • 本地全文:下载
  • 作者:Qingzhao Yu 1 , Bin Li
  • 期刊名称:Open Journal of Statistics
  • 印刷版ISSN:2161-718X
  • 电子版ISSN:2161-7198
  • 出版年度:2014
  • 卷号:04
  • 期号:10
  • 页码:814-825
  • DOI:10.4236/ojs.2014.410077
  • 语种:English
  • 出版社:Scientific Research Publishing
  • 摘要:Clustering Lasso, a new regularization method for linear regressions is proposed in the paper. The Clustering Lasso can select variable while keeping the correlation structures among variables. In addition, Clustering Lasso encourages selection of clusters of variables, so that variables having the same mechanism of predicting the response variable will be selected together in the regression model. A real microarray data example and simulation studies show that Clustering Lasso outperforms Lasso in terms of prediction performance, particularly when there is collinearity among variables and/or when the number of predictors is larger than the number of observations. The Clustering Lasso paths can be obtained using any established algorithm for Lasso solution. An algorithm is proposed to construct variable correlation structures and to compute Clustering Lasso paths efficiently.
  • 关键词:Clustered Variables; Lasso; Principal Component Analysis
国家哲学社会科学文献中心版权所有