文章基本信息

标题：Group variable selection via a hierarchical lasso and its oracle property
本地全文：下载
作者：Nengfeng Zhou ; Ji Zhu
期刊名称：Statistics and Its Interface
印刷版ISSN：1938-7989
电子版ISSN：1938-7997
出版年度：2010
卷号：3
期号：4
页码：557-574
DOI：10.4310/SII.2010.v3.n4.a13
出版社：International Press
摘要：In many engineering and scientific applications, prediction variables are grouped, for example, in biological applications where assayed genes or proteins can be grouped by biological roles or biological pathways. Common statistical analysis methods such as ANOVA, factor analysis, and functional modeling with basis sets also exhibit natural variable groupings. Existing successful group variable selection methods have the limitation of selecting variables in an “all-in-all-out” fashion, i.e., when one variable in a group is selected, all other variables in the same group are also selected. In many real problems, however, we may want to keep the flexibility of selecting variables within a group, such as in gene-set selection. In this paper, we develop a new group variable selection method that not only removes unimportant groups effectively, but also keeps the flexibility of selecting variables within a group. We also show that the new method offers the potential for achieving the theoretical “oracle” property.
关键词：group selection; lasso; oracle property; regularization; variable selection