文章基本信息

标题：Optimal variable selection in multi-group sparse discriminant analysis
本地全文：下载
作者：Irina Gaynanova ; Mladen Kolar
期刊名称：Electronic Journal of Statistics
印刷版ISSN：1935-7524
出版年度：2015
卷号：9
期号：2
页码：2007-2034
DOI：10.1214/15-EJS1064
语种：English
出版社：Institute of Mathematical Statistics
摘要：This article considers the problem of multi-group classification in the setting where the number of variables $p$ is larger than the number of observations $n$. Several methods have been proposed in the literature that address this problem, however their variable selection performance is either unknown or suboptimal to the results known in the two-group case. In this work we provide sharp conditions for the consistent recovery of relevant variables in the multi-group case using the discriminant analysis proposal of Gaynanova et al. [7]. We achieve the rates of convergence that attain the optimal scaling of the sample size $n$, number of variables $p$ and the sparsity level $s$. These rates are significantly faster than the best known results in the multi-group case. Moreover, they coincide with the minimax optimal rates for the two-group case. We validate our theoretical results with numerical analysis.
关键词：Classification;Fisher’s discriminant analysis, group penalization;high-dimensional statistics.