Sparse PCA: Optimal rates and adaptive estimation

T. Tony Cai; Zongming Ma; Yihong Wu

doi:10.1214/13-AOS1178

December 2013 Sparse PCA: Optimal rates and adaptive estimation

T. Tony Cai, Zongming Ma, Yihong Wu

Ann. Statist. 41(6): 3074-3110 (December 2013). DOI: 10.1214/13-AOS1178

Abstract

Principal component analysis (PCA) is one of the most commonly used statistical procedures with a wide range of applications. This paper considers both minimax and adaptive estimation of the principal subspace in the high dimensional setting. Under mild technical conditions, we first establish the optimal rates of convergence for estimating the principal subspace which are sharp with respect to all the parameters, thus providing a complete characterization of the difficulty of the estimation problem in term of the convergence rate. The lower bound is obtained by calculating the local metric entropy and an application of Fano’s lemma. The rate optimal estimator is constructed using aggregation, which, however, might not be computationally feasible.

We then introduce an adaptive procedure for estimating the principal subspace which is fully data driven and can be computed efficiently. It is shown that the estimator attains the optimal rates of convergence simultaneously over a large collection of the parameter spaces. A key idea in our construction is a reduction scheme which reduces the sparse PCA problem to a high-dimensional multivariate regression problem. This method is potentially also useful for other related problems.

Citation

Download Citation

T. Tony Cai. Zongming Ma. Yihong Wu. "Sparse PCA: Optimal rates and adaptive estimation." Ann. Statist. 41 (6) 3074 - 3110, December 2013. https://doi.org/10.1214/13-AOS1178

Information

Published: December 2013

First available in Project Euclid: 1 January 2014

zbMATH: 1288.62099

MathSciNet: MR3161458

Digital Object Identifier: 10.1214/13-AOS1178

Subjects:

Primary: 62H12

Secondary: 62C20 , 62H25

Keywords: adaptive estimation , Aggregation , Covariance matrix , eigenvector , group sparsity , low-rank matrix , minimax lower bound , Optimal rate of convergence , Principal Component Analysis , thresholding

Access the abstract

JOURNAL ARTICLE
37 PAGES

DOWNLOAD PDF + SAVE TO MY LIBRARY