Open Access
December 2009 Sparsistency and rates of convergence in large covariance matrix estimation
Clifford Lam, Jianqing Fan
Ann. Statist. 37(6B): 4254-4278 (December 2009). DOI: 10.1214/09-AOS720

Abstract

This paper studies the sparsistency and rates of convergence for estimating sparse covariance and precision matrices based on penalized likelihood with nonconvex penalty functions. Here, sparsistency refers to the property that all parameters that are zero are actually estimated as zero with probability tending to one. Depending on the case of applications, sparsity priori may occur on the covariance matrix, its inverse or its Cholesky decomposition. We study these three sparsity exploration problems under a unified framework with a general penalty function. We show that the rates of convergence for these problems under the Frobenius norm are of order (sn log pn/n)1/2, where sn is the number of nonzero elements, pn is the size of the covariance matrix and n is the sample size. This explicitly spells out the contribution of high-dimensionality is merely of a logarithmic factor. The conditions on the rate with which the tuning parameter λn goes to 0 have been made explicit and compared under different penalties. As a result, for the L1-penalty, to guarantee the sparsistency and optimal rate of convergence, the number of nonzero elements should be small: sn'=O(pn) at most, among O(pn2) parameters, for estimating sparse covariance or correlation matrix, sparse precision or inverse correlation matrix or sparse Cholesky factor, where sn' is the number of the nonzero elements on the off-diagonal entries. On the other hand, using the SCAD or hard-thresholding penalty functions, there is no such a restriction.

Citation

Download Citation

Clifford Lam. Jianqing Fan. "Sparsistency and rates of convergence in large covariance matrix estimation." Ann. Statist. 37 (6B) 4254 - 4278, December 2009. https://doi.org/10.1214/09-AOS720

Information

Published: December 2009
First available in Project Euclid: 23 October 2009

zbMATH: 1191.62101
MathSciNet: MR2572459
Digital Object Identifier: 10.1214/09-AOS720

Subjects:
Primary: 62F12
Secondary: 62J07

Keywords: asymptotic normality , consistency , Covariance matrix , high-dimensionality , nonconcave penalized likelihood , sparsistency

Rights: Copyright © 2009 Institute of Mathematical Statistics

Vol.37 • No. 6B • December 2009
Back to Top