The Annals of Statistics

Covariance regularization by thresholding

Peter J. Bickel and Elizaveta Levina

This paper considers regularizing a covariance matrix of p variables estimated from n observations, by hard thresholding. We show that the thresholded estimate is consistent in the operator norm as long as the true covariance matrix is sparse in a suitable sense, the variables are Gaussian or sub-Gaussian, and (log p)/n→0, and obtain explicit rates. The results are uniform over families of covariance matrices which satisfy a fairly natural notion of sparsity. We discuss an intuitive resampling scheme for threshold selection and prove a general cross-validation result that justifies this approach. We also compare thresholding to other covariance estimators in simulations and on an example from climate data.
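The hard-thresholding step described in the abstract can be illustrated with a minimal sketch. It assumes the sample covariance uses divisor n and that an entry is kept when its absolute value is at least the threshold t. The function names `sample_covariance` and `hard_threshold` are hypothetical and are not taken from the paper. The threshold t is left as an input, because the resampling scheme for choosing it is not specified here.

```python
def sample_covariance(rows):
    """Sample covariance of n observations, each a list of p values.

    Assumption: divisor n (not n - 1).
    """
    n = len(rows)
    p = len(rows[0])
    means = [sum(r[j] for r in rows) / n for j in range(p)]
    return [
        [
            sum((r[i] - means[i]) * (r[j] - means[j]) for r in rows) / n
            for j in range(p)
        ]
        for i in range(p)
    ]


def hard_threshold(matrix, t):
    """Set every entry with absolute value below t to zero; keep the rest."""
    return [[x if abs(x) >= t else 0.0 for x in row] for row in matrix]


# Usage: threshold the sample covariance of a tiny illustrative data set.
data = [[0.0, 0.0, 1.0], [2.0, 2.0, 1.5], [1.0, 1.0, 0.5]]
thresholded = hard_threshold(sample_covariance(data), t=0.3)
```

Thresholding acts entry by entry, so the result stays symmetric, but it is not guaranteed to be positive definite in finite samples.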

Ann. Statist. Volume 36, Number 6 (2008), 2577-2604.

First available: 5 January 2009

Primary: 62H12: Estimation
Secondary: 62F12: Asymptotic properties of estimators; 62G09: Resampling methods

Covariance estimation; regularization; sparsity; thresholding; large p small n; high dimension low sample size


Bickel, Peter J.; Levina, Elizaveta. Covariance regularization by thresholding. The Annals of Statistics 36 (2008), no. 6, 2577-2604. doi:10.1214/08-AOS600.

