Regularized estimation of large covariance matrices

Peter J. Bickel; Elizaveta Levina

doi:10.1214/009053607000000758

February 2008 Regularized estimation of large covariance matrices

Peter J. Bickel, Elizaveta Levina

Ann. Statist. 36(1): 199-227 (February 2008). DOI: 10.1214/009053607000000758

Abstract

This paper considers estimating a covariance matrix of p variables from n observations by either banding or tapering the sample covariance matrix, or estimating a banded version of the inverse of the covariance. We show that these estimates are consistent in the operator norm as long as (log p)/n→0, and obtain explicit rates. The results are uniform over some fairly natural well-conditioned families of covariance matrices. We also introduce an analogue of the Gaussian white noise model and show that if the population covariance is embeddable in that model and well-conditioned, then the banded approximations produce consistent estimates of the eigenvalues and associated eigenvectors of the covariance matrix. The results can be extended to smooth versions of banding and to non-Gaussian distributions with sufficiently short tails. A resampling approach is proposed for choosing the banding parameter in practice. This approach is illustrated numerically on both simulated and real data.

Citation

Download Citation

Peter J. Bickel. Elizaveta Levina. "Regularized estimation of large covariance matrices." Ann. Statist. 36 (1) 199 - 227, February 2008. https://doi.org/10.1214/009053607000000758