Open Access
October 2017 Spectrum estimation from samples
Weihao Kong, Gregory Valiant
Ann. Statist. 45(5): 2218-2247 (October 2017). DOI: 10.1214/16-AOS1525

Abstract

We consider the problem of approximating the set of eigenvalues of the covariance matrix of a multivariate distribution (equivalently, the problem of approximating the “population spectrum”), given access to samples drawn from the distribution. We consider this recovery problem in the regime where the sample size is comparable to, or even sublinear in the dimensionality of the distribution. First, we propose a theoretically optimal and computationally efficient algorithm for recovering the moments of the eigenvalues of the population covariance matrix. We then leverage this accurate moment recovery, via a Wasserstein distance argument, to accurately reconstruct the vector of eigenvalues. Together, this yields an eigenvalue reconstruction algorithm that is asymptotically consistent as the dimensionality of the distribution and sample size tend toward infinity, even in the sublinear sample regime where the ratio of the sample size to the dimensionality tends to zero. In addition to our theoretical results, we show that our approach performs well in practice for a broad range of distributions and sample sizes.

Citation

Download Citation

Weihao Kong. Gregory Valiant. "Spectrum estimation from samples." Ann. Statist. 45 (5) 2218 - 2247, October 2017. https://doi.org/10.1214/16-AOS1525

Information

Received: 1 February 2016; Revised: 1 October 2016; Published: October 2017
First available in Project Euclid: 31 October 2017

zbMATH: 06821124
MathSciNet: MR3718167
Digital Object Identifier: 10.1214/16-AOS1525

Subjects:
Primary: 62H10 , 62H12

Keywords: eigenvalues of covariance matrices , high-dimensional inference , method of moments , Random matrix theory , Spectrum estimation , sublinear sample size

Rights: Copyright © 2017 Institute of Mathematical Statistics

Vol.45 • No. 5 • October 2017
Back to Top