Abstract
We present an estimator of the covariance matrix Σ of random d-dimensional vector from an i.i.d. sample of size n. Our sole assumption is that this vector satisfies a bounded moment assumption over its one-dimensional marginals, for some . Given this, we show that Σ can be estimated from the sample with the same high-probability error rates that the sample covariance matrix achieves in the case of Gaussian data. This holds even though we allow for very general distributions that may not have moments of order > p. Moreover, our estimator can be made to be optimally robust to adversarial contamination. This result improves the recent contributions by Mendelson and Zhivotovskiy and Catoni and Giulini, and matches parallel work by Abdalla and Zhivotovskiy (the exact relationship with this last work is described in the paper).
Funding Statement
The first author was supported by a “Bolsa de Produtividade em Pesquisa” and a “Projeto Universal” (432310/2018-5) from CNPq, Brazil; and by a “Cientista do Nosso Estado” grant (E26/200.485/2023) from FAPERJ, Rio de Janeiro, Brazil.
Citation
Roberto I. Oliveira. Zoraida F. Rico. "Improved covariance estimation: Optimal robustness and sub-Gaussian guarantees under heavy tails." Ann. Statist. 52 (5) 1953 - 1977, October 2024. https://doi.org/10.1214/24-AOS2407
Information