October 2024 Improved covariance estimation: Optimal robustness and sub-Gaussian guarantees under heavy tails
Roberto I. Oliveira, Zoraida F. Rico
Author Affiliations +
Ann. Statist. 52(5): 1953-1977 (October 2024). DOI: 10.1214/24-AOS2407

Abstract

We present an estimator of the covariance matrix Σ of random d-dimensional vector from an i.i.d. sample of size n. Our sole assumption is that this vector satisfies a bounded LpL2 moment assumption over its one-dimensional marginals, for some p4. Given this, we show that Σ can be estimated from the sample with the same high-probability error rates that the sample covariance matrix achieves in the case of Gaussian data. This holds even though we allow for very general distributions that may not have moments of order > p. Moreover, our estimator can be made to be optimally robust to adversarial contamination. This result improves the recent contributions by Mendelson and Zhivotovskiy and Catoni and Giulini, and matches parallel work by Abdalla and Zhivotovskiy (the exact relationship with this last work is described in the paper).

Funding Statement

The first author was supported by a “Bolsa de Produtividade em Pesquisa” and a “Projeto Universal” (432310/2018-5) from CNPq, Brazil; and by a “Cientista do Nosso Estado” grant (E26/200.485/2023) from FAPERJ, Rio de Janeiro, Brazil.

Citation

Download Citation

Roberto I. Oliveira. Zoraida F. Rico. "Improved covariance estimation: Optimal robustness and sub-Gaussian guarantees under heavy tails." Ann. Statist. 52 (5) 1953 - 1977, October 2024. https://doi.org/10.1214/24-AOS2407

Information

Received: 1 October 2022; Revised: 1 January 2024; Published: October 2024
First available in Project Euclid: 20 November 2024

Digital Object Identifier: 10.1214/24-AOS2407

Subjects:
Primary: 62G35
Secondary: 62H12

Keywords: Covariance estimation , heavy-tailed data , robust statistics

Rights: Copyright © 2024 Institute of Mathematical Statistics

Vol.52 • No. 5 • October 2024
Back to Top