February 2022 Heteroskedastic PCA: Algorithm, optimality, and applications
Anru R. Zhang, T. Tony Cai, Yihong Wu
Author Affiliations +
Ann. Statist. 50(1): 53-80 (February 2022). DOI: 10.1214/21-AOS2074

Abstract

A general framework for principal component analysis (PCA) in the presence of heteroskedastic noise is introduced. We propose an algorithm called HeteroPCA, which involves iteratively imputing the diagonal entries of the sample covariance matrix to remove estimation bias due to heteroskedasticity. This procedure is computationally efficient and provably optimal under the generalized spiked covariance model. A key technical step is a deterministic robust perturbation analysis on singular subspaces, which can be of independent interest. The effectiveness of the proposed algorithm is demonstrated in a suite of problems in high-dimensional statistics, including singular value decomposition (SVD) under heteroskedastic noise, Poisson PCA, and SVD for heteroskedastic and incomplete data.

Funding Statement

The research of Anru Zhang was supported in part by NSF CAREER award DMS-1944904, NSF Grant DMS-1811868, and NIH Grant R01-GM131399-01.
The research of Tony Cai was supported in part by NSF Grants DMS-1712735 and DMS-2015259 and NIH Grants R01-GM129781 and R01-GM123056.
The research of Yihong Wu was supported in part by the NSF Grant CCF-1527105, an NSF CAREER award CCF-1651588, and an Alfred Sloan fellowship.

Citation

Download Citation

Anru R. Zhang. T. Tony Cai. Yihong Wu. "Heteroskedastic PCA: Algorithm, optimality, and applications." Ann. Statist. 50 (1) 53 - 80, February 2022. https://doi.org/10.1214/21-AOS2074

Information

Received: 1 October 2018; Revised: 1 April 2021; Published: February 2022
First available in Project Euclid: 16 February 2022

MathSciNet: MR4382008
zbMATH: 1486.62183
Digital Object Identifier: 10.1214/21-AOS2074

Subjects:
Primary: 62H12 , 62H25
Secondary: 62C20

Keywords: Factor analysis model , heteroskedasticity , perturbation bound , Principal Component Analysis , Singular value decomposition

Rights: Copyright © 2022 Institute of Mathematical Statistics

Vol.50 • No. 1 • February 2022
Back to Top