Open Access
February 2023 ScreeNOT: Exact MSE-optimal singular value thresholding in correlated noise
David Donoho, Matan Gavish, Elad Romanov
Author Affiliations +
Ann. Statist. 51(1): 122-148 (February 2023). DOI: 10.1214/22-AOS2232

Abstract

We derive a formula for optimal hard thresholding of the singular value decomposition in the presence of correlated additive noise; although it nominally involves unobservables, we show how to apply it even where the noise covariance structure is not a priori known or is not independently estimable. The proposed method, which we call ScreeNOT, is a mathematically solid alternative to Cattell’s ever-popular but vague scree plot heuristic from 1966. ScreeNOT has a surprising oracle property: it typically achieves exactly, in large finite samples, the lowest possible MSE for matrix recovery, on each given problem instance, that is, the specific threshold it selects gives exactly the smallest achievable MSE loss among all possible threshold choices for that noisy data set and that unknown underlying true low rank model. The method is computationally efficient and robust against perturbations of the underlying covariance structure. Our results depend on the assumption that the singular values of the noise have a limiting empirical distribution of compact support; this property, which is standard in random matrix theory, is satisfied by many models exhibiting either cross-row correlation structure or cross-column correlation structure, and also by many situations with more general, interelement correlation structure. Simulations demonstrate the effectiveness of the method even at moderate matrix sizes. The paper is supplemented by ready-to-use software packages implementing the proposed algorithm: package ScreeNOT in Python (via PyPI) and R (via CRAN).

Funding Statement

DD was supported in part by NSF Grants DMS-1407813, 1418362 and 1811614. MG was supported in part by Israel Science Foundation grant 1523/16 and 871/22. This work was made possible by United States–Israel Binational Science Foundation (BSF) Grant 2016201 “Frontiers of Matrix Recovery.” ER was affiliated with the School of Computer Science and Engineering, the Hebrew University of Jerusalem, and supported in part by Israel Science Foundation grant 1523/16 and an Einstein–Kaye Fellowship from the Hebrew University of Jerusalem.

Acknowledgments

We are grateful to the anonymous reviewers for their thoughtful comments, which have helped improve this manuscript considerably.

Citation

Download Citation

David Donoho. Matan Gavish. Elad Romanov. "ScreeNOT: Exact MSE-optimal singular value thresholding in correlated noise." Ann. Statist. 51 (1) 122 - 148, February 2023. https://doi.org/10.1214/22-AOS2232

Information

Received: 1 May 2021; Revised: 1 July 2022; Published: February 2023
First available in Project Euclid: 23 March 2023

MathSciNet: MR4564851
zbMATH: 07684007
Digital Object Identifier: 10.1214/22-AOS2232

Subjects:
Primary: 62C20 , 62H25
Secondary: 90C22 , 90C25

Keywords: high-dimensional asymptotics , low-rank matrix denoising , optimal threshold , scree plot , singular value thresholding

Rights: Copyright © 2023 Institute of Mathematical Statistics

Vol.51 • No. 1 • February 2023
Back to Top