Abstract
The singular subspaces perturbation theory is of fundamental importance in probability and statistics. It has various applications across different fields. We consider two arbitrary matrices where one is a leave-one-column-out submatrix of the other one and establish a novel perturbation upper bound for the distance between the two corresponding singular subspaces. It is well suited for mixture models and results in a sharper and finer statistical analysis than classical perturbation bounds such as Wedin’s theorem. Empowered by this leave-one-out perturbation theory, we provide a deterministic entrywise analysis for the performance of spectral clustering under mixture models. Our analysis leads to an explicit exponential error rate for spectral clustering of sub-Gaussian mixture models. For the mixture of isotropic Gaussians, the rate is optimal under a weaker signal-to-noise condition than that of Löffler et al. (2021).
Funding Statement
The first author was supported in part by NSF Grant DMS-2112988.
The second author was supported in part by NSF Grant DMS-2112918.
Acknowledgments
The authors are grateful to an anonymous Associate Editor and anonymous referees for careful reading of the manuscript and their valuable remarks and suggestions.
Citation
Anderson Y. Zhang. Harrison Y. Zhou. "Leave-one-out singular subspace perturbation analysis for spectral clustering." Ann. Statist. 52 (5) 2004 - 2033, October 2024. https://doi.org/10.1214/24-AOS2418
Information