Open Access
2023 Covariance discriminative power of kernel clustering methods
Abla Kammoun, Romain Couillet
Author Affiliations +
Electron. J. Statist. 17(1): 291-390 (2023). DOI: 10.1214/23-EJS2107

Abstract

Let x1,,xn be independent observations of size p, each of them belonging to one of c distinct classes. We assume that observations within the class a are characterized by their distribution N(0,1pCa) where here C1,,Cc are some non-negative definite p×p matrices. This paper studies the asymptotic behavior of the symmetric matrix Φ˜kl=p(xkTxl)2δkl when p and n grow to infinity with npc0. Particularly, we prove that, if the class covariance matrices are sufficiently close in a certain sense, the matrix Φ˜ behaves like a low-rank perturbation of a Wigner matrix, presenting possibly some isolated eigenvalues outside the bulk of the semi-circular law. We carry out a careful analysis of some of the isolated eigenvalues of Φ˜ and their associated eigenvectors and illustrate how these results can help understand spectral clustering methods that use Φ˜ as a kernel matrix.

Funding Statement

The research reported in this publication was supported by funding from King Abdullah University of Science and Technology (KAUST). The work of Couillet is supported by the ANR Project RMT4GRAPH (ANR-14-CE28-0006) and the HUAWEI RMTin5G project.

Acknowledgments

The authors would like to deeply thank an anonymous reviewer for his careful reading and valuable comments, which helped us to improve the quality of the manuscript.

Citation

Download Citation

Abla Kammoun. Romain Couillet. "Covariance discriminative power of kernel clustering methods." Electron. J. Statist. 17 (1) 291 - 390, 2023. https://doi.org/10.1214/23-EJS2107

Information

Received: 1 November 2021; Published: 2023
First available in Project Euclid: 26 January 2023

MathSciNet: MR4540917
zbMATH: 1507.60015
Digital Object Identifier: 10.1214/23-EJS2107

Subjects:
Primary: 15A18‎ , 15B52
Secondary: 60F15

Keywords: clustering , Gaussian calculus , Kernel random matrices , machine learning , multivariate statistical analysis , Random matrix theory , Stieltjes transform

Vol.17 • No. 1 • 2023
Back to Top