Open Access
2024 Strong consistency guarantees for clustering high-dimensional bipartite graphs with the spectral method
Guillaume Braun
Author Affiliations +
Electron. J. Statist. 18(2): 2798-2823 (2024). DOI: 10.1214/24-EJS2271

Abstract

We investigate the problem of clustering bipartite graphs using a simple spectral method within the framework of the Bipartite Stochastic Block Model (BiSBM), a popular model for bipartite graphs having a community structure. Our focus lies in the high-dimensional setting where the number n1 of rows, and n2 of columns, of the associated adjacency matrix differ significantly. A recent study by [4] has established a sufficient and necessary condition related to the sparsity level pmax of the bipartite graph, enabling the recovery of the latent partition of the rows. In their work, [4] introduces an iterative method that extends the approach proposed by [26] to achieve the stated recovery goal. However, empirical results suggest that the subsequent refinement algorithm does not significantly enhance the performance of the spectral method, indicating that the spectral method achieves exact recovery within the same regime as the refinement method. We establish this claim by deriving new entrywise bounds on the eigenvectors of the similarity matrix utilized by the spectral method. Our analysis extends the framework of [23], which is limited to symmetric matrices with restricted dependencies. As a critical technical step, we also derive an improved concentration inequality tailored for similarity matrices.

Acknowledgments

The work leading to the preliminary version of this manuscript was carried out while G.B. was a PhD student at Inria Lille in the MODAL team. We are grateful to Hemant Tyagi for providing valuable feedback on the preliminary version of the manuscript. Special appreciation goes to Yizhe Zhu for meticulously identifying inaccuracies within the proofs presented in the preceding version of the manuscript, contributing significantly to its improvement. We would also like to express our sincere gratitude to the anonymous reviewers whose insightful comments and constructive feedback helped in refining the manuscript.

Citation

Download Citation

Guillaume Braun. "Strong consistency guarantees for clustering high-dimensional bipartite graphs with the spectral method." Electron. J. Statist. 18 (2) 2798 - 2823, 2024. https://doi.org/10.1214/24-EJS2271

Information

Received: 1 April 2023; Published: 2024
First available in Project Euclid: 12 July 2024

Digital Object Identifier: 10.1214/24-EJS2271

Subjects:
Primary: 62H30

Keywords: bipartite stochastic block model , clustering , Spectral method

Vol.18 • No. 2 • 2024
Back to Top