December 2021 Optimal adaptivity of signed-polygon statistics for network testing
Jiashun Jin, Zheng Tracy Ke, Shengming Luo
Author Affiliations +
Ann. Statist. 49(6): 3408-3433 (December 2021). DOI: 10.1214/21-AOS2089

Abstract

Given a symmetric social network, we are interested in testing whether it has only one community or multiple communities. The desired tests should (a) accommodate severe degree heterogeneity, (b) accommodate mixed memberships, (c) have a tractable null distribution and (d) adapt automatically to different levels of sparsity, and achieve the optimal phase diagram. How to find such a test is a challenging problem.

We propose the Signed Polygon as a class of new tests. Fixing m3, for each m-gon in the network, define a score using the centered adjacency matrix. The sum of such scores is then the mth order Signed Polygon statistic. The Signed Triangle (SgnT) and the Signed Quadrilateral (SgnQ) are special examples of the Signed Polygon.

We show that both the SgnT and SgnQ tests satisfy (a)–(d), and especially, they work well for both very sparse and less sparse networks. Our proposed tests compare favorably with existing tests. For example, the EZ and GC tests behave unsatisfactorily in the less sparse case and do not achieve the optimal phase diagram. Also, many existing tests do not allow for severe heterogeneity or mixed memberships, and they behave unsatisfactorily in our settings.

The analysis of the SgnT and SgnQ tests is delicate and extremely tedious, and the main reason is that we need a unified proof that covers a wide range of sparsity levels and a wide range of degree heterogeneity. For lower bound theory, we use a phase transition framework, which includes the standard minimax argument, but is more informative. The proof uses classical theorems on matrix scaling.

Funding Statement

JJ and SL are supported in part by NSF Grant DMS-2015469. ZK is supported in part by NSF Grant DMS-1925845 and NSF CAREER Grant DMS-1943902.

Acknowledgments

The authors would like to thank the anonymous Associate Editor and referees for helpful comments. ZK would like to thank Sebastien Bubeck, Fang Han, and Elchanan Mossel for helpful pointers and comments.

Funding Statement

JJ and SL are supported in part by NSF Grant DMS-2015469. ZK is supported in part by NSF Grant DMS-1925845 and NSF CAREER Grant DMS-1943902.

Acknowledgments

The authors would like to thank the anonymous Associate Editor and referees for helpful comments. ZK would like to thank Sebastien Bubeck, Fang Han, and Elchanan Mossel for helpful pointers and comments.

Citation

Download Citation

Jiashun Jin. Zheng Tracy Ke. Shengming Luo. "Optimal adaptivity of signed-polygon statistics for network testing." Ann. Statist. 49 (6) 3408 - 3433, December 2021. https://doi.org/10.1214/21-AOS2089

Information

Received: 1 May 2019; Revised: 1 May 2021; Published: December 2021
First available in Project Euclid: 14 December 2021

MathSciNet: MR4352535
zbMATH: 1486.62168
Digital Object Identifier: 10.1214/21-AOS2089

Subjects:
Primary: 62H30 , 91C20
Secondary: 62P25

Keywords: asymptotic normality , DCBM , DCMM , lower bound , matrix scaling , optimal phase diagram , phase transition , SBM , signed quadrilateral , signed triangle , Sinkhorn’s theorem , Sparsity

Rights: Copyright © 2021 Institute of Mathematical Statistics

JOURNAL ARTICLE
26 PAGES

This article is only available to subscribers.
It is not available for individual sale.
+ SAVE TO MY LIBRARY

Vol.49 • No. 6 • December 2021
Back to Top