Translator Disclaimer
April 2022 Motif estimation via subgraph sampling: The fourth-moment phenomenon
Bhaswar B. Bhattacharya, Sayan Das, Sumit Mukherjee
Author Affiliations +
Ann. Statist. 50(2): 987-1011 (April 2022). DOI: 10.1214/21-AOS2134

Abstract

Network sampling is an indispensable tool for understanding features of large complex networks where it is practically impossible to search over the entire graph. In this paper, we develop a framework for statistical inference for counting network motifs, such as edges, triangles and wedges, in the widely used subgraph sampling model, where each vertex is sampled independently, and the subgraph induced by the sampled vertices is observed. We derive necessary and sufficient conditions for the consistency and the asymptotic normality of the natural Horvitz–Thompson (HT) estimator, which can be used for constructing confidence intervals and hypothesis testing for the motif counts based on the sampled graph. In particular, we show that the asymptotic normality of the HT estimator exhibits an interesting fourth-moment phenomenon, which asserts that the HT estimator (appropriately centered and rescaled) converges in distribution to the standard normal whenever its fourth-moment converges to 3 (the fourth-moment of the standard normal distribution). As a consequence, we derive the exact thresholds for consistency and asymptotic normality of the HT estimator in various natural graph ensembles, such as sparse graphs with bounded degree, Erdős–Rényi random graphs, random regular graphs and dense graphons.

Funding Statement

The first author was supported by NSF CAREER Grant DMS-2046393 and a Sloan Research Fellowship.
The third author was supported by NSF Grant DMS-1712037.

Acknowledgments

The authors thank Sohom Bhattacharya for pointing out [35], and Jason Klusowski for helpful discussions. The authors also thank the Associate Editor and the anonymous referees for their detailed and thoughtful comments, which greatly improved the quality and the presentation of the paper.

Citation

Download Citation

Bhaswar B. Bhattacharya. Sayan Das. Sumit Mukherjee. "Motif estimation via subgraph sampling: The fourth-moment phenomenon." Ann. Statist. 50 (2) 987 - 1011, April 2022. https://doi.org/10.1214/21-AOS2134

Information

Received: 1 November 2020; Revised: 1 August 2021; Published: April 2022
First available in Project Euclid: 7 April 2022

Digital Object Identifier: 10.1214/21-AOS2134

Subjects:
Primary: 05C30 , 62G05 , 62G20

Keywords: asymptotic inference , fourth moment phenomenon , Motif counting , network analysis , Random graphs , Stein’s method

Rights: Copyright © 2022 Institute of Mathematical Statistics

JOURNAL ARTICLE
25 PAGES

This article is only available to subscribers.
It is not available for individual sale.
+ SAVE TO MY LIBRARY

SHARE
Vol.50 • No. 2 • April 2022
Back to Top