The Annals of Applied Statistics
- Ann. Appl. Stat.
- Volume 5, Number 1 (2011), 309-336.
Overlapping stochastic block models with application to the French political blogosphere
Complex systems in nature and in society are often represented as networks, describing the rich set of interactions between objects of interest. Many deterministic and probabilistic clustering methods have been developed to analyze such structures. Given a network, almost all of them partition the vertices into disjoint clusters, according to their connection profile. However, recent studies have shown that these techniques were too restrictive and that most of the existing networks contained overlapping clusters. To tackle this issue, we present in this paper the Overlapping Stochastic Block Model. Our approach allows the vertices to belong to multiple clusters, and, to some extent, generalizes the well-known Stochastic Block Model [Nowicki and Snijders (2001)]. We show that the model is generically identifiable within classes of equivalence and we propose an approximate inference procedure, based on global and local variational techniques. Using toy data sets as well as the French Political Blogosphere network and the transcriptional network of Saccharomyces cerevisiae, we compare our work with other approaches.
Ann. Appl. Stat. Volume 5, Number 1 (2011), 309-336.
First available in Project Euclid: 21 March 2011
Permanent link to this document
Digital Object Identifier
Mathematical Reviews number (MathSciNet)
Zentralblatt MATH identifier
Latouche, Pierre; Birmelé, Etienne; Ambroise, Christophe. Overlapping stochastic block models with application to the French political blogosphere. Ann. Appl. Stat. 5 (2011), no. 1, 309--336. doi:10.1214/10-AOAS382. http://projecteuclid.org/euclid.aoas/1300715192.
- Appendix. Describe how global and local variational techniques can be used to obtain a tractable lower bound. Introduce the optimization equations for the inference procedure.