Vertex nomination schemes for membership prediction

D. E. Fishkind; V. Lyzinski; H. Pao; L. Chen; C. E. Priebe

doi:10.1214/15-AOAS834

September 2015 Vertex nomination schemes for membership prediction

D. E. Fishkind, V. Lyzinski, H. Pao, L. Chen, C. E. Priebe

Ann. Appl. Stat. 9(3): 1510-1532 (September 2015). DOI: 10.1214/15-AOAS834

Abstract

Suppose that a graph is realized from a stochastic block model where one of the blocks is of interest, but many or all of the vertices’ block labels are unobserved. The task is to order the vertices with unobserved block labels into a “nomination list” such that, with high probability, vertices from the interesting block are concentrated near the list’s beginning. We propose several vertex nomination schemes. Our basic—but principled—setting and development yields a best nomination scheme (which is a Bayes–Optimal analogue), and also a likelihood maximization nomination scheme that is practical to implement when there are a thousand vertices, and which is empirically near-optimal when the number of vertices is small enough to allow comparison to the best nomination scheme. We then illustrate the robustness of the likelihood maximization nomination scheme to the modeling challenges inherent in real data, using examples which include a social network involving human trafficking, the Enron Graph, a worm brain connectome and a political blog network.

Citation

Download Citation

D. E. Fishkind. V. Lyzinski. H. Pao. L. Chen. C. E. Priebe. "Vertex nomination schemes for membership prediction." Ann. Appl. Stat. 9 (3) 1510 - 1532, September 2015. https://doi.org/10.1214/15-AOAS834