The Discrete Infinite Logistic Normal Distribution

John Paisley; Chong Wang; David M. Blei

doi:10.1214/12-BA734

December 2012 The Discrete Infinite Logistic Normal Distribution

John Paisley, Chong Wang, David M. Blei

Bayesian Anal. 7(4): 997-1034 (December 2012). DOI: 10.1214/12-BA734

Abstract

We present the discrete infinite logistic normal distribution (DILN), a Bayesian nonparametric prior for mixed membership models. DILN generalizes the hierarchical Dirichlet process (HDP) to model correlation structure between the weights of the atoms at the group level. We derive a representation of DILN as a normalized collection of gamma-distributed random variables and study its statistical properties. We derive a variational inference algorithm for approximate posterior inference. We apply DILN to topic modeling of documents and study its empirical performance on four corpora, comparing performance with the HDP and the correlated topic model (CTM). To compute with large-scale data, we develop a stochastic variational inference algorithm for DILN and compare with similar algorithms for HDP and latent Dirichlet allocation (LDA) on a collection of 350,000 articles from Nature.

Citation

Download Citation

John Paisley. Chong Wang. David M. Blei. "The Discrete Infinite Logistic Normal Distribution." Bayesian Anal. 7 (4) 997 - 1034, December 2012. https://doi.org/10.1214/12-BA734