Annals of Applied Statistics
- Ann. Appl. Stat.
- Volume 12, Number 4 (2018), 2674-2698.
Variational inference for probabilistic Poisson PCA
Julien Chiquet, Mahendra Mariadassou, and Stéphane Robin
Abstract
Many application domains, such as ecology or genomics, have to deal with multivariate non-Gaussian observations. A typical example is the joint observation of the respective abundances of a set of species in a series of sites aiming to understand the covariations between these species. The Gaussian setting provides a canonical way to model such dependencies but does not apply in general. We consider here the multivariate exponential family framework for which we introduce a generic model with multivariate Gaussian latent variables. We show that approximate maximum likelihood inference can be achieved via a variational algorithm for which gradient descent easily applies. We show that this setting enables us to account for covariates and offsets. We then focus on the case of the Poisson-lognormal model in the context of community ecology. We demonstrate the efficiency of our algorithm on microbial ecology datasets. We illustrate the importance of accounting for the effects of covariates to better understand interactions between species.
Article information
Source
Ann. Appl. Stat., Volume 12, Number 4 (2018), 2674-2698.
Dates
Received: March 2017
Revised: February 2018
First available in Project Euclid: 13 November 2018
Permanent link to this document
https://projecteuclid.org/euclid.aoas/1542078060
Digital Object Identifier
doi:10.1214/18-AOAS1177
Mathematical Reviews number (MathSciNet)
MR3875716
Zentralblatt MATH identifier
07029470
Keywords
Probabilistic PCA Poisson-lognormal model count data variational inference
Citation
Chiquet, Julien; Mariadassou, Mahendra; Robin, Stéphane. Variational inference for probabilistic Poisson PCA. Ann. Appl. Stat. 12 (2018), no. 4, 2674--2698. doi:10.1214/18-AOAS1177. https://projecteuclid.org/euclid.aoas/1542078060

