The Annals of Applied Statistics
- Ann. Appl. Stat.
- Volume 12, Number 4 (2018), 2674-2698.
Variational inference for probabilistic Poisson PCA
Many application domains, such as ecology or genomics, have to deal with multivariate non-Gaussian observations. A typical example is the joint observation of the respective abundances of a set of species in a series of sites aiming to understand the covariations between these species. The Gaussian setting provides a canonical way to model such dependencies but does not apply in general. We consider here the multivariate exponential family framework for which we introduce a generic model with multivariate Gaussian latent variables. We show that approximate maximum likelihood inference can be achieved via a variational algorithm for which gradient descent easily applies. We show that this setting enables us to account for covariates and offsets. We then focus on the case of the Poisson-lognormal model in the context of community ecology. We demonstrate the efficiency of our algorithm on microbial ecology datasets. We illustrate the importance of accounting for the effects of covariates to better understand interactions between species.
Ann. Appl. Stat., Volume 12, Number 4 (2018), 2674-2698.
Received: March 2017
Revised: February 2018
First available in Project Euclid: 13 November 2018
Permanent link to this document
Digital Object Identifier
Mathematical Reviews number (MathSciNet)
Chiquet, Julien; Mariadassou, Mahendra; Robin, Stéphane. Variational inference for probabilistic Poisson PCA. Ann. Appl. Stat. 12 (2018), no. 4, 2674--2698. doi:10.1214/18-AOAS1177. https://projecteuclid.org/euclid.aoas/1542078060