December 2022 Bayesian multivariate sparse functional principal components analysis with application to longitudinal microbiome multiomics data
Lingjing Jiang, Chris Elrod, Jane J. Kim, Austin D. Swafford, Rob Knight, Wesley K. Thompson
Author Affiliations +
Ann. Appl. Stat. 16(4): 2231-2249 (December 2022). DOI: 10.1214/21-AOAS1587

Abstract

Microbiome researchers often need to model the temporal dynamics of multiple complex, nonlinear outcome trajectories simultaneously. This motivates our development of multivariate Sparse Functional Principal Components Analysis (mSFPCA), extending existing SFPCA methods to simultaneously characterize multiple temporal trajectories and their interrelationships. As with existing SFPCA methods, the mSFPCA algorithm characterizes each trajectory as a smooth mean plus a weighted combination of the smooth major modes of variation about the mean, where the weights are given by the component scores for each subject. Unlike existing SFPCA methods, the mSFPCA algorithm allows estimation of multiple trajectories simultaneously, such that the component scores, which are constrained to be independent within a particular outcome for identifiability, may be arbitrarily correlated with component scores for other outcomes. A Cholesky decomposition is used to estimate the component score covariance matrix efficiently and guarantee positive semidefiniteness given these constraints. Mutual information is used to assess the strength of marginal and conditional temporal associations across outcome trajectories. Importantly, we implement mSFPCA as a Bayesian algorithm using R and stan, enabling easy use of packages such as PSIS-LOO for model selection and graphical posterior predictive checks to assess the validity of mSFPCA models. Although we focus on application of mSFPCA to microbiome data in this paper, the mSFPCA model is of general utility and can be used in a wide range of real-world applications.

Funding Statement

RK was supported by NIH under grant 1DP1AT010885, NIDDK under grant 1P30DK120515 and CCFA under grant 675191. WT was supported by NIH/NIMH under grants MH120025 and MH122688.

Citation

Download Citation

Lingjing Jiang. Chris Elrod. Jane J. Kim. Austin D. Swafford. Rob Knight. Wesley K. Thompson. "Bayesian multivariate sparse functional principal components analysis with application to longitudinal microbiome multiomics data." Ann. Appl. Stat. 16 (4) 2231 - 2249, December 2022. https://doi.org/10.1214/21-AOAS1587

Information

Received: 1 January 2021; Revised: 1 November 2021; Published: December 2022
First available in Project Euclid: 26 September 2022

MathSciNet: MR4489207
zbMATH: 1498.62227
Digital Object Identifier: 10.1214/21-AOAS1587

Keywords: Bayesian , Functional data analysis , longitudinal , microbiome , multiomics

Rights: Copyright © 2022 Institute of Mathematical Statistics

JOURNAL ARTICLE
19 PAGES

This article is only available to subscribers.
It is not available for individual sale.
+ SAVE TO MY LIBRARY

Vol.16 • No. 4 • December 2022
Back to Top