September 2023 The scalable birth–death MCMC algorithm for mixed graphical model learning with application to genomic data integration
Nanwei Wang, Hélène Massam, Xin Gao, Laurent Briollais
Author Affiliations +
Ann. Appl. Stat. 17(3): 1958-1983 (September 2023). DOI: 10.1214/22-AOAS1701

Abstract

Recent advances in biological research have seen the emergence of high-throughput technologies with numerous applications that allow the study of biological mechanisms at an unprecedented depth and scale. A large amount of genomic data is now distributed through consortia like The Cancer Genome Atlas (TCGA), where specific types of biological information on specific type of tissue or cell are available. In cancer research the challenge is now to perform integrative analyses of high-dimensional multiomic data with the goal to better understand genomic processes that correlate with cancer outcomes, for example, elucidate gene networks that discriminate a specific cancer subgroups (cancer subtyping) or discovering gene networks that overlap across different cancer types (pan-cancer studies). In this paper we propose a novel mixed graphical model approach to analyze multiomic data of different types (continuous, discrete and count) and perform model selection by extending the birth–death MCMC (BDMCMC) algorithm initially proposed by Stephens (Ann. Statist. 28 (2000) 40–74) and later developed by Mohammadi and Wit (Bayesian Anal. 10 (2015) 109–138). Using simulations, we compare the performance of our method to the LASSO method and the standard BDMCMC method and find that our method is superior in terms of both computational efficiency and the accuracy of the model selection results. Finally, an application to the TCGA breast cancer data shows that integrating genomic information at different levels (mutation and expression data) leads to better subtyping of breast cancers.

Funding Statement

The first author was supported by OICR-CANSSI Biostatistics Postdoctoral Fellowship, NBIF Grant, NSFC (12001103).
The second and third authors were supported by an NSERC Discovery Grant.
The fourth author was supported by an NSERC Discovery Grant, the CANSSI Health Science Collaborative Program, and CIHR Project Grants.

Acknowledgements

This paper is published in memoriam of Pr. Hélène Massam.

Citation

Download Citation

Nanwei Wang. Hélène Massam. Xin Gao. Laurent Briollais. "The scalable birth–death MCMC algorithm for mixed graphical model learning with application to genomic data integration." Ann. Appl. Stat. 17 (3) 1958 - 1983, September 2023. https://doi.org/10.1214/22-AOAS1701

Information

Received: 1 April 2020; Revised: 1 June 2022; Published: September 2023
First available in Project Euclid: 7 September 2023

MathSciNet: MR4637652
Digital Object Identifier: 10.1214/22-AOAS1701

Keywords: genomic integration , mixed graphical models , SBDMCMC , TCGA

Rights: Copyright © 2023 Institute of Mathematical Statistics

Vol.17 • No. 3 • September 2023
Back to Top