Abstract
Inferring gene regulatory networks can elucidate how genes work cooperatively. The gene-gene collaboration information is often learned by Gaussian graphical models (GGM) that aim to identify whether the expression levels of any pair of genes are dependent, given other genes’ expression values. One basic assumption that guarantees the validity of GGM is data normality, and this often holds for bulk-level expression data which aggregate biological signals from a collection of cells. However, fine-grained cell-level expression profiles collected in single-cell RNA-sequencing (scRNA-seq) reveal nonnormality features—cellular heterogeneity and zero inflation. We propose a Bayesian latent mixture GGM to jointly estimate multiple gene regulatory networks accounting for the zero inflation and unknown heterogeneity of single-cell expression data. The proposed approach outperforms competing methods on synthetic data in terms of network structure and precision matrix estimation accuracy and provides biological insights when applied to two real-world scRNA-seq datasets. An R package implementing the proposed model is available on GitHub https://github.com/WgitU/BLGGM.
Funding Statement
Xiangyu Luo was supported in part by National Natural Science Foundation of China (11901572), the Fundamental Research Funds for the Central Universities, the Research Funds of Renmin University of China (19XNLG08) and the fund for building world-class universities (disciplines) of Renmin University of China.
Acknowledgments
We are grateful to the Editor and three reviewers for their constructive and invaluable comments which have greatly improved the quality of the paper. We thank the High-performance Computing Platform of Renmin University of China for providing computing resources.
Citation
Qiuyu Wu. Xiangyu Luo. "Estimating heterogeneous gene regulatory networks from zero-inflated single-cell expression data." Ann. Appl. Stat. 16 (4) 2183 - 2200, December 2022. https://doi.org/10.1214/21-AOAS1582
Information