Abstract
Understanding cell type composition and gene expression of spatial transcriptomic data is crucial for comprehending phenotypic variability and detecting key factors that influence disease susceptibility of complex traits. Detecting cell type specific expression patterns from spatial transcriptome profiles is important in studying the cellular components and gene expression of individual cell classes and structural architecture. In this paper we develop mixed effect multiplicative-additive Poisson-gamma models to analyze spatial (MAPS) transcriptomics data using cell type-specific gene expressions in single cell RNA-sequenceing (scRNA-seq) data. To build the mixed effect multiplicative-additive Poisson-gamma models, the gene expression counts of spatial transcriptomics data are treated as dependent variables, and the mean and variance parameters of scRNA-seq data are used to construct independent variables to explain the dependent variables on the basis of Poisson-gamma mixture. One novelty of the proposed mixed models is that the variance parameters of scRNA-seq are used to describe the within-cell-type variations or stochasticity. We develop iteratively analytical formulae to estimate the cell type proportions and dispersion parameters. To address the important research problems and help with intensive spatial transcriptomics data analysis, a readily available software, MAPS, is developed to implement the proposed methods. By simulation study and real data analysis, MAPS is found to perform better than or similar to robust cell type decomposition (RCTD), SpatialDWLS (dampened weighted least squares), conditional autoregressive-based deconvolution (CARD), and a Spatially weighted pOissoN-gAmma Regression model (SONAR). Computationally, MAPS is significantly faster than RCTD and SpatialDWLS. MAPS provides a novel way for mapping spatial tissue architecture.
Acknowledgments
This study was supported by the Intramural Research Program of the National Human Genome Research Institute, National Institutes of Health (NIH), Bethesda, Maryland, under the supervision of Dr. Baxevanis and Dr. Bailey-Wilson (Ruzong Fan). This study utilizes the high-performance computational capabilities of the Biowulf Linux cluster at the National Institutes of Health, Bethesda, Maryland (http://biowulf.nih.gov). The authors acknowledge the Texas Advanced Computing Center (TACC) at The University of Texas at Austin for providing high-performance computational capabilities, visualization, database, or grid resources that have contributed to the research results reported within this paper. URL: http://www.tacc.utexas.edu
RF conceived the study. YL, CA, JEBW, and RF designed the study and wrote the manuscript. YL performed simulation studies and analyzeed the data. CA and JEBW helped interpret results and edited the manuscript.
All authors declare no conflicts of interest in this paper.
Data sharing is not applicable to this article as no new data were created or analyzed in this study.
The methods proposed in this paper are implemented in the statistical package R named MAPS. The R codes for data analysis and simulations are available from the web https://sites.google.com/a/georgetown.edu/ruzong-fan/software.
Citation
Yutong Luo. Joan E. Bailey-Wilson. Christopher Albanese. Ruzong Fan. "Deconvolution analysis of spatial transcriptomics by multiplicative-additive Poisson-gamma models." Ann. Appl. Stat. 18 (4) 3570 - 3595, December 2024. https://doi.org/10.1214/24-AOAS1953
Information