Bayesian Analysis

Bayesian Graphical Models for Differential Pathways

Riten Mitra, Peter Müller, and Yuan Ji

Full-text: Open access

Abstract

Graphical models can be used to characterize the dependence structure for a set of random variables. In some applications, the form of dependence varies across different subgroups. This situation arises, for example, when protein activation on a certain pathway is recorded, and a subgroup of patients is characterized by a pathological disruption of that pathway. A similar situation arises when one subgroup of patients is treated with a drug that targets that same pathway. In both cases, understanding changes in the joint distribution and dependence structure across the two subgroups is key to the desired inference. Fitting a single model to the entire data set could mask the differences. Separate independent analyses, on the other hand, could reduce the effective sample size and ignore the common features. In this paper, we develop a Bayesian graphical model that addresses heterogeneity and implements borrowing of strength across the two subgroups by simultaneously centering the prior towards a global network. The key feature is a hierarchical prior for graphs that borrows strength across edges, resulting in a comparison of pathways across subpopulations (differential pathways) under a unified model-based framework. We apply the proposed model to data sets from two very different studies: histone modifications from ChIP-seq experiments, and protein measurements based on tissue microarrays.

Article information

Source
Bayesian Anal., Volume 11, Number 1 (2016), 99-124.

Dates
First available in Project Euclid: 13 February 2015

Permanent link to this document
https://projecteuclid.org/euclid.ba/1423839169

Digital Object Identifier
doi:10.1214/14-BA931

Mathematical Reviews number (MathSciNet)
MR3447093

Zentralblatt MATH identifier
1359.62282

Keywords
autologistic regression; histone modifications; Markov random fields; networks; reverse phase protein arrays

Citation

Mitra, Riten; Müller, Peter; Ji, Yuan. Bayesian Graphical Models for Differential Pathways. Bayesian Anal. 11 (2016), no. 1, 99–124. doi:10.1214/14-BA931. https://projecteuclid.org/euclid.ba/1423839169


References

  • Atay-Kayis, A. and Massam, H. (2005). “A Monte Carlo method for computing the marginal likelihood in nondecomposable Gaussian graphical models.” Biometrika, 92(2): 317–335.
  • Atchade, Y., Lartillot, N., and Robert, C. (2008). “Bayesian computation for statistical models with intractable normalizing constants.” Technical report, University of Michigan, Department of Statistics.
  • Barski, A., Cuddapah, S., Cui, K., Roh, T. Y., Schones, D. E., Wang, Z., Wei, G., Chepelev, I., and Zhao, K. (2007). “High-resolution profiling of histone methylations in the human genome.” Cell, 129: 823–837.
  • Besag, J. (1974). “Spatial Interaction and the Statistical Analysis of Lattice Systems.” Journal of the Royal Statistical Society, Series B, 36(2): 192–236.
  • Carvalho, C. and Scott, J. (2009). “Objective Bayesian model selection in Gaussian graphical models.” Biometrika, 96(3): 497–512.
  • Carvalho, C. M., Massam, H., and West, M. (2007). “Simulation of Hyper-inverse Wishart Distributions in Graphical Models.” Biometrika, 94(3): 647–659.
  • Chen, M.-H. and Shao, Q.-M. (1997). “On Monte Carlo methods for estimating ratios of normalizing constants.” The Annals of Statistics, 25: 1563–1594.
  • Chen, M.-H., Shao, Q.-M., and Ibrahim, J. (2000). Monte Carlo Methods in Bayesian Computation. Springer-Verlag, New York.
  • Chen, S., Witten, D., and Shojaie, A. (2013). “Selection and Estimation for Mixed Graphical Models.” arXiv:1311.0085.
  • Chiquet, J., Grandvalet, Y., and Ambroise, C. (2011). “Inferring multiple graphical structures.” Statistics and Computing, 21(4): 537–553.
  • Danaher, P., Wang, P., and Witten, D. M. (2013). “The joint graphical lasso for inverse covariance estimation across multiple classes.” Journal of the Royal Statistical Society: Series B (Statistical Methodology), 76(2): 373–397.
  • Dobra, A., Hans, C., Jones, B., Nevins, J. R., and West, M. (2004). “Sparse graphical models for exploring gene expression data.” Journal of Multivariate Analysis, 90: 196–212.
  • Dobra, A. and Lenkoski, A. (2011). “Copula Gaussian Graphical Models and Their Application to Modeling Functional Disability Data.” The Annals of Applied Statistics, 5(2A): 969–993.
  • Dobra, A., Lenkoski, A., and Rodriguez, A. (2011). “Bayesian inference for general Gaussian graphical models with application to multivariate lattice data.” Journal of the American Statistical Association, 106(496): 1418–1433.
  • Giudici, P. and Green, P. (1999). “Decomposable graphical Gaussian model determination.” Biometrika, 86(4): 785–801.
  • Green, P. J. (1995). “Reversible jump Markov chain Monte Carlo computation and Bayesian model determination.” Biometrika, 82(4): 711–732.
  • Guo, J., Levina, E., Michailidis, G., and Zhu, J. (2011). “Joint estimation of multiple graphical models.” Biometrika, 98: 1–15.
  • Hara, S. and Washio, T. (2013). “Learning a common substructure of multiple graphical Gaussian models.” Neural Networks, 38: 23–38.
  • Jones, B., Carvalho, C., Dobra, A., Hans, C., Carter, C., and West, M. (2005). “Experiments in Stochastic Computation for High-Dimensional Graphical Models.” Statistical Science, 20: 388–400.
  • Kurdistani, S. (2007). “Histone modifications as markers of cancer prognosis: a cellular view.” British Journal of Cancer, 97: 1–5.
  • Kurdistani, S. (2011). “Histone modifications in cancer biology and prognosis.” Epigenetics and Disease, 67: 91–106.
  • Lauritzen, S. L. and Sheehan, N. A. (2003). “Graphical Models for Genetic Analyses.” Statistical Science, 18(4): 489–514.
  • Lenkoski, A. and Dobra, A. (2011). “Computational aspects related to inference in Gaussian graphical models with the G-Wishart prior.” Journal of Computational and Graphical Statistics, 20(1): 140–157.
  • Liu, C. L., Kaplan, T., Kim, M., Buratowski, S., Schreiber, S. L., Friedman, N., and Rando, O. J. (2005). “Single-nucleosome mapping of histone modifications in S. cerevisiae.” PLoS Biology, 3: e328.
  • Lloyd, C. J. (1998). “Using smoothed receiver operating characteristic curves to summarize and compare diagnostic systems.” Journal of the American Statistical Association, 93(444): 1356–1364.
  • Meinshausen, N. and Bühlmann, P. (2006). “High-dimensional graphs and variable selection with the lasso.” The Annals of Statistics, 34(3): 1436–1462.
  • Mitra, R., Müller, P., Liang, S., Yue, L., and Ji, Y. (2013). “A Bayesian Graphical Model for ChIP-Seq Data on Histone Modifications.” Journal of the American Statistical Association, 108: 69–90.
  • Mitsakakis, N., Massam, H., and Escobar, M. D. (2011). “A Metropolis-Hastings Based Method for Sampling from the ${G}$-Wishart Distribution in Gaussian Graphical Models.” Electronic Journal of Statistics, 5: 18–30.
  • Moeller, J., Pettitt, A. N., Berthelsen, K. K., and Reeves, R. W. (2006). “An efficient Markov chain Monte Carlo method for distributions with intractable normalising constants.” Biometrika, 93(2): 451–458.
  • Mohan, K., Chung, M. J.-Y., Han, S., Witten, D. M., Lee, S.-I., and Fazel, M. (2012). “Structured Learning of Gaussian Graphical Models.” In: NIPS, 629–637.
  • Mohan, K., London, P., Fazel, M., Lee, S.-I., and Witten, D. (2013). “Node-based learning of multiple Gaussian graphical models.” arXiv:1303.5145.
  • Müller, P., Parmigiani, G., and Rice, K. (2007). “FDR and Bayesian Multiple Comparisons Rules.” In: Bayesian Statistics 8. Oxford University Press.
  • Peterson, C., Stingo, F., and Vannucci, M. (2014). “Bayesian Inference of Multiple Gaussian Graphical Models.” Journal of the American Statistical Association.
  • Piccioni, M. (2000). “Independence structure of natural conjugate densities to exponential families and the Gibbs’ sampler.” Scandinavian Journal of Statistics, 27(1): 111–127.
  • Ravikumar, P., Wainwright, M. J., and Lafferty, J. D. (2010). “High-dimensional Ising model selection using $\ell_1$-regularized logistic regression.” The Annals of Statistics, 38(3): 1287–1319.
  • Rodriguez, A., Lenkoski, A., and Dobra, A. (2011). “Sparse covariance estimation in heterogeneous samples.” Electronic Journal of Statistics, 5: 981–1014.
  • Scott, J. and Carvalho, C. (2008). “Feature-inclusion stochastic search for Gaussian graphical models.” Journal of Computational and Graphical Statistics, 17(4): 790–808.
  • Scott, J. G. and Berger, J. O. (2006). “An exploration of aspects of Bayesian multiple testing.” Journal of Statistical Planning and Inference, 136(7): 2144–2162.
  • Scott, J. G. and Berger, J. O. (2010). “Bayes and empirical-Bayes multiplicity adjustment in the variable-selection problem.” The Annals of Statistics, 38(5): 2587–2619.
  • Stingo, F. C., Chen, Y. A., Vannucci, M., Barrier, M., and Mirkes, P. E. (2010). “A Bayesian graphical modeling approach to microRNA regulatory network inference.” The Annals of Applied Statistics, 4(4): 2024–2048.
  • Strahl, B. D. and Allis, C. D. (2000). “The language of covalent histone modifications.” Nature, 403: 41–45.
  • Wang, H. (2012). “Bayesian graphical lasso models and efficient posterior computation.” Bayesian Analysis, 7(4): 867–886.
  • Wang, H. and Carvalho, C. M. (2010). “Simulation of hyper-inverse Wishart distributions for non-decomposable graphs.” Electronic Journal of Statistics, 4: 1470–1475.
  • Wang, H. and Li, S. Z. (2012). “Efficient Gaussian graphical model determination under G-Wishart prior distributions.” Electronic Journal of Statistics, 6: 168–198.
  • Wang, Z., Zang, C., Rosenfeld, J. A., Schones, D. E., Barski, A., Cuddapah, S., Cui, K., Roh, T. Y., Peng, W., Zhang, M. Q., and Zhao, K. (2008). “Combinatorial patterns of histone acetylations and methylations in the human genome.” Nature Genetics, 40: 897–903.
  • Wong, F., Carter, C. K., and Kohn, R. (2003). “Efficient estimation of covariance selection models.” Biometrika, 90(4): 809–830.
  • Wright, S. (1934). “The method of path coefficients.” Annals of Mathematical Statistics, 5(3): 161–215.
  • Yang, E., Ravikumar, P., Allen, G. I., and Liu, Z. (2013). “On graphical models via univariate exponential family distributions.” arXiv:1301.4183.
  • Yang, E., Ravikumar, P. D., Allen, G. I., and Liu, Z. (2012). “Graphical Models via Generalized Linear Models.” In: NIPS, volume 25, 1367–1375.
  • Yuan, M. and Lin, Y. (2007). “Model selection and estimation in the Gaussian graphical model.” Biometrika, 94(1): 19–35.
  • Zhang, Y. (2012). “A novel Bayesian graphical model for genome-wide multi-SNP association mapping.” Genetic Epidemiology, 36(1): 36–47.
  • Zhang, Y., Lv, J., Liu, H., Zhu, J., Su, J., Wu, Q., Qi, Y., Wang, F., and Li, X. (2010). “HHMD: the human histone modification database.” Nucleic Acids Research, 38: D149–154.