The Annals of Applied Statistics

Bayesian joint modeling of multiple gene networks and diverse genomic data to identify target genes of a transcription factor

Peng Wei and Wei Pan


We consider integrative modeling of multiple gene networks and diverse genomic data, including protein-DNA binding, gene expression and DNA sequence data, to accurately identify the regulatory target genes of a transcription factor (TF). Rather than treating all genes equally and independently a priori, as existing joint modeling approaches do, we incorporate the biological prior knowledge that neighboring genes on a gene network tend to be (or not to be) regulated together by a TF. A key contribution of our work is that, to maximize the use of all existing biological knowledge, we allow incorporation of multiple gene networks into joint modeling of genomic data by introducing a mixture model based on multiple Markov random fields (MRFs). Another important contribution is that we allow different genomic data to be correlated, and we examine the validity and effect of the independence assumption adopted in existing methods. Because the approach is fully Bayesian, inference about model parameters can be carried out using MCMC samples. Application to an E. coli data set, together with simulation studies, demonstrates the utility of the proposed joint model and its gains in statistical efficiency.

Article information

Ann. Appl. Stat., Volume 6, Number 1 (2012), 334-355.

First available in Project Euclid: 6 March 2012

Keywords

Bayesian hierarchical model; Markov random field; gene networks; joint modeling; mixture models; systems biology


Citation

Wei, Peng; Pan, Wei. Bayesian joint modeling of multiple gene networks and diverse genomic data to identify target genes of a transcription factor. Ann. Appl. Stat. 6 (2012), no. 1, 334-355. doi:10.1214/11-AOAS502.

Supplemental materials

  • Supplementary material: Supplemental tables and figures. WinBUGS code, sensitivity analysis results and MCMC convergence diagnostic plots can be found in the supplemental article.