Open Access
September 2016 Gene-proximity models for genome-wide association studies
Ian Johnston, Timothy Hancock, Hiroshi Mamitsuka, Luis Carvalho
Ann. Appl. Stat. 10(3): 1217-1244 (September 2016). DOI: 10.1214/16-AOAS907

Abstract

Motivated by the important problem of detecting association between genetic markers and binary traits in genome-wide association studies, we present a novel Bayesian model that establishes a hierarchy between markers and genes by defining weights according to gene lengths and distances from genes to markers. The proposed hierarchical model uses these weights to define unique prior probabilities of association for markers based on their proximities to genes that are believed to be relevant to the trait of interest. We use an expectation-maximization algorithm in a filtering step to first reduce the dimensionality of the data and then sample from the posterior distribution of the model parameters to estimate posterior probabilities of association for the markers. We offer practical and meaningful guidelines for the selection of the model tuning parameters and propose a pipeline that exploits a singular value decomposition on the raw data to make our model run efficiently on large data sets. We demonstrate the performance of the model in simulation studies and conclude by discussing the results of a case study using a real-world data set provided by the Wellcome Trust Case Control Consortium.

Citation

Download Citation

Ian Johnston. Timothy Hancock. Hiroshi Mamitsuka. Luis Carvalho. "Gene-proximity models for genome-wide association studies." Ann. Appl. Stat. 10 (3) 1217 - 1244, September 2016. https://doi.org/10.1214/16-AOAS907

Information

Received: 1 October 2013; Revised: 1 September 2015; Published: September 2016
First available in Project Euclid: 28 September 2016

zbMATH: 06775265
MathSciNet: MR3553223
Digital Object Identifier: 10.1214/16-AOAS907

Keywords: hierarchical Bayes , Large $p$ small $n$ , Pólya–Gamma latent variable

Rights: Copyright © 2016 Institute of Mathematical Statistics

Vol.10 • No. 3 • September 2016
Back to Top