The Annals of Applied Statistics

Of mice and men: Sparse statistical modeling in cardiovascular genomics

David M. Seo, Pascal J. Goldschmidt-Clermont, and Mike West

Source: Ann. Appl. Stat. Volume 1, Number 1 (2007), 152-178.

Abstract

In high-throughput genomics, large-scale designed experiments are becoming common, and analysis approaches based on highly multivariate regression and ANOVA concepts are key tools. Shrinkage models of one form or another can provide comprehensive approaches to the problems of simultaneous inference that involve implicit multiple comparisons over the many, many parameters representing effects of design factors and covariates. We use such approaches here in a study of cardiovascular genomics. The primary experimental context concerns a carefully designed, and rich, gene expression study focused on gene-environment interactions, with the goals of identifying genes implicated in connection with disease states and known risk factors, and of generating expression signatures as proxies for such risk factors. A coupled exploratory analysis investigates cross-species extrapolation of gene expression signatures, that is, how these mouse-model signatures translate to humans. The latter involves exploration of sparse latent factor analysis of human observational data and of how it relates to projected risk signatures derived in the animal models. The study also highlights a range of applied statistical and genomic data analysis issues, including model specification, computational questions and model-based correction of experimental artifacts in DNA microarray data.

Keywords: Animal–human extrapolation; atherosclerosis risk factors; gene-environment interactions; gene expression signatures; multivariate ANOVA; latent factor models; sparse statistical modeling

Links and Identifiers

Permanent link to this document: http://projecteuclid.org/euclid.aoas/1183143733
Digital Object Identifier: doi:10.1214/07-AOAS110
Mathematical Reviews number (MathSciNet): MR2393845
Zentralblatt MATH identifier: 1129.62104


2009 © Institute of Mathematical Statistics