Open Access
March 2010 Regularized multivariate regression for identifying master predictors with application to integrative genomics study of breast cancer
Jie Peng, Ji Zhu, Anna Bergamaschi, Wonshik Han, Dong-Young Noh, Jonathan R. Pollack, Pei Wang
Ann. Appl. Stat. 4(1): 53-77 (March 2010). DOI: 10.1214/09-AOAS271

Abstract

In this paper we propose a new method remMap—REgularized Multivariate regression for identifying MAster Predictors—for fitting multivariate response regression models under the high-dimension–low-sample-size setting. remMap is motivated by investigating the regulatory relationships among different biological molecules based on multiple types of high dimensional genomic data. Particularly, we are interested in studying the influence of DNA copy number alterations on RNA transcript levels. For this purpose, we model the dependence of the RNA expression levels on DNA copy numbers through multivariate linear regressions and utilize proper regularization to deal with the high dimensionality as well as to incorporate desired network structures. Criteria for selecting the tuning parameters are also discussed. The performance of the proposed method is illustrated through extensive simulation studies. Finally, remMap is applied to a breast cancer study, in which genome wide RNA transcript levels and DNA copy numbers were measured for 172 tumor samples. We identify a trans-hub region in cytoband 17q12-q21, whose amplification influences the RNA expression levels of more than 30 unlinked genes. These findings may lead to a better understanding of breast cancer pathology.

Citation

Download Citation

Jie Peng. Ji Zhu. Anna Bergamaschi. Wonshik Han. Dong-Young Noh. Jonathan R. Pollack. Pei Wang. "Regularized multivariate regression for identifying master predictors with application to integrative genomics study of breast cancer." Ann. Appl. Stat. 4 (1) 53 - 77, March 2010. https://doi.org/10.1214/09-AOAS271

Information

Published: March 2010
First available in Project Euclid: 11 May 2010

zbMATH: 1189.62174
MathSciNet: MR2758084
Digital Object Identifier: 10.1214/09-AOAS271

Keywords: DNA copy number alteration , MAP (MAster Predictor) penalty , RNA transcript level , Sparse regression , v-fold cross validation

Rights: Copyright © 2010 Institute of Mathematical Statistics

Vol.4 • No. 1 • March 2010
Back to Top