The Annals of Statistics
- Ann. Statist.
- Volume 40, Number 2 (2012), 812-831.
Estimation in high-dimensional linear models with deterministic design matrices
Because of the advance in technologies, modern statistical studies often encounter linear models with the number of explanatory variables much larger than the sample size. Estimation and variable selection in these high-dimensional problems with deterministic design points is very different from those in the case of random covariates, due to the identifiability of the high-dimensional regression parameter vector. We show that a reasonable approach is to focus on the projection of the regression parameter vector onto the linear space generated by the design matrix. In this work, we consider the ridge regression estimator of the projection vector and propose to threshold the ridge regression estimator when the projection vector is sparse in the sense that many of its components are small. The proposed estimator has an explicit form and is easy to use in application. Asymptotic properties such as the consistency of variable selection and estimation and the convergence rate of the prediction mean squared error are established under some sparsity conditions on the projection vector. A simulation study is also conducted to examine the performance of the proposed estimator.
Ann. Statist. Volume 40, Number 2 (2012), 812-831.
First available in Project Euclid: 17 May 2012
Permanent link to this document
Digital Object Identifier
Mathematical Reviews number (MathSciNet)
Zentralblatt MATH identifier
Shao, Jun; Deng, Xinwei. Estimation in high-dimensional linear models with deterministic design matrices. Ann. Statist. 40 (2012), no. 2, 812--831. doi:10.1214/12-AOS982. http://projecteuclid.org/euclid.aos/1337268213.