The Annals of Statistics
- Ann. Statist.
- Volume 37, Number 2 (2009), 673-696.
SCAD-penalized regression in high-dimensional partially linear models
We consider the problem of simultaneous variable selection and estimation in partially linear models with a divergent number of covariates in the linear part, under the assumption that the vector of regression coefficients is sparse. We apply the SCAD penalty to achieve sparsity in the linear part and use polynomial splines to estimate the nonparametric component. Under reasonable conditions, it is shown that consistency in terms of variable selection and estimation can be achieved simultaneously for the linear and nonparametric components. Furthermore, the SCAD-penalized estimators of the nonzero coefficients are shown to have the asymptotic oracle property, in the sense that it is asymptotically normal with the same means and covariances that they would have if the zero coefficients were known in advance. The finite sample behavior of the SCAD-penalized estimators is evaluated with simulation and illustrated with a data set.
Ann. Statist., Volume 37, Number 2 (2009), 673-696.
First available in Project Euclid: 10 March 2009
Permanent link to this document
Digital Object Identifier
Mathematical Reviews number (MathSciNet)
Zentralblatt MATH identifier
Xie, Huiliang; Huang, Jian. SCAD-penalized regression in high-dimensional partially linear models. Ann. Statist. 37 (2009), no. 2, 673--696. doi:10.1214/07-AOS580. https://projecteuclid.org/euclid.aos/1236693146