Composite quantile regression and the oracle model selection theory



The Annals of Statistics

Composite quantile regression and the oracle model selection theory

Hui Zou and Ming Yuan

Source: Ann. Statist. Volume 36, Number 3 (2008), 1108-1126.

Abstract

Coefficient estimation and variable selection in multiple linear regression is routinely done in the (penalized) least squares (LS) framework. The concept of model selection oracle introduced by Fan and Li [J. Amer. Statist. Assoc. 96 (2001) 1348–1360] characterizes the optimal behavior of a model selection procedure. However, the least-squares oracle theory breaks down if the error variance is infinite. In the current paper we propose a new regression method called composite quantile regression (CQR). We show that the oracle model selection theory using the CQR oracle works beautifully even when the error variance is infinite. We develop a new oracular procedure to achieve the optimal properties of the CQR oracle. When the error variance is finite, CQR still enjoys great advantages in terms of estimation efficiency. We show that the relative efficiency of CQR compared to the least squares is greater than 70% regardless the error distribution. Moreover, CQR could be much more efficient and sometimes arbitrarily more efficient than the least squares. The same conclusions hold when comparing a CQR-oracular estimator with a LS-oracular estimator.

Primary Subjects: 62J05
Secondary Subjects: 62J07
Keywords: Asymptotic efficiency; linear program; model selection; oracle properties; universal lower bound

Full-text: Access denied (no subscription detected)

We're sorry, but we are unable to provide you with the full text of this article because we are not able to identify you as a subscriber.
If you have a personal subscription to this journal, then please login. If you are already logged in, then you may need to update your profile to register your subscription. Read more about accessing full-text
Links and Identifiers

Permanent link to this document: http://projecteuclid.org/euclid.aos/1211819558
Digital Object Identifier: doi:10.1214/07-AOS507
Mathematical Reviews number (MathSciNet): MR2418651
Zentralblatt MATH identifier: 05294967

References

Breiman, L. (1995). Better subset regression using the nonnegative garrote. Technometrics 37 373–384.
Mathematical Reviews (MathSciNet): MR1365720
Digital Object Identifier: doi:10.2307/1269730
Fan, J. and Li, R. (2001). Variable selection via nonconcave penalized likelihood and its oracle properties. J. Amer. Statist. Assoc. 96 1348–1360.
Mathematical Reviews (MathSciNet): MR1946581
Digital Object Identifier: doi:10.1198/016214501753382273
Fan, J. and Li, R. (2006). Statistical challenges with high dimensionality: Feature selection in knowledge discovery. Proceedings of the Madrid International Congress of Mathematicians 2006 III 595–622. EMS, Zurich.
Mathematical Reviews (MathSciNet): MR2275698
Feller, W. (1968). An Introduction to Probability Theory and Its Applications. 1, 3rd ed. Wiley, New York.
Mathematical Reviews (MathSciNet): MR228020
Knight, K. (1998). Limiting distributions for l1 regression estimators under general conditions. Ann. Statist. 26 755–770.
Mathematical Reviews (MathSciNet): MR1626024
Digital Object Identifier: doi:10.1214/aos/1028144858
Project Euclid: euclid.aos/1028144858
Koenker, R. (2005). Quantile Regression. Cambridge Univ. Press.
Mathematical Reviews (MathSciNet): MR2268657
Koenker, R. and Geling, R. (2001). Reappraising medfly longevity: A quantile regression survival analysis. J. Amer. Statist. Assoc. 96 458–468.
Mathematical Reviews (MathSciNet): MR1939348
Digital Object Identifier: doi:10.1198/016214501753168172
Koenker, R. and Hallock, K. (2001). Quantile regression. J. Economic Perspectives 15 143–156.
Tibshirani, R. (1996). Regression shrinkage and selection via the lasso. J. Roy. Statist. Soc. Ser. B 58 267–288.
Mathematical Reviews (MathSciNet): MR1379242
Zou, H. (2006). The adaptive lasso and its oracle properties. J. Amer. Statist. Assoc. 101 1418–1429.
Mathematical Reviews (MathSciNet): MR2279469
Digital Object Identifier: doi:10.1198/016214506000000735
Zou, H. and Yuan, M. (2007). Composite quantile regression and the oracle model selection theory. Technical report, Univ. Minnesota.
Mathematical Reviews (MathSciNet): MR2418651
Digital Object Identifier: doi:10.1214/07-AOS507
Project Euclid: euclid.aos/1211819558

2009 © Institute of Mathematical Statistics