The Annals of Statistics

On the adaptive elastic-net with a diverging number of parameters

Hui Zou and Hao Helen Zhang
Source: Ann. Statist. Volume 37, Number 4 (2009), 1733-1751.

Abstract

We consider the problem of model selection and estimation in situations where the number of parameters diverges with the sample size. When the dimension is high, an ideal method should have the oracle property [J. Amer. Statist. Assoc. 96 (2001) 1348–1360] and [Ann. Statist. 32 (2004) 928–961] which ensures the optimal large sample performance. Furthermore, the high-dimensionality often induces the collinearity problem, which should be properly handled by the ideal method. Many existing variable selection methods fail to achieve both goals simultaneously. In this paper, we propose the adaptive elastic-net that combines the strengths of the quadratic regularization and the adaptively weighted lasso shrinkage. Under weak regularity conditions, we establish the oracle property of the adaptive elastic-net. We show by simulations that the adaptive elastic-net deals with the collinearity problem better than the other oracle-like methods, thus enjoying much improved finite sample performance.

First Page: Show Hide
Primary Subjects: 62J05
Secondary Subjects: 62J07
Full-text: Open access
Links and Identifiers

Permanent link to this document: http://projecteuclid.org/euclid.aos/1245332831
Digital Object Identifier: doi:10.1214/08-AOS625
Zentralblatt MATH identifier: 05582009
Mathematical Reviews number (MathSciNet): MR2533470

References

Breiman, L. (1996). Heuristics of instability and stabilization in model selection. Ann. Statist. 24 2350–2383.
Mathematical Reviews (MathSciNet): MR1425957
Zentralblatt MATH: 0867.62055
Digital Object Identifier: doi:10.1214/aos/1032181158
Project Euclid: euclid.aos/1032181158
Candes, E. and Tao, T. (2007). The Dantzig selector: Statistical estimation when p is much larger than n. Ann. Statist. 35 2313–2351.
Mathematical Reviews (MathSciNet): MR2382644
Zentralblatt MATH: 1139.62019
Digital Object Identifier: doi:10.1214/009053606000001523
Project Euclid: euclid.aos/1201012958
Candes, E., Wakin, M. and Boyd, S. (2008). Enhancing sparsity by reweighted l1 minimization. J. Fourier Anal. Appl. To appear.
Mathematical Reviews (MathSciNet): MR2461611
Zentralblatt MATH: 1176.94014
Digital Object Identifier: doi:10.1007/s00041-008-9045-x
Donoho, D. and Johnstone, I. (1994). Ideal spatial adaptation via wavelet shrinkage. Biometrika 81 425–455.
Mathematical Reviews (MathSciNet): MR1311089
Zentralblatt MATH: 0815.62019
Digital Object Identifier: doi:10.1093/biomet/81.3.425
Donoho, D., Johnstone, I., Kerkyacharian, G. and Picard, D. (1995). Wavelet shrinkage: Asymptopia? (with discussion). J. Roy. Statist. Soc. Ser. B 57 301–337.
Mathematical Reviews (MathSciNet): MR1323344
Efron, B., Hastie, T., Johnstone, I. and Tibshirani, R. (2004). Least angle regression. Ann. Statist. 32 407–499.
Mathematical Reviews (MathSciNet): MR2060166
Zentralblatt MATH: 1091.62054
Digital Object Identifier: doi:10.1214/009053604000000067
Project Euclid: euclid.aos/1083178935
Fan, J. and Li, R. (2001). Variable selection via nonconcave penalized likelihood and its oracle properties. J. Amer. Statist. Assoc. 96 1348–1360.
Mathematical Reviews (MathSciNet): MR1946581
Zentralblatt MATH: 1073.62547
Digital Object Identifier: doi:10.1198/016214501753382273
Fan, J. and Li, R. (2006). Statistical challenges with high dimensionality: Feature selection in knowledge discovery. In International Congress of Mathematicians 3 595–622.
Mathematical Reviews (MathSciNet): MR2275698
Zentralblatt MATH: 1117.62137
Fan, J. and Lv, J. (2008). Sure independence screening for ultra-high-dimensional feature space. J. Roy. Statist. Soc. Ser. B 70 849–911.
Fan, J. and Peng, H. (2004). Nonconcave penalized likelihood with a diverging number of parameters. Ann. Statist. 32 928–961.
Mathematical Reviews (MathSciNet): MR2065194
Zentralblatt MATH: 1092.62031
Digital Object Identifier: doi:10.1214/009053604000000256
Project Euclid: euclid.aos/1085408491
Fan, J., Peng, H. and Huang, T. (2005). Semilinear high-dimensional model for normalization of microarray data: A theoretical analysis and partial consistency (with discussion). J. Amer. Statist. Assoc. 100 781–813.
Mathematical Reviews (MathSciNet): MR2201010
Zentralblatt MATH: 1117.62330
Digital Object Identifier: doi:10.1198/016214504000001781
Huber, P. (1988). Robust regression: Asymptotics, conjectures and Monte Carlo. Ann. Statist. 1 799–821.
Mathematical Reviews (MathSciNet): MR356373
Zentralblatt MATH: 0289.62033
Digital Object Identifier: doi:10.1214/aos/1176342503
Project Euclid: euclid.aos/1176342503
Knight, K. and Fu, W. (2000). Asymptotics for lasso-type estimators. Ann. Statist. 28 1356–1378.
Mathematical Reviews (MathSciNet): MR1805787
Zentralblatt MATH: 1105.62357
Digital Object Identifier: doi:10.1214/aos/1015957397
Project Euclid: euclid.aos/1015957397
Lam, C. and Fan, J. (2008). Profile-kernel likelihood inference with diverging number of parameters. Ann. Statist. 36 2232–2260.
Mathematical Reviews (MathSciNet): MR2458186
Zentralblatt MATH: 05368490
Digital Object Identifier: doi:10.1214/07-AOS544
Project Euclid: euclid.aos/1223908091
Portnoy, S. (1984). Asymptotic behavior of M-estimatiors of p regression parameters when p2/n is large. I. Consistency. Ann. Statist. 12 1298–1309.
Mathematical Reviews (MathSciNet): MR760690
Zentralblatt MATH: 0584.62050
Digital Object Identifier: doi:10.1214/aos/1176346793
Project Euclid: euclid.aos/1176346793
Tibshirani, R. (1996). Regression shrinkage and selection via the lasso. J. Roy. Statist. Soc. Ser. B 58 267–288.
Mathematical Reviews (MathSciNet): MR1379242
Wang, H., Li, R. and Tsai, C. (2007). Tuning parameter selectors for the smoothly clipped absolute deviation method. Biometrika 94 553–568.
Mathematical Reviews (MathSciNet): MR2410008
Zentralblatt MATH: 1135.62058
Digital Object Identifier: doi:10.1093/biomet/asm053
Zou, H. (2006). The adaptive lasso and its oracle properties. J. Amer. Statist. Assoc. 101 1418–1429.
Mathematical Reviews (MathSciNet): MR2279469
Zentralblatt MATH: 1171.62326
Digital Object Identifier: doi:10.1198/016214506000000735
Zou, H. and Hastie, T. (2005). Regularization and variable selection via the elastic net. J. R. Stat. Soc. Ser. B Stat. Methodol. 67 301–320.
Mathematical Reviews (MathSciNet): MR2137327
Zentralblatt MATH: 1069.62054
Digital Object Identifier: doi:10.1111/j.1467-9868.2005.00503.x
Zou, H., Hastie, T. and Tibshirani, R. (2007). On the degrees of freedom of the lasso. Ann. Statist. 35 2173–2192.
Mathematical Reviews (MathSciNet): MR2363967
Zentralblatt MATH: 1126.62061
Digital Object Identifier: doi:10.1214/009053607000000127
Project Euclid: euclid.aos/1194461726

2013 © Institute of Mathematical Statistics

The Annals of Statistics

The Annals of Statistics

Turn MathJax Off
What is MathJax?