Bernoulli

The Dantzig selector and sparsity oracle inequalities

Vladimir Koltchinskii
Source: Bernoulli Volume 15, Number 3 (2009), 799-828.

Abstract

Let

Yj=f*(Xj)+ξj,  j=1, …, n,

where X, X1, …, Xn are i.i.d. random variables in a measurable space $(S,\mathcal{A})$ with distribution Π and ξ, ξ1, …, ξn are i.i.d. random variables with ${\mathbb{E}}\xi=0$ independent of (X1, …, Xn). Given a dictionary h1, …, hN: S↦ℝ, let fλ:=∑j=1Nλjhj, λ=(λ1, …, λN)∈ℝN. Given ɛ>0, define

̂Λɛ:={λ∈ℝN: max1≤kN|n−1j=1n(fλ(Xj)−Yj)hk(Xj)|≤ɛ}

and

̂λ:=̂λɛ∈Argminλ̂Λɛλ1.

In the case where f*:=fλ*, λ*∈ℝN, Candes and Tao [Ann. Statist. 35 (2007) 2313–2351] suggested using ̂λ as an estimator of λ*. They called this estimator “the Dantzig selector”. We study the properties of f̂λ as an estimator of f* for regression models with random design, extending some of the results of Candes and Tao (and providing alternative proofs of these results).

First Page: Show Hide
Full-text: Access denied (no subscription detected)
We're sorry, but we are unable to provide you with the full text of this article because we are not able to identify you as a subscriber.
If you have a personal subscription to this journal, then please login. If you are already logged in, then you may need to update your profile to register your subscription. Read more about accessing full-text
Links and Identifiers

Permanent link to this document: http://projecteuclid.org/euclid.bj/1251463282
Digital Object Identifier: doi:10.3150/09-BEJ187
Mathematical Reviews number (MathSciNet): MR2555200
Zentralblatt MATH identifier: 05815956

References

Bickel, P., Ritov, Y. and Tsybakov, A. (2009). Simultaneous analysis of LASSO and Dantzig selector. Ann. Statist. To appear.
Mathematical Reviews (MathSciNet): MR2533469
Zentralblatt MATH: 1173.62022
Digital Object Identifier: doi:10.1214/08-AOS620
Project Euclid: euclid.aos/1245332830
Bobkov, S. and Houdré, C. (1997). Isoperimetric constants for product probability measures. Ann. Probab. 25 184–205.
Mathematical Reviews (MathSciNet): MR1428505
Zentralblatt MATH: 0878.60013
Digital Object Identifier: doi:10.1214/aop/1024404284
Project Euclid: euclid.aop/1024404284
Bunea, F., Tsybakov, A. and Wegkamp, M. (2007). Sparsity oracle inequalities for the LASSO. Electron. J. Statist. 1 169–194.
Mathematical Reviews (MathSciNet): MR2312149
Zentralblatt MATH: 1146.62028
Digital Object Identifier: doi:10.1214/07-EJS008
Project Euclid: euclid.ejs/1179759718
Candes, E., Romberg, J. and Tao, T. (2006). Robust uncertainty principles: Exact signal reconstruction from highly incomplete frequency information. IEEE Trans. Inform. Theory 52 489–509.
Mathematical Reviews (MathSciNet): MR2236170
Digital Object Identifier: doi:10.1109/TIT.2005.862083
Candes, E. and Tao, T. (2005). Decoding by linear programming. IEEE Trans. Inform. Theory 51 4203–4215.
Mathematical Reviews (MathSciNet): MR2243152
Digital Object Identifier: doi:10.1109/TIT.2005.858979
Candes, E. and Tao, T. (2007). The Dantzig selector: Statistical estimation when p is much larger than n. Ann. Statist. 35 2313–2351.
Mathematical Reviews (MathSciNet): MR2382644
Zentralblatt MATH: 1139.62019
Digital Object Identifier: doi:10.1214/009053606000001523
Project Euclid: euclid.aos/1201012958
de la Pena, V. and Giné, E. (1998). Decoupling: From Dependence to Independence. New York: Springer.
Mathematical Reviews (MathSciNet): MR1666908
Donoho, D.L. (2006a). For most large underdetermined systems of linear equations the minimal 1-norm solution is also the sparsest solution. Commun. Pure Appl. Math. 59 797–829.
Donoho, D.L. (2006b). For most large underdetermined systems of equations the minimal 1-norm near-solution approximates the sparsest near-solution. Commun. Pure Appl. Math. 59 907–934.
Koltchinskii, V. (2005). Model selection and aggregation in sparse classification problems. In Oberwolfach Reports: Meeting on Statistical and Probabilistic Methods of Model Selection, October 2005. European Mathematical Society Publishing House.
Koltchinskii, V. (2009). Sparsity in penalized empirical risk minimization. Ann. Inst. H. Poincaré Probab. Statist. 45 7–57.
Mathematical Reviews (MathSciNet): MR2500227
Zentralblatt MATH: 1168.62044
Digital Object Identifier: doi:10.1214/07-AIHP146
Project Euclid: euclid.aihp/1234469970
Ledoux, M. and Talagrand, M. (1991). Probability in Banach Spaces. New York: Springer.
Mathematical Reviews (MathSciNet): MR1102015
Mendelson, S., Pajor, A. and Tomczak-Jaegermann, N. (2007). Reconstruction and subgaussian operators in asymptotic geometric analysis. Geom. Funct. Anal. 17 1248–1282.
Mathematical Reviews (MathSciNet): MR2373017
Zentralblatt MATH: 1163.46008
Digital Object Identifier: doi:10.1007/s00039-007-0618-7
Rudelson, M. and Vershynin, R. (2005). Geometric approach to error correcting codes and reconstruction of signals. Int. Math. Res. Not. 64 4019–4041.
Mathematical Reviews (MathSciNet): MR2206919
Zentralblatt MATH: 1103.94014
Digital Object Identifier: doi:10.1155/IMRN.2005.4019
van de Geer, S. (2008). High-dimensional generalized linear models and the Lasso. Ann. Statist. 36 614–645.
Mathematical Reviews (MathSciNet): MR2396809
Zentralblatt MATH: 1138.62323
Digital Object Identifier: doi:10.1214/009053607000000929
Project Euclid: euclid.aos/1205420513
van der Vaart, A. and Wellner, J. (1996). Weak Convergence and Empirical Processes. New York: Springer.
Mathematical Reviews (MathSciNet): MR1385671
Zentralblatt MATH: 0862.60002

2012 © Bernoulli Society for Mathematical Statistics and Probability

Bernoulli

Bernoulli

Turn MathJax Off
What is MathJax?