Bernoulli

Sharper lower bounds on the performance of the empirical risk minimization algorithm

Guillaume Lecué and Shahar Mendelson
Source: Bernoulli Volume 16, Number 3 (2010), 605-613.

Abstract

We present an argument based on the multidimensional and the uniform central limit theorems, proving that, under some geometrical assumptions between the target function T and the learning class F, the excess risk of the empirical risk minimization algorithm is lower bounded by

\[\frac{\mathbb{E}\sup_{q\in Q}G_{q}}{\sqrt{n}}\delta\],

where (Gq)qQ is a canonical Gaussian process associated with Q (a well chosen subset of F) and δ is a parameter governing the oscillations of the empirical excess risk function over a small ball in F.

First Page: Show Hide
Full-text: Access denied (no subscription detected)
We're sorry, but we are unable to provide you with the full text of this article because we are not able to identify you as a subscriber.
If you have a personal subscription to this journal, then please login. If you are already logged in, then you may need to update your profile to register your subscription. Read more about accessing full-text
Links and Identifiers

Permanent link to this document: http://projecteuclid.org/euclid.bj/1281099877
Digital Object Identifier: doi:10.3150/09-BEJ225
Mathematical Reviews number (MathSciNet): MR2730641
Zentralblatt MATH identifier: 05945278

References

[1] Bartlett, P.L. and Mendelson, S. (2006). Empirical minimization., Probab. Theory Related Fields 135 311–334.
Mathematical Reviews (MathSciNet): MR2240689
Zentralblatt MATH: 1142.62348
Digital Object Identifier: doi:10.1007/s00440-005-0462-3
[2] Dudley, R.M. (1999)., Uniform Central Limit Theorems. Cambridge Studies in Advanced Mathematics 63. Cambridge: Cambridge Univ. Press.
Mathematical Reviews (MathSciNet): MR1720712
[3] Koltchinskii, V. (2006). Local Rademacher complexities and oracle inequalities in risk minimization., Ann. Statist. 34 2593–2656.
Mathematical Reviews (MathSciNet): MR2329442
Zentralblatt MATH: 1118.62065
Digital Object Identifier: doi:10.1214/009053606000001019
Project Euclid: euclid.aos/1179935055
[4] Lecué, G. (2007). Suboptimality of penalized empirical risk minimization in classification. In, 20th Annual Conference On Learning Theory, COLT07 (G. Bshouty, ed.). LNAI 4539 142–156. Berlin: Springer.
Mathematical Reviews (MathSciNet): MR2397584
[5] Lee, W.S., Bartlett, P.L. and Williamson, R.C. (1998). The importance of convexity in learning with squared loss., IEEE Trans. Inform. Theory 44 1974–1980.
Mathematical Reviews (MathSciNet): MR1664079
Digital Object Identifier: doi:10.1109/18.705577
[6] Massart P. and Nédélec, É. (2006). Risk bounds for statistical learning., Ann. Statist. 34 2326–2366.
Mathematical Reviews (MathSciNet): MR2291502
Zentralblatt MATH: 1108.62007
Digital Object Identifier: doi:10.1214/009053606000000786
Project Euclid: euclid.aos/1169571799
[7] Mendelson, S. (2008). Lower bounds for the empirical minimization algorithm., IEEE Trans. Inform. Theory. 54 3797–3803.
Mathematical Reviews (MathSciNet): MR2451042
Digital Object Identifier: doi:10.1109/TIT.2008.926323
[8] Talagrand, M. (2005)., The Generic Chaining. Springer Monographs in Mathematics. Berlin: Springer-Verlag.
Mathematical Reviews (MathSciNet): MR2133757
[9] van der Vaart, A.W. and Wellner, J.A. (1996)., Weak Convergence and Empirical Processes. Springer Series in Statistics. New York: Springer-Verlag.
Mathematical Reviews (MathSciNet): MR1385671
Zentralblatt MATH: 0862.60002
[10] Vapnik, V.N. (1998)., Statistical Learning Theory. Adaptive and Learning Systems for Signal Processing, Communications, and Control. New York: Wiley.
Mathematical Reviews (MathSciNet): MR1641250

2013 © Bernoulli Society for Mathematical Statistics and Probability

Bernoulli

Bernoulli

Turn MathJax Off
What is MathJax?