The Annals of Statistics

Empirical risk minimization in inverse problems

Jussi Klemelä and Enno Mammen
Source: Ann. Statist. Volume 38, Number 1 (2010), 482-511.

Abstract

We study estimation of a multivariate function f : RdR when the observations are available from the function Af, where A is a known linear operator. Both the Gaussian white noise model and density estimation are studied. We define an L2-empirical risk functional which is used to define a δ-net minimizer and a dense empirical risk minimizer. Upper bounds for the mean integrated squared error of the estimators are given. The upper bounds show how the difficulty of the estimation depends on the operator through the norm of the adjoint of the inverse of the operator and on the underlying function class through the entropy of the class. Corresponding lower bounds are also derived. As examples, we consider convolution operators and the Radon transform. In these examples, the estimators achieve the optimal rates of convergence. Furthermore, a new type of oracle inequality is given for inverse problems in additive models.

First Page: Show Hide
Primary Subjects: 62G07
Full-text: Open access
Links and Identifiers

Permanent link to this document: http://projecteuclid.org/euclid.aos/1262271621
Digital Object Identifier: doi:10.1214/09-AOS726
Mathematical Reviews number (MathSciNet): MR2589328
Zentralblatt MATH identifier: 1181.62044

References

Bass, R. F. (1985). Law of the iterated logarithm for set-indexed partial sum processes with finite variance. Z. Wahrsch. Verw. Gebiete 65 181–237.
Mathematical Reviews (MathSciNet): MR807339
Bickel, P. J., Klaassen, C. A. J., Ritov, Y. and Wellner, J. A. (1993). Efficient and Adaptive Estimation for Semiparametric Models. Johns Hopkins Univ. Press, Baltimore, MD.
Mathematical Reviews (MathSciNet): MR1245941
Birgé, L. (1983). Approximation dans les espaces métriques et théorie de lestimation. Z. Wahrsch. Verw. Gebiete 70 591–608.
Mathematical Reviews (MathSciNet): MR722129
Birgé, L. and Massart, P. (1993). Rates of convergence for minimum contrast estimators. Probab. Theory Related Fields 97 113–150.
Mathematical Reviews (MathSciNet): MR1240719
Zentralblatt MATH: 0805.62037
Digital Object Identifier: doi:10.1007/BF01199316
Birman, M. S. and Solomyak, M. Z. (1967). Piecewise-polynomial approximations of functions of the classes Wpα. Mat. Sb. (N.S.) 73 331–355.
Cencov, N. N. (1972). Statistical Decision Rules and Optimal Inference. Nauka, Moscow.
Mathematical Reviews (MathSciNet): MR343398
Comte, F., Taupin, M.-L. and Rozenholc, Y. (2006). Penalized contrast estimator for density deconvolution. Canad. J. Statist. 34 431–452.
Mathematical Reviews (MathSciNet): MR2328553
Digital Object Identifier: doi:10.1002/cjs.5550340305
Deans, S. R. (1983). The Radon Transform and Some of Its Applications. Wiley, New York.
Mathematical Reviews (MathSciNet): MR709591
Donoho, D. L. (1995). Nonlinear solutions of linear inverse problems by wavelet-vaguelette decomposition. Appl. Comput. Harmon. Anal. 2 101–126.
Mathematical Reviews (MathSciNet): MR1325535
Digital Object Identifier: doi:10.1006/acha.1995.1008
Donoho, D. L. and Low, M. (1992). Renormalization exponents and optimal pointwise rates of convergence. Ann. Statist. 20 944–970.
Mathematical Reviews (MathSciNet): MR1165601
Zentralblatt MATH: 0797.62032
Digital Object Identifier: doi:10.1214/aos/1176348665
Project Euclid: euclid.aos/1176348665
Dudley, R. M. (1999). Uniform Central Limit Theorems. Cambridge Univ. Press, Cambridge.
Mathematical Reviews (MathSciNet): MR1720712
Ermakov, M. S. (1989). Minimax estimation of the solution of an ill-posed convolution type problem. Probl. Inf. Transm. 25 191–200.
Mathematical Reviews (MathSciNet): MR1021197
Hasminskii, R. Z. and Ibragimov, I. A. (1990). On density estimation in the view of Kolmogorov’s ideas in approximation theory. Ann. Statist. 18 999–1010.
Mathematical Reviews (MathSciNet): MR1062695
Digital Object Identifier: doi:10.1214/aos/1176347736
Project Euclid: euclid.aos/1176347736
Hastie, T. J. and Tibshirani, R. J. (1990). Generalized Additive Models. Chapman and Hall, London.
Mathematical Reviews (MathSciNet): MR1082147
Ibragimov, I. A. (2004). Estimation of multivariate regression. Theory Probab. Appl. 48 256–272.
Ibragimov, I. A. and Hasminskii, R. Z. (1980). On estimate of the density function. Zap. Nauchn. Sem. Leningrad. Otdel. Mat. Inst. Steklov. (LOMI) 98 61–85.
Mathematical Reviews (MathSciNet): MR591862
Ibragimov, I. A. and Hasminskii, R. Z. (1981). On the nonparametric density estimates. Zap. Nauchn. Sem. Leningrad. Otdel. Mat. Inst. Steklov. (LOMI) 108 73–89.
Johnstone, I. M. and Silverman, B. W. (1990). Speed of estimation in positron emission tomography and related inverse problems. Ann. Statist. 18 251–280.
Mathematical Reviews (MathSciNet): MR1041393
Zentralblatt MATH: 0699.62043
Digital Object Identifier: doi:10.1214/aos/1176347500
Project Euclid: euclid.aos/1176347500
Klemelä, J. and Mammen, E. (2009). Empirical risk minimization in inverse problems: Extended technical version. Available at arXiv:0904.2977v1.
Mathematical Reviews (MathSciNet): MR2589328
Zentralblatt MATH: 1181.62044
Digital Object Identifier: doi:10.1214/09-AOS726
Project Euclid: euclid.aos/1262271621
Kolmogorov, A. N. and Tikhomirov, V. M. (1961). ε-entropy and ε-capacity of sets in function spaces. Amer. Math. Soc. Transl. Ser. 2 17 277–364.
Mathematical Reviews (MathSciNet): MR124455
Zentralblatt MATH: 0128.27803
Koo, J. Y. (1993). Optimal rates of convergence for nonparametric statistical inverse problems. Ann. Statist. 21 590–599.
Mathematical Reviews (MathSciNet): MR1232506
Zentralblatt MATH: 0778.62040
Digital Object Identifier: doi:10.1214/aos/1176349138
Project Euclid: euclid.aos/1176349138
Korostelev, A. P. and Tsybakov, A. B. (1991). Optimal rates of convergence of estimators in a probabilistic setup of tomography problem. Problemy Peredachi Informatsii 27 73–81.
Mathematical Reviews (MathSciNet): MR1294566
Korostelev, A. P. and Tsybakov, A. B. (1993). Minimax Theory of Image Reconstruction. Lecture Notes in Statistics 82. Springer, Berlin.
Mathematical Reviews (MathSciNet): MR1226450
Zentralblatt MATH: 0833.62039
Le Cam, L. (1973). Convergence of estimates under dimensionality restrictions. Ann. Statist. 1 38–53.
Mathematical Reviews (MathSciNet): MR334381
Digital Object Identifier: doi:10.1214/aos/1193342380
Project Euclid: euclid.aos/1193342380
Mammen, E., Linton, O. and Nielsen, J. (1999). The existence and asymptotic properties of a backfitting projection algorithm under weak conditions. Ann. Statist. 27 1443–1490.
Mathematical Reviews (MathSciNet): MR1742496
Zentralblatt MATH: 0986.62028
Project Euclid: euclid.aos/1017939138
Ossiander, M. (1987). A central limit theorem under metric entropy with L2 bracketing. Ann. Probab. 15 897–919.
Mathematical Reviews (MathSciNet): MR893905
Zentralblatt MATH: 0665.60036
Digital Object Identifier: doi:10.1214/aop/1176992072
Project Euclid: euclid.aop/1176992072
O’Sullivan, F. (1986). A statistical perspective on ill-posed inverse problems. Statist. Sci. 1 502–527.
Mathematical Reviews (MathSciNet): MR874480
Digital Object Identifier: doi:10.1214/ss/1177013525
Project Euclid: euclid.ss/1177013525
Stone, C. J. (1985). Additive regression and other nonparametric models. Ann. Statist. 13 689–705.
Mathematical Reviews (MathSciNet): MR790566
Zentralblatt MATH: 0605.62065
Digital Object Identifier: doi:10.1214/aos/1176349548
Project Euclid: euclid.aos/1176349548
Tsybakov, A. B. (1998). Pointwise and sup-norm sharp adaptive estimation of functions on the Sobolev classes. Ann. Statist. 26 2420–2469.
Mathematical Reviews (MathSciNet): MR1700239
Zentralblatt MATH: 0933.62028
Digital Object Identifier: doi:10.1214/aos/1024691478
Project Euclid: euclid.aos/1024691478
Van de Geer, S. A. (2000). Empirical Processes in M-Estimation. Cambridge Univ. Press, Cambridge.
Zentralblatt MATH: 1179.62073
Van der Laan, M. J., Dudoit, S. and van der Vaart, A. W. (2004). The cross-validated adaptive epsilon-net estimator. Working paper, U.C. Berkeley Division of Biostatistics.
Yang, Y. and Barron, A. (1999). Information-theoretic determination of minimax rates of convergence. Ann. Statist. 27 1564–1599.
Mathematical Reviews (MathSciNet): MR1742500
Zentralblatt MATH: 0978.62008
Digital Object Identifier: doi:10.1214/aos/1017939142
Project Euclid: euclid.aos/1017939142

2013 © Institute of Mathematical Statistics

The Annals of Statistics

The Annals of Statistics

Turn MathJax Off
What is MathJax?