Source: Ann. Statist. Volume 33, Number 1
(2005), 101-125.
Many statistical practices involve choosing between a full model and reduced models where some coefficients are reduced to zero. Data were used to select a model with estimated coefficients. Is it possible to do so and still come up with an estimator always better than the traditional estimator based on the full model? The James–Stein estimator is such an estimator, having a property called minimaxity. However, the estimator considers only one reduced model, namely the origin. Hence it reduces no coefficient estimator to zero or every coefficient estimator to zero. In many applications including wavelet analysis, what should be more desirable is to reduce to zero only the estimators smaller than a threshold, called thresholding in this paper. Is it possible to construct this kind of estimators which are minimax?
In this paper, we construct such minimax estimators which perform thresholding. We apply our recommended estimator to the wavelet analysis and show that it performs the best among the well-known estimators aiming simultaneously at estimation and model selection. Some of our estimators are also shown to be asymptotically optimal.
References
Anderson, T. W. (1955). The integral of a symmetric unimodal function over a symmetric convex set and some probability inequalities. Proc. Amer. Math. Soc. 6 170--176.
Mathematical Reviews (MathSciNet):
MR69229
Antoniadis, A. and Fan, J. (2001). Regularization of wavelet approximations (with discussion). J. Amer. Statist. Assoc. 96 939--967.
Antoniadis, A., Leporini, D. and Pesquet, J.-C. (2002). Wavelet thresholding for some classes of non-Gaussian noise. Statist. Neerlandica 56 434--453.
Beran, R. and Dümbgen, L. (1998). Modulation of estimators and confidence sets. Ann. Statist. 26 1826--1856.
Berger, J. (1976). Tail minimaxity in location vector problems and its applications. Ann. Statist. 4 33--50.
Mathematical Reviews (MathSciNet):
MR391319
Berger, J. (1980). Improving on inadmissible estimators in continuous exponential families with applications to simultaneous estimation of gamma scale parameters. Ann. Statist. 8 545--571.
Mathematical Reviews (MathSciNet):
MR568720
Brown, L. D. (1971). Admissible estimators, recurrent diffusions, and insoluble boundary value problems. Ann. Math. Statist. 42 855--903.
Mathematical Reviews (MathSciNet):
MR286209
Cai, T. (1999). Adaptive wavelet estimation: A block thresholding and oracle inequality approach. Ann. Statist. 27 898--924.
Donoho, D. L. and Johnstone, I. (1994). Ideal spatial adaptation via wavelet shrinkage. Biometrika 81 425--455.
Donoho, D. L. and Johnstone, I. (1995). Adapting to unknown smoothness via wavelet shrinkage. J. Amer. Statist. Assoc. 90 1200--1224.
Gao, H.-Y. (1998). Wavelet shrinkage denoising using the non-negative garrote. J. Comput. Graph. Statist. 7 469--488.
Gauch, H. (1993). Prediction, parsimony and noise. American Scientist 81 468--478.
George, E. I. (1986a). Minimax multiple shrinkage estimation. Ann. Statist. 14 188--205.
Mathematical Reviews (MathSciNet):
MR829562
George, E. I. (1986b). Combining minimax shrinkage estimators. J. Amer. Statist. Assoc. 81 437--445.
Mathematical Reviews (MathSciNet):
MR845881
James, W. and Stein, C. (1961). Estimation with quadratic loss. Proc. Fourth Berkeley Symp. Math. Statist. Probab. 1 361--379. Univ. California Press, Berkeley.
Mathematical Reviews (MathSciNet):
MR133191
Lehmann, E. L. (1983). Theory of Point Estimation. Wiley, New York
Mathematical Reviews (MathSciNet):
MR702834
Lehmann, E. L. and Casella, G. C. (1998). Theory of Point Estimation, 2nd ed. Springer, New York.
Mallat, S. G. (1989). A theory for multiresolution signal decomposition: The wavelet representation. IEEE Trans. Pattern Analysis Machine Intelligence 11 674--693.
Stein, C. (1981). Estimation of the mean of a multivariate normal distribution. Ann. Statist. 9 1135--1151.
Mathematical Reviews (MathSciNet):
MR630098
Vidakovic, B. (1999). Statistical Modeling by Wavelets. Wiley, New York.
Zhou, H. H. and Hwang, J. T. G. (2003). Minimax estimation with thresholding. Technical report, Cornell Statistical Center.