The Annals of Statistics

Hybrid shrinkage estimators using penalty bases for the ordinal one-way layout

Rudolf Beran
Source: Ann. Statist. Volume 32, Number 6 (2004), 2532-2558.

Abstract

This paper constructs improved estimators of the means in the Gaussian saturated one-way layout with an ordinal factor. The least squares estimator for the mean vector in this saturated model is usually inadmissible. The hybrid shrinkage estimators of this paper exploit the possibility of slow variation in the dependence of the means on the ordered factor levels but do not assume it and respond well to faster variation if present. To motivate the development, candidate penalized least squares (PLS) estimators for the mean vector of a one-way layout are represented as shrinkage estimators relative to the penalty basis for the regression space. This canonical representation suggests further classes of candidate estimators for the unknown means: monotone shrinkage (MS) estimators or soft-thresholding (ST) estimators or, most generally, hybrid shrinkage (HS) estimators that combine the preceding two strategies. Adaptation selects the estimator within a candidate class that minimizes estimated risk. Under the Gaussian saturated one-way layout model, such adaptive estimators minimize risk asymptotically over the class of candidate estimators as the number of factor levels tends to infinity. Thereby, adaptive HS estimators asymptotically dominate adaptive MS and adaptive ST estimators as well as the least squares estimator. Local annihilators of polynomials, among them difference operators, generate penalty bases suitable for a range of numerical examples. In case studies, adaptive HS estimators recover high frequency details in the mean vector more reliably than PLS or MS estimators and low frequency details more reliably than ST estimators.

First Page: Show Hide
Primary Subjects: 62J07
Full-text: Open access
Links and Identifiers

Permanent link to this document: http://projecteuclid.org/euclid.aos/1107794878
Digital Object Identifier: doi:10.1214/009053604000000652
Mathematical Reviews number (MathSciNet): MR2153994

References

Beran, R. (1996). Confidence sets centered at $C_p$ estimators. Ann. Inst. Statist. Math. 48 1--15.
Mathematical Reviews (MathSciNet): MR1392512
Digital Object Identifier: doi:10.1007/BF00049285
Beran, R. (2000). REACT scatterplot smoothers: Superefficiency through basis economy. J. Amer. Statist. Assoc. 95 155--171.
Mathematical Reviews (MathSciNet): MR1803148
Beran, R. (2001). Discussion of ``Local extremes, runs, strings and multiresolution,'' by P. L. Davies and A. Kovac. Ann. Statist. 29 48--52.
Mathematical Reviews (MathSciNet): MR1833958
Digital Object Identifier: doi:10.1214/aos/996986500
Project Euclid: euclid.aos/996986501
Beran, R. and Dümbgen, L. (1998). Modulation of estimators and confidence sets. Ann. Statist. 26 1826--1856.
Mathematical Reviews (MathSciNet): MR1673280
Digital Object Identifier: doi:10.1214/aos/1024691359
Project Euclid: euclid.aos/1024691359
Brillinger, D. R. and Tukey, J. W. (1985). Spectrum analysis in the presence of noise: Some issues and examples. In The Collected Works of John W. Tukey II. Time Series: 1965--1984 (D. R. Brillinger, ed.) 1001--1141. Wadsworth, Monterey, CA.
Brockwell, P. J. and Davis, R. A. (1996). Introduction to Time Series and Forecasting. Springer, New York.
Mathematical Reviews (MathSciNet): MR1416563
Zentralblatt MATH: 0868.62067
Buja, A., Hastie, T. and Tibshirani, R. (1989). Linear smoothers and additive models (with discussion). Ann. Statist. 17 453--555.
Mathematical Reviews (MathSciNet): MR994249
Donoho, D. L. and Johnstone, I. M. (1995). Adapting to unknown smoothness via wavelet shrinkage. J. Amer. Statist. Assoc. 90 1200--1224.
Mathematical Reviews (MathSciNet): MR1379464
Efromovich, S. (1999). Quasi-linear wavelet estimation. J. Amer. Statist. Assoc. 94 189--204.
Mathematical Reviews (MathSciNet): MR1689224
Efromovich, S. and Pinsker, M. S. (1984). Learning algorithm for nonparametric filtering. Automat. Remote Control 1984 1434--1440.
Friedman, J. H. (2001). The role of statistics in the data revolution? Internat. Statist. Rev. 69 5--10.
Golubev, G. K. (1987). Adaptive asymptotically minimax estimators of smooth signals. Problems Inform. Transmission 23 47--55.
Mathematical Reviews (MathSciNet): MR893970
James, W. and Stein, C. (1961). Estimation with quadratic loss. Proc. Fourth Berkeley Symp. Math. Statist. Probab. 1 361--379. Univ. California Press, Berkeley.
Mathematical Reviews (MathSciNet): MR133191
Kneip, A. (1994). Ordered linear smoothers. Ann. Statist. 22 835--866.
Mathematical Reviews (MathSciNet): MR1292543
Li, K.-C. and Hwang, J. T. (1984). The data-smoothing aspect of Stein estimates. Ann. Statist. 12 887--897.
Mathematical Reviews (MathSciNet): MR751280
Mallows, C. L. (1973). Some comments on $C_p$. Technometrics 15 661--676.
Pinsker, M. S. (1980). Optimal filtration of square-integrable signals in Gaussian noise. Problems Inform. Transmission 16 120--133.
Mathematical Reviews (MathSciNet): MR624591
Zentralblatt MATH: 0452.94003
Pisier, G. (1983). Some applications of the metric entropy condition to harmonic analysis. Banach Spaces, Harmonic Analysis, and Probability Theory. Lecture Notes in Math. 995 123--154. Springer, Berlin.
Mathematical Reviews (MathSciNet): MR717231
Zentralblatt MATH: 0517.60043
Pollard, D. (1990). Empirical Processes: Theory and Applications. IMS, Hayward, CA.
Mathematical Reviews (MathSciNet): MR1089429
Zentralblatt MATH: 0741.60001
Press, W. H., Teukolsky, S. A., Vetterling, W. T. and Flannery, B. P. (1992). Numerical Recipes in FORTRAN: The Art of Scientific Computing, 2nd ed. Cambridge Univ. Press.
Mathematical Reviews (MathSciNet): MR1196230
Robertson, T., Wright, F. T. and Dykstra, R. L. (1988). Order Restricted Statistical Inference. Wiley, New York.
Mathematical Reviews (MathSciNet): MR961262
Zentralblatt MATH: 0645.62028
Stein, C. (1956). Inadmissibility of the usual estimator for the mean of a multivariate normal distribution. Proc. Third Berkeley Symp. Math. Statist. Probab. 1 197--206. Univ. California Press, Berkeley.
Mathematical Reviews (MathSciNet): MR84922
Zentralblatt MATH: 0073.35602
Stein, C. (1981). Estimation of the mean of a multivariate normal distribution. Ann. Statist. 9 1135--1151.
Mathematical Reviews (MathSciNet): MR630098
Tukey, J. W. (1977). Exploratory Data Analysis. Addison--Wesley, Reading, MA.
Zentralblatt MATH: 0409.62003
Tukey, J. W. (1980). Methodological comments focused on opportunities. In Multivariate Techniques in Human Communication Research (P. R. Monge and J. N. Cappella, eds.) 490--528. Academic Press, New York.
Zentralblatt MATH: 0465.62110

2012 © Institute of Mathematical Statistics

The Annals of Statistics

The Annals of Statistics