Source: Ann. Statist. Volume 32, Number 6
(2004), 2532-2558.
This paper constructs improved estimators of the means in the Gaussian saturated one-way layout with an ordinal factor. The least squares estimator for the mean vector in this saturated model is usually inadmissible. The hybrid shrinkage estimators of this paper exploit the possibility of slow variation in the dependence of the means on the ordered factor levels but do not assume it and respond well to faster variation if present. To motivate the development, candidate penalized least squares (PLS) estimators for the mean vector of a one-way layout are represented as shrinkage estimators relative to the penalty basis for the regression space. This canonical representation suggests further classes of candidate estimators for the unknown means: monotone shrinkage (MS) estimators or soft-thresholding (ST) estimators or, most generally, hybrid shrinkage (HS) estimators that combine the preceding two strategies. Adaptation selects the estimator within a candidate class that minimizes estimated risk. Under the Gaussian saturated one-way layout model, such adaptive estimators minimize risk asymptotically over the class of candidate estimators as the number of factor levels tends to infinity. Thereby, adaptive HS estimators asymptotically dominate adaptive MS and adaptive ST estimators as well as the least squares estimator. Local annihilators of polynomials, among them difference operators, generate penalty bases suitable for a range of numerical examples. In case studies, adaptive HS estimators recover high frequency details in the mean vector more reliably than PLS or MS estimators and low frequency details more reliably than ST estimators.
References
Beran, R. (1996). Confidence sets centered at $C_p$ estimators. Ann. Inst. Statist. Math. 48 1--15.
Beran, R. (2000). REACT scatterplot smoothers: Superefficiency through basis economy. J. Amer. Statist. Assoc. 95 155--171.
Beran, R. (2001). Discussion of ``Local extremes, runs, strings and multiresolution,'' by P. L. Davies and A. Kovac. Ann. Statist. 29 48--52.
Beran, R. and Dümbgen, L. (1998). Modulation of estimators and confidence sets. Ann. Statist. 26 1826--1856.
Brillinger, D. R. and Tukey, J. W. (1985). Spectrum analysis in the presence of noise: Some issues and examples. In The Collected Works of John W. Tukey II. Time Series: 1965--1984 (D. R. Brillinger, ed.) 1001--1141. Wadsworth, Monterey, CA.
Brockwell, P. J. and Davis, R. A. (1996). Introduction to Time Series and Forecasting. Springer, New York.
Buja, A., Hastie, T. and Tibshirani, R. (1989). Linear smoothers and additive models (with discussion). Ann. Statist. 17 453--555.
Mathematical Reviews (MathSciNet):
MR994249
Donoho, D. L. and Johnstone, I. M. (1995). Adapting to unknown smoothness via wavelet shrinkage. J. Amer. Statist. Assoc. 90 1200--1224.
Efromovich, S. (1999). Quasi-linear wavelet estimation. J. Amer. Statist. Assoc. 94 189--204.
Efromovich, S. and Pinsker, M. S. (1984). Learning algorithm for nonparametric filtering. Automat. Remote Control 1984 1434--1440.
Friedman, J. H. (2001). The role of statistics in the data revolution? Internat. Statist. Rev. 69 5--10.
Golubev, G. K. (1987). Adaptive asymptotically minimax estimators of smooth signals. Problems Inform. Transmission 23 47--55.
Mathematical Reviews (MathSciNet):
MR893970
James, W. and Stein, C. (1961). Estimation with quadratic loss. Proc. Fourth Berkeley Symp. Math. Statist. Probab. 1 361--379. Univ. California Press, Berkeley.
Mathematical Reviews (MathSciNet):
MR133191
Kneip, A. (1994). Ordered linear smoothers. Ann. Statist. 22 835--866.
Li, K.-C. and Hwang, J. T. (1984). The data-smoothing aspect of Stein estimates. Ann. Statist. 12 887--897.
Mathematical Reviews (MathSciNet):
MR751280
Mallows, C. L. (1973). Some comments on $C_p$. Technometrics 15 661--676.
Pinsker, M. S. (1980). Optimal filtration of square-integrable signals in Gaussian noise. Problems Inform. Transmission 16 120--133.
Mathematical Reviews (MathSciNet):
MR624591
Pisier, G. (1983). Some applications of the metric entropy condition to harmonic analysis. Banach Spaces, Harmonic Analysis, and Probability Theory. Lecture Notes in Math. 995 123--154. Springer, Berlin.
Mathematical Reviews (MathSciNet):
MR717231
Pollard, D. (1990). Empirical Processes: Theory and Applications. IMS, Hayward, CA.
Press, W. H., Teukolsky, S. A., Vetterling, W. T. and Flannery, B. P. (1992). Numerical Recipes in FORTRAN: The Art of Scientific Computing, 2nd ed. Cambridge Univ. Press.
Robertson, T., Wright, F. T. and Dykstra, R. L. (1988). Order Restricted Statistical Inference. Wiley, New York.
Mathematical Reviews (MathSciNet):
MR961262
Stein, C. (1956). Inadmissibility of the usual estimator for the mean of a multivariate normal distribution. Proc. Third Berkeley Symp. Math. Statist. Probab. 1 197--206. Univ. California Press, Berkeley.
Mathematical Reviews (MathSciNet):
MR84922
Stein, C. (1981). Estimation of the mean of a multivariate normal distribution. Ann. Statist. 9 1135--1151.
Mathematical Reviews (MathSciNet):
MR630098
Tukey, J. W. (1977). Exploratory Data Analysis. Addison--Wesley, Reading, MA.
Tukey, J. W. (1980). Methodological comments focused on opportunities. In Multivariate Techniques in Human Communication Research (P. R. Monge and J. N. Cappella, eds.) 490--528. Academic Press, New York.