Source: Ann. Statist. Volume 32, Number 6
(2004), 2444-2468.
This paper studies, under the setting of spline regression, the connection between finite-sample properties of selection criteria and their asymptotic counterparts, focusing on bridging the gap between the two. We introduce a bias-variance decomposition of the prediction error, using which it is shown that in the asymptotics the bias term dominates the variability term, providing an explanation of the gap. A geometric exposition is provided for intuitive understanding. The theoretical and geometric results are illustrated through a numerical example.
References
Abramowitz, M. and Stegun, I. (1972). Handbook of Mathematical Functions, 10th printing. National Bureau of Standards, Washington.
Bowman, A. and Azzalini, A. (1997). Applied Smoothing Techniques for Data Analysis: The Kernel Approach with S-Plus Illustrations. Oxford Univ. Press, New York.
Demmler, A. and Reinsch, C. (1975). Oscillation matrices with spline smoothing. Numer. Math. 24 375--382.
Mathematical Reviews (MathSciNet):
MR395161
Efron, B. (1975). Defining the curvature of a statistical problem (with application to second order efficiency) (with discussion). Ann. Statist. 3 1189--1242.
Mathematical Reviews (MathSciNet):
MR428531
Efron, B. (2001). Selection criteria for scatterplot smoothers. Ann. Statist. 29 470--504.
Eubank, R. (1988). Spline Smoothing and Nonparametric Regression. Dekker, New York.
Mathematical Reviews (MathSciNet):
MR934016
Fan, J. (2000). Prospects of nonparametric modeling. J. Amer. Statist. Assoc. 95 1296--1300.
Fan, J. and Gijbels, I. (1996). Local Polynomial Modelling and Its Applications. Chapman and Hall, London.
Feller, W. (1971). An Introduction to Probability Theory and Its Applications 2, 2nd ed. Wiley, New York.
Green, P. and Silverman, B. (1994). Nonparametric Regression and Generalized Linear Models. Chapman and Hall, London.
Gu, C. (1998). Model indexing and smoothing parameter selection in nonparametric function estimation (with discussion). Statist. Sinica 8 607--646.
Hall, P. and Johnstone, I. (1992). Empirical functionals and efficient smoothing parameter selection (with discussion). J. Roy. Statist. Soc. Ser. B 54 475--530.
Härdle, W. (1990). Applied Nonparametric Regression. Cambridge Univ. Press.
Härdle, W., Hall, P. and Marron, J. (1988). How far are automatically chosen regression smoothing parameters from their optimum (with discussion)? J. Amer. Statist. Assoc. 83 86--101.
Mathematical Reviews (MathSciNet):
MR941001
Hastie, T. and Tibshirani, R. (1990). Generalized Additive Models. Chapman and Hall, London.
Hurvich, C., Simonoff, J. and Tsai, C. (1998). Smoothing parameter selection in nonparametric regression using an improved Akaike information criterion. J. R. Stat. Soc. Ser. B Stat. Methodol. 60 271--294.
Jones, M., Marron, J. and Sheather, S. (1996). A brief survey of bandwidth selection for density estimation. J. Amer. Statist. Assoc. 91 401--407.
Kohn, R., Ansley, C. and Tharm, D. (1991). The performance of cross-validation and maximum likelihood estimators of spline smoothing parameters. J. Amer. Statist. Assoc. 86 1042--1050.
Kou, S. C. (2003). On the efficiency of selection criteria in spline regression. Probab. Theory Related Fields 127 153--176.
Kou, S. C. and Efron, B. (2002). Smoothers and the $C_p$, generalized maximum likelihood and extended exponential criteria: A geometric approach. J. Amer. Statist. Assoc. 97 766--782.
Li, K.-C. (1986). Asymptotic optimality of $C_L$ and generalized cross-validation in ridge regression with application to spline smoothing. Ann. Statist. 14 1101--1112.
Mathematical Reviews (MathSciNet):
MR856808
Li, K.-C. (1987). Asymptotic optimality for $C_p$, $C_L$, cross-validation and generalized cross-validation: Discrete index set. Ann. Statist. 15 958--975.
Mathematical Reviews (MathSciNet):
MR902239
Rosenblatt, M. (1991). Stochastic Curve Estimation. IMS, Hayward, CA.
Silverman, B. (1985). Some aspects of the spline smoothing approach to nonparametric regression curve fitting (with discussion). J. Roy. Statist. Soc. Ser. B 47 1--52.
Mathematical Reviews (MathSciNet):
MR805063
Simonoff, J. (1996). Smoothing Methods in Statistics. Springer, New York.
Speckman, P. (1983). Efficient nonparametric regression with cross-validated smoothing splines. Unpublished manuscript.
Speckman, P. (1985). Spline smoothing and optimal rates of convergence in nonparametric regression models. Ann. Statist. 13 970--983.
Mathematical Reviews (MathSciNet):
MR803752
Speckman, P. and Sun, D. (2001). Asymptotic properties of smoothing parameter selection in spline regression. Preprint.
Stein, M. (1990). A comparison of generalized cross validation and modified maximum likelihood for estimating the parameters of a stochastic process. Ann. Statist. 18 1139--1157.
Wahba, G. (1985). A comparison of GCV and GML for choosing the smoothing parameter in the generalized spline smoothing problem. Ann. Statist. 13 1378--1402.
Mathematical Reviews (MathSciNet):
MR811498
Wahba, G. (1990). Spline Models for Observational Data. SIAM, Philadelphia.
Wecker, W. and Ansley, C. (1983). The signal extraction approach to nonlinear regression and spline smoothing. J. Amer. Statist. Assoc. 78 81--89.
Mathematical Reviews (MathSciNet):
MR696851