The Annals of Statistics

A Comparison of GCV and GML for Choosing the Smoothing Parameter in the Generalized Spline Smoothing Problem

Grace Wahba

Full-text: Open access

Abstract

The partially improper prior behind the smoothing spline model is used to obtain a generalization of the maximum likelihood (GML) estimate for the smoothing parameter. Then this estimate is compared with the generalized cross validation (GCV) estimate both analytically and by Monte Carlo methods. The comparison is based on a predictive mean square error criteria. It is shown that if the true, unknown function being estimated is smooth in a sense to be defined then the GML estimate undersmooths relative to the GCV estimate and the predictive mean square error using the GML estimate goes to zero at a slower rate than the mean square error using the GCV estimate. If the true function is "rough" then the GCV and GML estimates have asymptotically similar behavior. A Monte Carlo experiment was designed to see if the asymptotic results in the smooth case were evident in small sample sizes. Mixed results were obtained for $n = 32$, GCV was somewhat better than GML for $n = 64$, and GCV was decidedly superior for $n = 128$. In the $n = 32$ case GCV was better for smaller $\sigma^2$ and the comparison close for larger $\sigma^2$. The theoretical results are shown to extend to the generalized spline smoothing model, which includes the estimate of functions given noisy values of various integrals of them.

Article information

Source
Ann. Statist., Volume 13, Number 4 (1985), 1378-1402.

Dates
First available in Project Euclid: 12 April 2007

Permanent link to this document
https://projecteuclid.org/euclid.aos/1176349743

Digital Object Identifier
doi:10.1214/aos/1176349743

Mathematical Reviews number (MathSciNet)
MR811498

Zentralblatt MATH identifier
0596.65004

JSTOR
links.jstor.org

Subjects
Primary: 65D07: Splines
Secondary: 65D10: Smoothing, curve fitting 62J02: General nonlinear regression 65R20: Integral equations

Keywords
Spline smoothing cross validation maximum likelihood integral equations

Citation

Wahba, Grace. A Comparison of GCV and GML for Choosing the Smoothing Parameter in the Generalized Spline Smoothing Problem. Ann. Statist. 13 (1985), no. 4, 1378--1402. doi:10.1214/aos/1176349743. https://projecteuclid.org/euclid.aos/1176349743


Export citation