Institute of Mathematical Statistics Collections

Asymptotic optimality of a cross-validatory predictive approach to linear model selection

Arijit Chakrabarti, Tapas Samanta

Abstract

In this article we study the asymptotic predictive optimality of a model selection criterion based on the cross-validatory predictive density, already available in the literature. For a dependent variable and associated explanatory variables, we consider a class of linear models as approximations to the true regression function. One selects a model among these using the criterion under study and predicts a future replicate of the dependent variable by an optimal predictor under the chosen model. We show that for squared error prediction loss, this scheme of prediction performs asymptotically as well as an oracle, where the oracle here refers to a model selection rule which minimizes this loss if the true regression were known.

First Page: Show Hide
Primary Subjects: 62J05
Secondary Subjects: 62F15
Keywords: cross-validation; oracle; predictive density
Full-text: Open access
Links and Identifiers

Permanent link to this document: http://projecteuclid.org/euclid.imsc/1209398466
Digital Object Identifier: doi:10.1214/074921708000000110

References

[1] Barbieri, M. M. and Berger, J. O. (2004). Optimal predictive model selection. Ann. Statist. 32 870–897.
Mathematical Reviews (MathSciNet): MR2065192
Zentralblatt MATH: 1092.62033
Digital Object Identifier: doi:10.1214/009053604000000238
Project Euclid: euclid.aos/1085408489
[2] Berger, J. O. and Pericchi, L. R. (1996a). The intrinsic Bayes factor for model selection and prediction. J. Amer. Statist. Assoc. 91 109–122.
Mathematical Reviews (MathSciNet): MR1394065
Zentralblatt MATH: 0870.62021
Digital Object Identifier: doi:10.2307/2291387
[3] Berger, J. O. and Pericchi, L. R. (1996b). The intrinsic Bayes factor for linear models (with discussion). In Bayesian Statistics (J. M. Bernardo et al., eds.) 5 25–44. Oxford Univ. Press.
Mathematical Reviews (MathSciNet): MR1425398
[4] Bernardo, J. M. and Smith, A. F. M. (1994). Bayesian Theory. Wiley, Chichester.
Mathematical Reviews (MathSciNet): MR1274699
[5] Chakrabarti, A. and Ghosh, J. K. (2008). Some aspects of Bayesian model selection for prediction (with discussion). In Bayesian Statistics (J. M. Bernardo et al., eds.) 8. To appear.
Mathematical Reviews (MathSciNet): MR2433189
[6] Chatterjee, S. and Hadi, A. S. (1988). Sensitivity Analysis in Linear Regression. Wiley, New York.
Mathematical Reviews (MathSciNet): MR939610
Zentralblatt MATH: 0648.62066
[7] Geisser, S. (1975). The predictive sample reuse method with applications. J. Amer. Statist. Assoc. 70 320–328.
[8] Geisser, S. and Eddy, W. F. (1979). A predictive approach to model selection. J. Amer. Statist. Assoc. 74 153–160.
Mathematical Reviews (MathSciNet): MR529531
Digital Object Identifier: doi:10.2307/2286745
[9] Gelfand, A. E. and Dey, D. K. (1994). Bayesian model choice: asymptotics and exact calculations. J. Roy. Statist. Soc. Ser. B 56 501–514.
Mathematical Reviews (MathSciNet): MR1278223
[10] Gelfand, A. E. and Ghosh, S. K. (1998). Model choice: a minimum posterior predictive loss approach. Biometrika 85 1–11.
Mathematical Reviews (MathSciNet): MR1627258
Zentralblatt MATH: 0904.62036
Digital Object Identifier: doi:10.1093/biomet/85.1.1
[11] Ghosh, J. K. and Samanta, T. (2002). Nonsubjective Bayes testing – an overview. J. Statist. Plann. Inference 103 205–223.
Mathematical Reviews (MathSciNet): MR1896993
Zentralblatt MATH: 0989.62017
Digital Object Identifier: doi:10.1016/S0378-3758(01)00222-1
[12] Li, K.-C. (1987). Asymptotic optimality for Cp, CL, cross-validation and generalized cross-validation: discrete index set. Ann. Statist. 15 958–975.
Mathematical Reviews (MathSciNet): MR902239
Zentralblatt MATH: 0653.62037
Digital Object Identifier: doi:10.1214/aos/1176350486
Project Euclid: euclid.aos/1176350486
[13] Mukhopadhyay, N. (2000). Bayesian model selection for high dimensional models with prediction error loss and 0–1 loss. Ph.D. thesis, Purdue Univ.
[14] Mukhopadhyay, N., Ghosh, J. K. and Berger, J. O. (2005). Some Bayesian predictive approaches to model selection. Statist. Probab. Lett. 73 369–379.
Mathematical Reviews (MathSciNet): MR2187852
[15] O’Hagan, A. (1995). Fractional Bayes factors for model comparisons. J. Roy. Statist. Soc. Ser. B 57 99–138.
Mathematical Reviews (MathSciNet): MR1325379
[16] Shao, J. (1997). An asymptotic theory for linear model selection (with discussion). Statist. Sinica 7 221–264.
Mathematical Reviews (MathSciNet): MR1466682
Zentralblatt MATH: 1003.62527
[17] Stone, M. (1974). Cross-validatory choice and assessment of statistical predictions (with discussion). J. Roy. Statist. Soc. Ser. B 36 111–147.
Mathematical Reviews (MathSciNet): MR356377
[18] Whittle (1960). Bounds for the moments of linear and quadratic forms in independent variables. Theory Probab. Appl. 5 302–305.
Mathematical Reviews (MathSciNet): MR133849

2012 © Institute of Mathematical Statistics

Institute of Mathematical Statistics Collections

Institute of Mathematical Statistics Collections