In this article we study the asymptotic predictive optimality of a model selection criterion based on the cross-validatory predictive density, already available in the literature. For a dependent variable and associated explanatory variables, we consider a class of linear models as approximations to the true regression function. One selects a model among these using the criterion under study and predicts a future replicate of the dependent variable by an optimal predictor under the chosen model. We show that for squared error prediction loss, this scheme of prediction performs asymptotically as well as an oracle, where the oracle here refers to a model selection rule which minimizes this loss if the true regression were known.
References
[1] Barbieri, M. M. and Berger, J. O. (2004). Optimal predictive model selection. Ann. Statist. 32 870–897.
[2] Berger, J. O. and Pericchi, L. R. (1996a). The intrinsic Bayes factor for model selection and prediction. J. Amer. Statist. Assoc. 91 109–122.
[3] Berger, J. O. and Pericchi, L. R. (1996b). The intrinsic Bayes factor for linear models (with discussion). In Bayesian Statistics (J. M. Bernardo et al., eds.) 5 25–44. Oxford Univ. Press.
[4] Bernardo, J. M. and Smith, A. F. M. (1994). Bayesian Theory. Wiley, Chichester.
[5] Chakrabarti, A. and Ghosh, J. K. (2008). Some aspects of Bayesian model selection for prediction (with discussion). In Bayesian Statistics (J. M. Bernardo et al., eds.) 8. To appear.
[6] Chatterjee, S. and Hadi, A. S. (1988). Sensitivity Analysis in Linear Regression. Wiley, New York.
Mathematical Reviews (MathSciNet):
MR939610
[7] Geisser, S. (1975). The predictive sample reuse method with applications. J. Amer. Statist. Assoc. 70 320–328.
[8] Geisser, S. and Eddy, W. F. (1979). A predictive approach to model selection. J. Amer. Statist. Assoc. 74 153–160.
Mathematical Reviews (MathSciNet):
MR529531
[9] Gelfand, A. E. and Dey, D. K. (1994). Bayesian model choice: asymptotics and exact calculations. J. Roy. Statist. Soc. Ser. B 56 501–514.
[10] Gelfand, A. E. and Ghosh, S. K. (1998). Model choice: a minimum posterior predictive loss approach. Biometrika 85 1–11.
[11] Ghosh, J. K. and Samanta, T. (2002). Nonsubjective Bayes testing – an overview. J. Statist. Plann. Inference 103 205–223.
[12] Li, K.-C. (1987). Asymptotic optimality for Cp, CL, cross-validation and generalized cross-validation: discrete index set. Ann. Statist. 15 958–975.
Mathematical Reviews (MathSciNet):
MR902239
[13] Mukhopadhyay, N. (2000). Bayesian model selection for high dimensional models with prediction error loss and 0–1 loss. Ph.D. thesis, Purdue Univ.
[14] Mukhopadhyay, N., Ghosh, J. K. and Berger, J. O. (2005). Some Bayesian predictive approaches to model selection. Statist. Probab. Lett. 73 369–379.
[15] O’Hagan, A. (1995). Fractional Bayes factors for model comparisons. J. Roy. Statist. Soc. Ser. B 57 99–138.
[16] Shao, J. (1997). An asymptotic theory for linear model selection (with discussion). Statist. Sinica 7 221–264.
[17] Stone, M. (1974). Cross-validatory choice and assessment of statistical predictions (with discussion). J. Roy. Statist. Soc. Ser. B 36 111–147.
Mathematical Reviews (MathSciNet):
MR356377
[18] Whittle (1960). Bounds for the moments of linear and quadratic forms in independent variables. Theory Probab. Appl. 5 302–305.
Mathematical Reviews (MathSciNet):
MR133849