Institute of Mathematical Statistics Collections

Risk and resampling under model uncertainty

Snigdhansu Chatterjee, Nitai D. Mukhopadhyay

Abstract

In statistical exercises where there are several candidate models, the traditional approach is to select one model using some data driven criterion and use that model for estimation, testing and other purposes, ignoring the variability of the model selection process. We discuss some problems associated with this approach. An alternative scheme is to use a model-averaged estimator, that is, a weighted average of estimators obtained under different models, as an estimator of a parameter. We show that the risk associated with a Bayesian model-averaged estimator is bounded as a function of the sample size, when parameter values are fixed. We establish conditions which ensure that a model-averaged estimator’s distribution can be consistently approximated using the bootstrap. A new, data-adaptive, model averaging scheme is proposed that balances efficiency of estimation without compromising applicability of the bootstrap. This paper illustrates that certain desirable risk and resampling properties of model-averaged estimators are obtainable when parameters are fixed but unknown; this complements several studies on minimaxity and other properties of post-model-selected and model-averaged estimators, where parameters are allowed to vary.

First Page: Show Hide
Primary Subjects: 60F12
Secondary Subjects: 60J05, 62C10, 62F40
Keywords: bootstrap; bounded risk; linear regression; model averaging; model selection
Full-text: Open access
Links and Identifiers

Permanent link to this document: http://projecteuclid.org/euclid.imsc/1209398467
Digital Object Identifier: doi:10.1214/074921708000000129

References

[1] Andrews, D. W. K. and Guggenberger, P. (2005). Hybrid and size-corrected subsample methods. Cowles Foundation discussion paper # 1605.
[2] Andrews, D. W. K. and Guggenberger, P. (2005). The limit of finite sample size and a problem with subsampling. Cowles Foundation discussion paper # 1606.
[3] Chatterjee, S. and Bose, A. (2000). Variance estimation in high dimensional regression models. Statist. Sinica 10 497–515.
Mathematical Reviews (MathSciNet): MR1769754
Zentralblatt MATH: 0999.62051
[4] Hall, P. and Wilson, S. R. (1991). Two guidelines for bootstrap hypothesis testing. Biometrics 47 757–762.
Mathematical Reviews (MathSciNet): MR1132543
Digital Object Identifier: doi:10.2307/2532163
[5] Hjort, N. L. and Claeskens, G. (2003). Frequentist model average estimators. J. Amer. Statist. Assoc. 98 879–899.
Mathematical Reviews (MathSciNet): MR2041481
Zentralblatt MATH: 1047.62003
Digital Object Identifier: doi:10.1198/016214503000000828
[6] Leeb, H. (2006). The distribution of a linear predictor after model selection: unconditional finite sample distributions and asymptotic approximations. IMS Lecture Notes Monograph Series 49 291–311.
Mathematical Reviews (MathSciNet): MR2338549
Digital Object Identifier: doi:10.1214/074921706000000518
[7] Leeb, H. and Pötscher, B. M. (2003). The finite sample distribution of post-model-selection estimators and uniform versus non-uniform approximations. Econometric Theory 19 100–142.
Mathematical Reviews (MathSciNet): MR1965844
Zentralblatt MATH: 1032.62011
Digital Object Identifier: doi:10.1017/S0266466603191050
[8] Leeb, H. and Pötscher, B. M. (2005). Model selection and inference: facts and fiction. Econometric Theory 21 21–59.
Mathematical Reviews (MathSciNet): MR2153856
Digital Object Identifier: doi:10.1017/S0266466605050036
[9] Leeb, H. and Pötscher, B. M. (2006). Performance limits for the estimators of the risk or distribution of shrinkage type estimators, and some general lower risk bound results. Econometric Theory 22 69–97.
Mathematical Reviews (MathSciNet): MR2212693
Digital Object Identifier: doi:10.1017/S0266466606060038
[10] Leeb, H. and Pötscher, B. M. (2006). Can one estimate the conditional distribution of post-model-selection estimators? Ann. Statist. 34 2554–2591.
Mathematical Reviews (MathSciNet): MR2291510
Zentralblatt MATH: 1106.62029
Digital Object Identifier: doi:10.1214/009053606000000821
Project Euclid: euclid.aos/1169571807
[11] Leung, G. and Barron, A. R. (2006). Information theory and mixing least squares regressions. IEEE Trans. Inform. Theory 52 3396–3410.
Mathematical Reviews (MathSciNet): MR2242356
Digital Object Identifier: doi:10.1109/TIT.2006.878172
[12] Mammen, E. (1992a). Bootstrap, wild bootstrap and asymptotic normality. Probab. Theory Related Fields 93 439–455.
Mathematical Reviews (MathSciNet): MR1183886
Zentralblatt MATH: 0766.62021
Digital Object Identifier: doi:10.1007/BF01192716
[13] Mammen, E. (1992b). When Does Bootstrap Work: Asymptotic Results and Simulations. Springer, Berlin.
[14] Politis, D. N., Romano, J. P. and Wolf, M. (1999). Subsampling. Springer, New York.
Mathematical Reviews (MathSciNet): MR1707286
[15] Pötscher, B. M. (1991). Effects of model selection on inference. Econometric Theory 7 163–185.
Mathematical Reviews (MathSciNet): MR1128410
Digital Object Identifier: doi:10.1017/S0266466600004382
[16] Pötscher, B. M. (2006). The distribution of model averaging estimators and an impossibility result regarding its estimation. MPRA paper # 73.
Mathematical Reviews (MathSciNet): MR2427842
[17] Raftery, A. E. and Zheng, Y. (2003). Comment on “Frequentist model average estimators,” by N. L. Hjort and G. Claeskens. J. Amer. Statist. Assoc. 98 931–938.
[18] Samworth, R. (2003). A note on methods of restoring consistency to the bootstrap. Biometrika 90 985–990.
Mathematical Reviews (MathSciNet): MR2024773
Digital Object Identifier: doi:10.1093/biomet/90.4.985
[19] Sethuraman, J. (2004). Are super-efficient estimators super-powerful? Comm. Statist. Theory and Methods 33 2003–2013.
Mathematical Reviews (MathSciNet): MR2103058
Zentralblatt MATH: 02128323
Digital Object Identifier: doi:10.1081/STA-200026561
[20] Shen, X. and Dougherty, D. P. (2003). Discussion of “Frequentist model average estimators,” by N. L. Hjort and G. Claeskens. J. Amer. Statist. Assoc. 98 917–919.
[21] Yang, Y. (2003). Regression with multiple candidate models: selecting or mixing? Statist. Sinica 13 783–809.
Mathematical Reviews (MathSciNet): MR1997174
Zentralblatt MATH: 1028.62021
[22] Yang, Y. (2004). Aggregating regression procedures to improve performance. Bernoulli 10 25–47.
Mathematical Reviews (MathSciNet): MR2044592
Digital Object Identifier: doi:10.3150/bj/1077544602
Project Euclid: euclid.bj/1077544602
[23] Yang, Y. (2005). Can the strengths of AIC and BIC be shared? A conflict between model identification and regression estimation. Biometrika 92 937–950.
Mathematical Reviews (MathSciNet): MR2234196
Digital Object Identifier: doi:10.1093/biomet/92.4.937
[24] Yang, Y. (2007). Prediction/estimation with simple linear model: is it really that simple? Econometric Theory 23 1–36.
Mathematical Reviews (MathSciNet): MR2338950
Digital Object Identifier: doi:10.1017/S0266466607070016
[25] Yuan, Z. and Yang, Y. (2005). Combining linear regression models: when and how? J. Amer. Statist. Assoc. 100 1202–1214.
Mathematical Reviews (MathSciNet): MR2236435
Zentralblatt MATH: 1117.62454
Digital Object Identifier: doi:10.1198/016214505000000088

2013 © Institute of Mathematical Statistics

Institute of Mathematical Statistics Collections

Institute of Mathematical Statistics Collections