Nonparametric bootstrap prediction



Bernoulli

Nonparametric bootstrap prediction

Tadayoshi Fushiki, Fumiyasu Komaki, and Kazuyuki Aihara

Source: Bernoulli Volume 11, Number 2 (2005), 293-307.

Abstract

Ensemble learning has recently been intensively studied in the field of machine learning. `Bagging' is a method of ensemble learning and uses bootstrap data to construct various predictors. The required prediction is then obtained by averaging the predictors. Harris proposed using this technique with the parametric bootstrap predictive distribution to construct predictive distributions, and showed that the parametric bootstrap predictive distribution gives asymptotically better prediction than a plug-in distribution with the maximum likelihood estimator. In this paper, we investigate nonparametric bootstrap predictive distributions. The nonparametric bootstrap predictive distribution is precisely that obtained by applying bagging to the statistical prediction problem. We show that the nonparametric bootstrap predictive distribution gives predictions asymptotically as good as the parametric bootstrap predictive distribution.

Keywords: asymptotic theory; bagging; bootstrap predictive distribution; information geometry; Kullback-Leibler divergence

Full-text: Open access

Links and Identifiers

Permanent link to this document: http://projecteuclid.org/euclid.bj/1116340296
Mathematical Reviews number (MathSciNet): MR2132728
Zentralblatt MATH identifier: 1063.62062
Digital Object Identifier: doi:10.3150/bj/1116340296

References

[1] Aitchison, J. (1975) Goodness of predictive fit. Biometrika, 62, 547-554.
[2] Amari, S. (1985) Differential-Geometrical Methods in Statistics. New York: Springer-Verlag.
[3] Amari, S. and Nagaoka, H. (2000) Methods of Information Geometry. New York: AMS and Oxford University Press.
[4] Breiman, L. (1996) Bagging predictors. Machine Learning, 24, 123-140.
[5] Freund, Y. and Schapire, R. (1997) A decision-theoretic generalization of on-line learning and an application to boosting. J. Comput. System Sci., 55, 119-139.
[6] Fushiki, T., Komaki, F. and Aihara, K. (2004) On parametric bootstrapping and Bayesian prediction. Scand. J. Statist., 31 403-416.
[7] Harris, I.R. (1989) Predictive fit for natural exponential families. Biometrika, 76, 675-684.
[8] Hartigan, J.A. (1998) The maximum likelihood prior. Ann. Statist., 26, 2083-2103.
[9] Komaki, F. (1996) On asymptotic properties of predictive distributions. Biometrika, 83, 299-313.
[10] McCullagh, P. (1987) Tensor Methods in Statistics. London: Chapman & Hall.
[11] Vidoni, P. (1995) A simple predictive density based on the p*-formula. Biometrika, 82, 855-863.

2008 © Bernoulli Society for Mathematical Statistics and Probability