Source: Ann. Statist. Volume 34, Number 6
(2006), 2897-2920.
We study the rate of convergence of posterior distributions in density estimation problems for log-densities in periodic Sobolev classes characterized by a smoothness parameter p. The posterior expected density provides a nonparametric estimation procedure attaining the optimal minimax rate of convergence under Hellinger loss if the posterior distribution achieves the optimal rate over certain uniformity classes. A prior on the density class of interest is induced by a prior on the coefficients of the trigonometric series expansion of the log-density. We show that when p is known, the posterior distribution of a Gaussian prior achieves the optimal rate provided the prior variances die off sufficiently rapidly. For a mixture of normal distributions, the mixing weights on the dimension of the exponential family are assumed to be bounded below by an exponentially decreasing sequence. To avoid the use of infinite bases, we develop priors that cut off the series at a sample-size-dependent truncation point. When the degree of smoothness is unknown, a finite mixture of normal priors indexed by the smoothness parameter, which is also assigned a prior, produces the best rate. A rate-adaptive estimator is derived.
References
Barron, A. R. (1988). The exponential convergence of posterior probabilities with implications for Bayes estimators of density functions. Technical Report 7, Dept. Statistics, Univ. Illinois, Champaign.
Barron, A., Schervish, M. J. and Wasserman, L. (1999). The consistency of posterior distributions in nonparametric problems. Ann. Statist. 27 536--561.
Barron, A. R. and Sheu, C.-H. (1991). Approximation of density functions by sequences of exponential families. Ann. Statist. 19 1347--1369.
Belitser, E. and Ghosal, S. (2003). Adaptive Bayesian inference on the mean of an infinite-dimensional normal distribution. Ann. Statist. 31 536--559.
Billingsley, P. (1995). Probability and Measure, 3rd ed. Wiley, New York.
Birman, M. Š. and Solomjak, M. Z. (1967). Piecewise polynomial approximations of functions of classes $W_p^\alpha$. Mat. Sb. (N.S.) 73 331--355 (in Russian).
Crain, B. R. (1973). A note on density estimation using orthogonal expansions. J. Amer. Statist. Assoc. 68 964--965.
Crain, B. R. (1974). Estimation of distributions using orthogonal expansions. Ann. Statist. 2 454--463.
Crain, B. R. (1976). Exponential models, maximum likelihood estimation, and the Haar condition. J. Amer. Statist. Assoc. 71 737--740.
Crain, B. R. (1976). More on estimation of distributions using orthogonal expansions. J. Amer. Statist. Assoc. 71 741--745.
Freedman, D. (1999). On the Bernstein--von Mises theorem with infinite-dimensional parameters. Ann. Statist. 27 1119--1140.
Ghosal, S. (2001). Convergence rates for density estimation with Bernstein polynomials. Ann. Statist. 29 1264--1280.
Ghosal, S., Ghosh, J. K. and Ramamoorthi, R. V. (1999). Posterior consistency of Dirichlet mixtures in density estimation. Ann. Statist. 27 143--158.
Ghosal, S., Ghosh, J. K. and van der Vaart, A. W. (2000). Convergence rates of posterior distributions. Ann. Statist. 28 500--531.
Ghosal, S. and van der Vaart, A. W. (2001). Entropies and rates of convergence for maximum likelihood and Bayes estimation for mixtures of normal densities. Ann. Statist. 29 1233--1263.
Huang, T.-M. (2004). Convergence rates for posterior distributions and adaptive estimation. Ann. Statist. 32 1556--1593.
Lenk, P. J. (1991). Towards a practicable Bayesian nonparametric density estimator. Biometrika 78 531--543.
Petrone, S. and Wasserman, L. (2002). Consistency of Bernstein polynomial posteriors. J. R. Stat. Soc. Ser. B Stat. Methodol. 64 79--100.
Shen, X. and Wasserman, L. (2001). Rates of convergence of posterior distributions. Ann. Statist. 29 687--714.
van de Geer, S. A. (2000). Empirical Processes in M-Estimation. Cambridge Univ. Press.
Verdinelli, I. and Wasserman, L. (1998). Bayesian goodness-of-fit testing using infinite-dimensional exponential families. Ann. Statist. 26 1215--1241.
Walker, S. (2004). New approaches to Bayesian consistency. Ann. Statist. 32 2028--2043.
Walker, S. and Hjort, N. L. (2001). On Bayesian consistency. J. R. Stat. Soc. Ser. B Stat. Methodol. 63 811--821.
Wasserman, L. (1998). Asymptotic properties of nonparametric Bayesian procedures. In Practical Nonparametric and Semiparametric Bayesian Statistics. Lecture Notes in Statist. 133 293--304. Springer, New York.
Yang, Y. and Barron, A. (1999). Information-theoretic determination of minimax rates of convergence. Ann. Statist. 27 1564--1599.
Zhao, L. H. (2000). Bayesian aspects of some nonparametric problems. Ann. Statist. 28 532--552.