Abstract
Recently, S. Arlot and R. Genuer have shown that a random forest model outperforms its single-tree counterpart in estimating α-Hölder functions, . This backs up the idea that ensembles of tree estimators are smoother estimators than single trees. On the other hand, most positive optimality results on Bayesian tree-based methods assume that . Naturally, one wonders whether Bayesian counterparts of forest estimators are optimal on smoother classes, just as observed with frequentist estimators for . We focus on density estimation and introduce an ensemble estimator from the classical (truncated) Pólya tree construction in Bayesian nonparametrics. Inspired by the work mentioned above, the resulting Bayesian forest estimator is shown to lead to optimal posterior contraction rates, up to logarithmic terms, for the Hellinger and distances on probability density functions on for arbitrary Hölder regularity . This improves upon previous results for constructions related to the Pólya tree prior, whose optimality was only proven when . Also, by adding a hyperprior on the trees’ depth, we obtain an adaptive version of the prior that does not require α to be specified to attain optimality.
Acknowledgements
I sincerely thank Pr. Ismaël Castillo (Sorbonne Université), whose guidance and insights into the study of Bayesian tree-based methods were of tremendous help in the making of this paper.
Citation
Thibault Randrianarisoa. "Smoothing and adaptation of shifted Pólya tree ensembles." Bernoulli 28 (4) 2492 - 2517, November 2022. https://doi.org/10.3150/21-BEJ1426