Open Access
April 2020 A Berry–Esseen theorem for Pitman’s $\alpha $-diversity
Emanuele Dolera, Stefano Favaro
Ann. Appl. Probab. 30(2): 847-869 (April 2020). DOI: 10.1214/19-AAP1518

Abstract

This paper contributes to the study of the random number $K_{n}$ of blocks in the random partition of $\{1,\ldots,n\}$ induced by random sampling from the celebrated two parameter Poisson–Dirichlet process. For any $\alpha \in (0,1)$ and $\theta >-\alpha $ Pitman (Combinatorial Stochastic Processes (2006) Springer, Berlin) showed that $n^{-\alpha }K_{n}\stackrel{\text{a.s.}}{\longrightarrow }S_{\alpha,\theta }$ as $n\rightarrow +\infty $, where the limiting random variable, referred to as Pitman’s $\alpha $-diversity, is distributed according to a polynomially scaled Mittag–Leffler distribution function. Our main result is a Berry–Esseen theorem for Pitman’s $\alpha $-diversity $S_{\alpha,\theta }$, namely we show that \[\mathop{\mathrm{sup}}_{x\geq 0}\biggl\vert \mathsf{P}\biggl[\frac{K_{n}}{n^{\alpha }}\leq x\biggr]-\mathsf{P}[S_{\alpha,\theta }\leq x]\biggr\vert \leq\frac{C(\alpha,\theta )}{n^{\alpha }}\] holds for every $n\in \mathbb{N}$ with an explicit constant term $C(\alpha,\theta )$, for $\alpha \in (0,1)$ and $\theta >0$. The proof relies on three intermediate novel results which are of independent interest: (i) a probabilistic representation of the distribution of $K_{n}$ in terms of a compound distribution; (ii) a quantitative version of the Laplace’s approximation method for integrals; (iii) a refined quantitative bound for Poisson approximation. An application of our Berry–Esseen theorem is presented in the context of Bayesian nonparametric inference for species sampling problems, quantifying the error of a posterior approximation that has been extensively applied to infer the number of unseen species in a population.

Citation

Download Citation

Emanuele Dolera. Stefano Favaro. "A Berry–Esseen theorem for Pitman’s $\alpha $-diversity." Ann. Appl. Probab. 30 (2) 847 - 869, April 2020. https://doi.org/10.1214/19-AAP1518

Information

Received: 1 December 2018; Revised: 1 July 2019; Published: April 2020
First available in Project Euclid: 8 June 2020

zbMATH: 07236136
MathSciNet: MR4108124
Digital Object Identifier: 10.1214/19-AAP1518

Subjects:
Primary: 60F15 , 60G57

Keywords: Berry–Esseen theorem , Ewens–Pitman sampling model , Laplace approximation of integrals , Pitman’s $\alpha $-diversity , Poisson approximation , Poisson compound random partitions

Rights: Copyright © 2020 Institute of Mathematical Statistics

Vol.30 • No. 2 • April 2020
Back to Top