Abstract
We provide a unifying approach to central limit type theorems for empirical optimal transport (OT). The limit distribution is given by a supremum of a centered Gaussian process, and we explicitly characterize when it is centered normal or degenerates to a Dirac measure. Moreover, in contrast to recent contributions to distributional limit laws for empirical OT on Euclidean spaces which require centering around its expectation, the limits obtained here are centered around the population quantity, which is well-suited for statistical applications such as goodness-of-fit testing and randomized OT computation. Overall, our distributional limits are valid if one of the population probability measures is of intrinsic dimension at most three. At the heart of our theory lies the Kantorovich duality which represents the OT cost as a supremum over a function class for an underlying sufficiently regular and possibly unbounded cost function c. In this regard, OT is considered as a functional defined on , the Banach space of bounded functionals from to and equipped with uniform norm. We prove the OT functional to be Hadamard directionally differentiable and conclude distributional convergence for increasing sample size via a functional delta method that necessitates weak convergence of an underlying empirical process in . The latter can be dealt with empirical process theory and requires to be a Donsker class. We give sufficient conditions depending on the dimension of the ground space, the underlying cost function and the probability measures under consideration to guarantee the Donsker property. Altogether, our approach reveals a noteworthy trade-off inherent in central limit theorems for empirical OT: Kantorovich duality requires to be sufficiently rich, while weak convergence of the underlying empirical processes only occurs if is not too complex.
Funding Statement
S. Hundrieser, M. Klatt, and T. Staudt gratefully acknowledge support from the DFG RTG 2088 and DFG EXC 2067/1- 390729940, and A. Munk of DFG CRC 1456 A4, C6.
Acknowledgements
The authors also thank the Associate Editor and two anonymous referees for helpful suggestions which significantly improved the quality of this work.
Citation
Shayan Hundrieser. Marcel Klatt. Axel Munk. Thomas Staudt. "A unifying approach to distributional limits for empirical optimal transport." Bernoulli 30 (4) 2846 - 2877, November 2024. https://doi.org/10.3150/23-BEJ1697
Information