Open Access
February 2021 Comparing a large number of multivariate distributions
Ilmun Kim
Bernoulli 27(1): 419-441 (February 2021). DOI: 10.3150/20-BEJ1244

Abstract

In this paper, we propose a test for the equality of multiple distributions based on kernel mean embeddings. Our framework provides a flexible way to handle multivariate data by virtue of kernel methods and allows the number of distributions to increase with the sample size. This is in contrast to previous studies that have been mostly restricted to classical univariate settings with a fixed number of distributions. By building on Cramér-type moderate deviation for degenerate two-sample $V$-statistics, we derive the limiting null distribution of the test statistic and show that it converges to a Gumbel distribution. The limiting distribution, however, depends on an infinite number of nuisance parameters, which makes it infeasible for use in practice. To address this issue, the proposed test is implemented via the permutation procedure and is shown to be minimax rate optimal against sparse alternatives. During our analysis, an exponential concentration inequality for the permuted test statistic is developed which may be of independent interest.

Citation

Download Citation

Ilmun Kim. "Comparing a large number of multivariate distributions." Bernoulli 27 (1) 419 - 441, February 2021. https://doi.org/10.3150/20-BEJ1244

Information

Received: 1 May 2019; Revised: 1 February 2020; Published: February 2021
First available in Project Euclid: 20 November 2020

zbMATH: 07282856
MathSciNet: MR4177375
Digital Object Identifier: 10.3150/20-BEJ1244

Keywords: Bobkov’s inequality , K-sample test , maximum mean discrepancy , Permutation test

Rights: Copyright © 2021 Bernoulli Society for Mathematical Statistics and Probability

Vol.27 • No. 1 • February 2021
Back to Top