February 2023 Optimal estimation of high-dimensional Gaussian location mixtures
Natalie Doss, Yihong Wu, Pengkun Yang, Harrison H. Zhou
Ann. Statist. 51(1): 62-95 (February 2023). DOI: 10.1214/22-AOS2207

Abstract

This paper studies the optimal rate of estimation in a finite Gaussian location mixture model in high dimensions without separation conditions. We assume that the number of components $k$ is bounded and that the centers lie in a ball of bounded radius, while allowing the dimension $d$ to be as large as the sample size $n$. Extending the one-dimensional result of Heinrich and Kahn (Ann. Statist. 46 (2018) 2844–2870), we show that the minimax rate of estimating the mixing distribution in Wasserstein distance is $\Theta((d/n)^{1/4} + n^{-1/(4k-2)})$, achieved by an estimator computable in time $O(nd^2 + n^{5/4})$. Furthermore, we show that the mixture density can be estimated at the optimal parametric rate $\Theta(\sqrt{d/n})$ in Hellinger distance and provide a computationally efficient algorithm to achieve this rate in the special case of $k=2$.
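To make the two terms of the Wasserstein rate concrete, the following minimal sketch evaluates $(d/n)^{1/4}$ and $n^{-1/(4k-2)}$ separately. The function name `wasserstein_rate_terms` is hypothetical and the constants hidden by $\Theta(\cdot)$ are ignored, so the output only indicates which term dominates, not an actual error bound.

```python
def wasserstein_rate_terms(n: int, d: int, k: int) -> tuple[float, float]:
    """Return the two terms of the rate (d/n)^(1/4) + n^(-1/(4k-2)).

    Illustrative only: the constants hidden in the Theta(.) notation are
    dropped, so these values are orders of magnitude, not error bounds.
    """
    if n <= 0 or d <= 0 or k < 1:
        raise ValueError("n, d must be positive and k must be at least 1")
    dimension_term = (d / n) ** 0.25             # cost of high dimension
    component_term = n ** (-1.0 / (4 * k - 2))   # cost of unseparated components
    return dimension_term, component_term


if __name__ == "__main__":
    for n, d, k in [(10**6, 1, 2), (10**6, 10**4, 2), (10**6, 10**4, 3)]:
        dim_t, comp_t = wasserstein_rate_terms(n, d, k)
        dominant = "dimension" if dim_t >= comp_t else "components"
        print(f"n={n}, d={d}, k={k}: {dim_t:.4f} + {comp_t:.4f} ({dominant} term dominates)")
```

For fixed $n$, the second term grows with $k$, while the first term depends only on the ratio $d/n$.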

Both the theoretical and methodological development rely on a careful application of the method of moments. Central to our results is the observation that the information geometry of finite Gaussian mixtures is characterized by the moment tensors of the mixing distribution, whose low-rank structure can be exploited to obtain a sharp local entropy bound.
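As a rough illustration of why moments of the mixing distribution are accessible from data, the sketch below uses a standard identity for the one-dimensional, unit-variance case: if $X = U + Z$ with $Z \sim N(0,1)$ independent of $U$, then $E[He_j(X)] = E[U^j]$, where $He_j$ is the probabilists' Hermite polynomial. This is not the paper's estimator; the helper names `sample_mixture` and `hermite_moment_estimates` are hypothetical, and the setting is deliberately simplified to $d = 1$.

```python
import random


def sample_mixture(n, centers, weights, seed=0):
    """Draw n samples from a 1-D Gaussian location mixture with unit variance."""
    rng = random.Random(seed)
    drawn_centers = rng.choices(centers, weights=weights, k=n)
    return [u + rng.gauss(0.0, 1.0) for u in drawn_centers]


def hermite_moment_estimates(samples):
    """Estimate the first three moments of the mixing distribution.

    Uses He_1(x) = x, He_2(x) = x^2 - 1, He_3(x) = x^3 - 3x, whose sample
    averages are unbiased for E[U], E[U^2], E[U^3] when the noise is N(0, 1).
    """
    n = len(samples)
    m1 = sum(samples) / n
    m2 = sum(x * x - 1.0 for x in samples) / n
    m3 = sum(x ** 3 - 3.0 * x for x in samples) / n
    return m1, m2, m3


if __name__ == "__main__":
    # Two equally weighted centers at -1 and +1: true moments are (0, 1, 0).
    xs = sample_mixture(100_000, centers=[-1.0, 1.0], weights=[0.5, 0.5])
    print(hermite_moment_estimates(xs))
```

In higher dimensions the scalar moments become moment tensors, which is where the low-rank structure mentioned above comes into play.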

Funding Statement

Y. Wu is supported in part by NSF Grant CCF-1900507, an NSF CAREER award CCF-1651588, and an Alfred Sloan fellowship.
P. Yang was supported in part by the National Science Foundation of China (NSFC) Grant 12101353.
H. H. Zhou was supported in part by the National Science Foundation (NSF) Grant DMS-2112918.

Citation

Natalie Doss, Yihong Wu, Pengkun Yang, Harrison H. Zhou. "Optimal estimation of high-dimensional Gaussian location mixtures." Ann. Statist. 51 (1) 62-95, February 2023. https://doi.org/10.1214/22-AOS2207

Information

Received: 1 February 2020; Revised: 1 April 2022; Published: February 2023
First available in Project Euclid: 23 March 2023

MathSciNet: MR4564849
zbMATH: 07684005
Digital Object Identifier: 10.1214/22-AOS2207

Subjects:
Primary: 62G05, 62G07
Secondary: 62C20

Keywords: finite mixture model, Gaussian mixture, high-dimensional density estimation, low-rank tensor, method of moments, metric entropy, minimax optimality

Rights: Copyright © 2023 Institute of Mathematical Statistics

JOURNAL ARTICLE
34 PAGES
