October 2022 Is infinity that far? A Bayesian nonparametric perspective of finite mixture models
Raffaele Argiento, Maria De Iorio
Author Affiliations +
Ann. Statist. 50(5): 2641-2663 (October 2022). DOI: 10.1214/22-AOS2201

Abstract

Mixture models are one of the most widely used statistical tools when dealing with data from heterogeneous populations. Following a Bayesian nonparametric perspective, we introduce a new class of priors: the Normalized Independent Point Process. We investigate the probabilistic properties of this new class and present many special cases. In particular, we provide an explicit formula for the distribution of the implied partition, as well as the posterior characterization of the new process in terms of the superposition of two discrete measures. We also provide consistency results. Moreover, we design both a marginal and a conditional algorithm for finite mixture models with a random number of components. These schemes are based on an auxiliary variable MCMC, which allows handling the otherwise intractable posterior distribution and overcomes the challenges associated with the Reversible Jump algorithm. We illustrate the performance and the potential of our model in a simulation study and on real data applications.

Funding Statement

Dr Argiento is grateful to National University of Singapore for the funding provided.

Acknowledgement

We would like to thank Dr Peter Galbusera at the Royal Zoological Society of Antwerp for sharing the enriched Taita Thrush Dataset. We are also grateful to Judith Rousseau, Igor Prünster and XuanLong Nguyen for their advice.

Maria de Iorio is also affiliated to the Department of Statistical Science at University College London (UK). Raffaele Argiento is also affiliated to Collegio Carlo Alberto Torino (IT).

Citation

Download Citation

Raffaele Argiento. Maria De Iorio. "Is infinity that far? A Bayesian nonparametric perspective of finite mixture models." Ann. Statist. 50 (5) 2641 - 2663, October 2022. https://doi.org/10.1214/22-AOS2201

Information

Received: 1 October 2019; Revised: 1 May 2022; Published: October 2022
First available in Project Euclid: 27 October 2022

MathSciNet: MR4505373
zbMATH: 07628835
Digital Object Identifier: 10.1214/22-AOS2201

Subjects:
Primary: 60G57 , 62F15
Secondary: 62G07 , 92C20

Keywords: Bayesian clustering , Bayesian mixture models , Dirichlet process , Markov chain Monte Carlo methods , mixture of finite mixtures

Rights: Copyright © 2022 Institute of Mathematical Statistics

Vol.50 • No. 5 • October 2022
Back to Top