Open Access
2024 Hierarchical Mixture of Finite Mixtures (with Discussion)
Alessandro Colombi, Raffaele Argiento, Federico Camerlenghi, Lucia Paci
Author Affiliations +
Bayesian Anal. Advance Publication 1-29 (2024). DOI: 10.1214/24-BA1501

Abstract

Statistical modelling in the presence of data organized in groups is a crucial task in Bayesian statistics. The present paper conceives a mixture model based on a novel family of Bayesian priors designed for multilevel data and obtained by normalizing a finite point process. In particular, the work extends the popular Mixture of Finite Mixtures model to the hierarchical framework to capture heterogeneity within and between groups. A full distribution theory for this new family and the induced clustering is developed, including the marginal, posterior, and predictive distributions. Efficient marginal and conditional Gibbs samplers are designed to provide posterior inference. The proposed mixture model outperforms the Hierarchical Dirichlet Process, the foremost tool for handling multilevel data, in terms of analytical feasibility, clustering discovery, and computational time. The motivating application comes from the analysis of shot put data, which contains performance measurements of athletes across different seasons. In this setting, the proposed model is exploited to induce clustering of the observations across seasons and athletes. By linking clusters across seasons, similarities and differences in athletes’ performances are identified.

Funding Statement

The first and the third authors gratefully acknowledge support from the Italian Ministry of Education, University and Research (MUR), “Dipartimenti di Eccellenza” grant 2023-2027. The research of the second and the third authors was partially supported by MUR-PRIN grant 2022 CLTYP4, funded by the European Union – Next Generation EU. The research of the fourth author was partially supported by MUR-PRIN grant 2022 SMNNKY, funded by the European Union – Next Generation EU.

Acknowledgments

We would like to thank the Editor, the Associate Editor, and the Referees for their insightful and constructive comments. We would like to thank Mario Beraha (Department of Mathematics, Politecnico di Milano) and Silvia Montagna (Department of Economics, University of Modena and Regio Emilia) for the helpful discussions.

Citation

Download Citation

Alessandro Colombi. Raffaele Argiento. Federico Camerlenghi. Lucia Paci. "Hierarchical Mixture of Finite Mixtures (with Discussion)." Bayesian Anal. Advance Publication 1 - 29, 2024. https://doi.org/10.1214/24-BA1501

Information

Published: 2024
First available in Project Euclid: 15 December 2024

Digital Object Identifier: 10.1214/24-BA1501

Subjects:
Primary: 62F15 , 62G05 , 62H30

Keywords: Model-based clustering , multilevel data , Partial exchangeability , Sports analytics , vector of finite Dirichlet processes

Rights: © 2024 International Society for Bayesian Analysis

Advance Publication
Back to Top