The Annals of Applied Statistics
- Ann. Appl. Stat.
- Volume 12, Number 1 (2018), 222-245.
How Gaussian mixture models might miss detecting factors that impact growth patterns
Longitudinal studies play a prominent role in biological, social, and behavioral sciences. Repeated measurements over time facilitate the study of an outcome level, how individuals change over time, and the factors that may impact either or both. A standard approach to modeling childhood growth over time is to use multilevel or mixed effects models to study factors that might play a role in the level and growth over time. However, there has been increased interest in using mixture models, which have inherent grouping structure to more flexibly explain heterogeneity in the longitudinal outcomes, to study growth patterns. While several possible model specifications can be used, these methods generally fail to explicitly group individuals by the shape of their growth pattern separate from level, and thus fail to shed light on the relationships between growth pattern and potential explanatory factors. We illustrate the weaknesses of these methods as they are currently being used. We also propose a pre-processing step that removes the outcome level to focus explicitly on shape, discuss its impact on estimation, and demonstrate its usefulness though a simulation study and with real longitudinal data.
Ann. Appl. Stat. Volume 12, Number 1 (2018), 222-245.
Received: July 2016
Revised: May 2017
First available in Project Euclid: 9 March 2018
Permanent link to this document
Digital Object Identifier
Heggeseth, Brianna C.; Jewell, Nicholas P. How Gaussian mixture models might miss detecting factors that impact growth patterns. Ann. Appl. Stat. 12 (2018), no. 1, 222--245. doi:10.1214/17-AOAS1066. https://projecteuclid.org/euclid.aoas/1520564471
- Supplement A: Growth simulation. The supplement includes a description and the results an additional simulation study that mimics real childhood growth data.
- Supplement B: Additional CHAMACOS results. The supplement includes the relative risk ratio estimates from the CHAMACOS data example.