Annals of Applied Statistics
- Ann. Appl. Stat.
- Volume 11, Number 4 (2017), 2080-2110.
Variable selection for latent class analysis with application to low back pain diagnosis
Michael Fop, Keith M. Smart, and Thomas Brendan Murphy
Abstract
The identification of most relevant clinical criteria related to low back pain disorders may aid the evaluation of the nature of pain suffered in a way that usefully informs patient assessment and treatment. Data concerning low back pain can be of categorical nature, in the form of a check-list in which each item denotes presence or absence of a clinical condition. Latent class analysis is a model-based clustering method for multivariate categorical responses, which can be applied to such data for a preliminary diagnosis of the type of pain. In this work, we propose a variable selection method for latent class analysis applied to the selection of the most useful variables in detecting the group structure in the data. The method is based on the comparison of two different models and allows the discarding of those variables with no group information and those variables carrying the same information as the already selected ones. We consider a swap-stepwise algorithm where at each step the models are compared through an approximation to their Bayes factor. The method is applied to the selection of the clinical criteria most useful for the clustering of patients in different classes. It is shown to perform a parsimonious variable selection and to give a clustering performance comparable to the expert-based classification of patients into three classes of pain.
Article information
Source
Ann. Appl. Stat., Volume 11, Number 4 (2017), 2080-2110.
Dates
Received: February 2017
Revised: May 2017
First available in Project Euclid: 28 December 2017
Permanent link to this document
https://projecteuclid.org/euclid.aoas/1514430278
Digital Object Identifier
doi:10.1214/17-AOAS1061
Mathematical Reviews number (MathSciNet)
MR3743289
Zentralblatt MATH identifier
1383.62268
Keywords
Clinical criteria selection clustering latent class analysis low back pain mixture models model-based clustering variable selection
Citation
Fop, Michael; Smart, Keith M.; Murphy, Thomas Brendan. Variable selection for latent class analysis with application to low back pain diagnosis. Ann. Appl. Stat. 11 (2017), no. 4, 2080--2110. doi:10.1214/17-AOAS1061. https://projecteuclid.org/euclid.aoas/1514430278
Supplemental materials
- Supplementary information, data and R code [Fop, Smart and Murphy (2017)]. The .zip folder contains a document with: further considerations regarding the “don’t know” entries, a description of the backward-stepwise selection algorithm for the multinomial logistic regression, a detailed description of the simulated data experiments, a complete list of clinical criteria and a notation page for reference. The folder also contains the data used in this paper and R code implementing the variable selection method.Digital Object Identifier: doi:10.1214/17-AOAS1061SUPPSupplemental files are immediately available to subscribers. Non-subscribers gain access to supplemental files with the purchase of the article.

