December 2024 Bayesian hidden Markov model for natural history of colorectal cancer: Handling misclassified observations, varying observation schemes and unobserved data
Aapeli Nevala, Sirpa Heinävaara, Tytti Sarkeala, Sangita Kulathinal
Author Affiliations +
Ann. Appl. Stat. 18(4): 3050-3070 (December 2024). DOI: 10.1214/24-AOAS1922

Abstract

Statistical modelling of individual-level event history data arising from varying observation schemes is a challenging problem, particularly due to unobserved and possibly misclassified individual states. Commonly used approaches rely on the hidden Markov models (HMM) to incorporate true underlying states. Each approach needs to account for the underlying data generating process and related external information and requires assumptions for estimation. This article develops a Bayesian HMM for natural history of colorectal cancer (CRC), combining data on latent disease states from randomised screening study and on observed clinical cancers from the population-based cancer registry. With our modelling approach and study design, we are able to provide estimates for latent state occupancy probabilities not only for screening-attenders but also for the control group and those who never attended screening—despite data on latent states only existing for the attenders. We use simulation-based calibration to ensure that posterior distributions can be reliably estimated despite the challenges brought in by the sampling scheme. We apply Bayesian computation to obtain posterior estimates of the quantities of interest. Two algorithms, Hamiltonian Monte Carlo (HMC) and Automatic Differentiation Variational Inference (ADVI), are applied and compared, first by using simulated data and then with a real data set. The modelling workflow can be applied for different cancer screening programmes and datasets which typically have similar challenges.

Funding Statement

The project was supported by Cancer Foundation Finland sr grant 190135.

Acknowledgments

Aapeli Nevala is also affiliated with Department of Mathematics and Statistics, Universty of Helsinki.

Sirpa Heinävaara is also affiliated with Department of Public Health, University of Helsinki.

Citation

Download Citation

Aapeli Nevala. Sirpa Heinävaara. Tytti Sarkeala. Sangita Kulathinal. "Bayesian hidden Markov model for natural history of colorectal cancer: Handling misclassified observations, varying observation schemes and unobserved data." Ann. Appl. Stat. 18 (4) 3050 - 3070, December 2024. https://doi.org/10.1214/24-AOAS1922

Information

Received: 1 October 2023; Revised: 1 April 2024; Published: December 2024
First available in Project Euclid: 31 October 2024

Digital Object Identifier: 10.1214/24-AOAS1922

Keywords: Bayesian statistics , cancer epidemiology , Cancer screening , event-history analysis , Hidden Markov model , multistate models , Stan

Rights: Copyright © 2024 Institute of Mathematical Statistics

Vol.18 • No. 4 • December 2024
Back to Top