Open Access
December 2024 Dynamic topic language model on heterogeneous children’s mental health clinical notes
Hanwen Ye, Tatiana Moreno, Adrianne Alpern, Louis Ehwerhemuepha, Annie Qu
Author Affiliations +
Ann. Appl. Stat. 18(4): 3165-3184 (December 2024). DOI: 10.1214/24-AOAS1930

Abstract

Mental health diseases which affect children’s lives and well-beings have received increased attention since the COVID-19 pandemic. Analyzing psychiatric clinical notes with topic models is critical to evaluating children’s mental status over time. However, few topic models are built for longitudinal settings, and most existing approaches fail to capture temporal trajectories for each document. To address these challenges, we develop a dynamic topic model with consistent topics and individualized temporal dependencies on the evolving document metadata. Our model preserves the semantic meaning of discovered topics over time and incorporates heterogeneity among documents. In particular, when documents can be categorized, we propose a classifier-free approach to maximize topic heterogeneity across different document groups. We also present an efficient variational optimization procedure adapted for the multistage longitudinal setting. In this case study, we apply our method to the psychiatric clinical notes from a large tertiary pediatric hospital in Southern California and achieve a 38% increase in the overall coherence of extracted topics. Our real data analysis reveals that children tend to express more negative emotions during state shutdowns and more positive when schools reopen. Furthermore, it suggests that sexual and gender minority (SGM) children display more pronounced reactions to major COVID-19 events and a greater sensitivity to vaccine-related news than non-SGM children. This study examines children’s mental health progression during the pandemic and offers clinicians valuable insights to recognize disparities in children’s mental health related to their sexual and gender identities.

Citation

Download Citation

Hanwen Ye. Tatiana Moreno. Adrianne Alpern. Louis Ehwerhemuepha. Annie Qu. "Dynamic topic language model on heterogeneous children’s mental health clinical notes." Ann. Appl. Stat. 18 (4) 3165 - 3184, December 2024. https://doi.org/10.1214/24-AOAS1930

Information

Received: 1 December 2023; Revised: 1 April 2024; Published: December 2024
First available in Project Euclid: 31 October 2024

Digital Object Identifier: 10.1214/24-AOAS1930

Keywords: Classifier-free , multistage topic language models , sexual and gender identity , time-consistent topics , variational inference

Rights: Copyright © 2024 Institute of Mathematical Statistics

Vol.18 • No. 4 • December 2024
Back to Top