December 2023 Targeting underrepresented populations in precision medicine: A federated transfer learning approach
Sai Li, Tianxi Cai, Rui Duan
Author Affiliations +
Ann. Appl. Stat. 17(4): 2970-2992 (December 2023). DOI: 10.1214/23-AOAS1747

Abstract

The limited representation of minorities and disadvantaged populations in large-scale clinical and genomics research poses a significant barrier to translating precision medicine research into practice. Prediction models are likely to underperform in underrepresented populations due to heterogeneity across populations, thereby exacerbating known health disparities. To address this issue, we propose FETA, a two-way data integration method that leverages a federated transfer learning approach to integrate heterogeneous data from diverse populations and multiple healthcare institutions, with a focus on a target population of interest having limited sample sizes. We show that FETA achieves performance comparable to the pooled analysis, where individual-level data is shared across institutions, with only a small number of communications across participating sites. Our theoretical analysis and simulation study demonstrate how FETA’s estimation accuracy is influenced by communication budgets, privacy restrictions, and heterogeneity across populations. We apply FETA to multisite data from the electronic Medical Records and Genomics (eMERGE) Network to construct genetic risk prediction models for extreme obesity. Compared to models trained using target data only, source data only, and all data without accounting for population-level differences, FETA shows superior predictive performance. FETA has the potential to improve estimation and prediction accuracy in underrepresented populations and reduce the gap in model performance across populations.

Funding Statement

Rui Duan was supported by National Institutes of Health (R01 GM148494) and Tianxi Cai was supported by National Institutes of Health (R01 LM013614, R01 HL089778).

Acknowledgments

The authors would like to thank the anonymous referees, an Associate Editor, and the Editor for their constructive comments that improved the quality of this paper. The datasets used for the analyses described in this manuscript were obtained from dbGaP at http://www.ncbi.nlm.nih.gov/gap through dbGaP accession number phs000888.v1.p1.

Citation

Download Citation

Sai Li. Tianxi Cai. Rui Duan. "Targeting underrepresented populations in precision medicine: A federated transfer learning approach." Ann. Appl. Stat. 17 (4) 2970 - 2992, December 2023. https://doi.org/10.1214/23-AOAS1747

Information

Received: 1 March 2022; Revised: 1 October 2022; Published: December 2023
First available in Project Euclid: 30 October 2023

MathSciNet: MR4661684
Digital Object Identifier: 10.1214/23-AOAS1747

Keywords: Federated learning , health equity , Precision medicine , risk prediction , transfer learning

Rights: Copyright © 2023 Institute of Mathematical Statistics

Vol.17 • No. 4 • December 2023
Back to Top