December 2022 Data-adaptive efficient estimation strategies for biomarker studies embedded in randomized trials
Wei Zhang, Zhiwei Zhang, James F. Troendle, Aiyi Liu
Author Affiliations +
Ann. Appl. Stat. 16(4): 2250-2265 (December 2022). DOI: 10.1214/21-AOAS1588

Abstract

Predictive and prognostic biomarkers are increasingly important in clinical research and practice. Biomarker studies are frequently embedded in randomized clinical trials with biospecimens collected at baseline and assayed for biomarkers, either in real time or retrospectively. This article proposes efficient estimation strategies for two study settings in terms of biomarker ascertainment: a complete-data setting in which the biomarker is measured for all subjects in the trial, and a two-phase sampling design in which the biomarker is measured retrospectively for a random subsample of subjects selected in an outcome-dependent fashion. In both settings, efficient estimating functions are characterized using semiparametric theory and approximated using data-adaptive machine learning methods, leading to estimators that are consistent, asymptotically normal and (approximately) efficient under general conditions. The proposed methods are evaluated in simulation studies and applied to real data from two biomarker studies, one in each setting.

Funding Statement

This research was supported in part by the intramural research programs of the Eunice Kennedy Shriver National Institute of Child Health and Human Development and the National Heart, Lung and Blood Institute, National Institutes of Health.
Research of W. Zhang was partially supported by the National Natural Science Foundation of China [grant numbers 12001522, 72091212].

Acknowledgments

This manuscript was prepared using data from Data set NCT00644228-D1 from the NCTN/NCORP Data Archive of the National Cancer Institute’s (NCI’s) National Clinical Trials Network (NCTN). Data were originally collected from clinical trial NCT number NCT00644228 “Lenalidomide and Dexamethasone With or Without Bortezomib in Treating Patients With Previously Untreated Multiple Myeloma.” All analyses and conclusions in this manuscript are the sole responsibility of the authors and do not necessarily reflect the opinions or views of the clinical trial investigators, the NCTN, the NCORP or the NCI.

Citation

Download Citation

Wei Zhang. Zhiwei Zhang. James F. Troendle. Aiyi Liu. "Data-adaptive efficient estimation strategies for biomarker studies embedded in randomized trials." Ann. Appl. Stat. 16 (4) 2250 - 2265, December 2022. https://doi.org/10.1214/21-AOAS1588

Information

Received: 1 March 2021; Revised: 1 September 2021; Published: December 2022
First available in Project Euclid: 26 September 2022

MathSciNet: MR4489208
zbMATH: 1498.62275
Digital Object Identifier: 10.1214/21-AOAS1588

Keywords: augmentation , Precision medicine , semiparametric theory , super learner , two-phase sampling

Rights: Copyright © 2022 Institute of Mathematical Statistics

JOURNAL ARTICLE
16 PAGES

This article is only available to subscribers.
It is not available for individual sale.
+ SAVE TO MY LIBRARY

Vol.16 • No. 4 • December 2022
Back to Top