Open Access
December 2019 Robust elastic net estimators for variable selection and identification of proteomic biomarkers
Gabriela V. Cohen Freue, David Kepplinger, Matías Salibián-Barrera, Ezequiel Smucler
Ann. Appl. Stat. 13(4): 2065-2090 (December 2019). DOI: 10.1214/19-AOAS1269

Abstract

In large-scale quantitative proteomic studies, scientists measure the abundance of thousands of proteins from the human proteome in search of novel biomarkers for a given disease. Penalized regression estimators can be used to identify potential biomarkers among a large set of molecular features measured. Yet, the performance and statistical properties of these estimators depend on the loss and penalty functions used to define them. Motivated by a real plasma proteomic biomarkers study, we propose a new class of penalized robust estimators based on the elastic net penalty, which can be tuned to keep groups of correlated variables together in the selected model and maintain robustness against possible outliers. We also propose an efficient algorithm to compute our robust penalized estimators and derive a data-driven method to select the penalty term. Our robust penalized estimators have very good robustness properties and are also consistent under certain regularity conditions. Numerical results show that our robust estimators compare favorably to other robust penalized estimators. Using our proposed methodology for the analysis of the proteomics data, we identify new potentially relevant biomarkers of cardiac allograft vasculopathy that are not found with nonrobust alternatives. The selected model is validated in a new set of 52 test samples and achieves an area under the receiver operating characteristic (AUC) of 0.85.

Citation

Download Citation

Gabriela V. Cohen Freue. David Kepplinger. Matías Salibián-Barrera. Ezequiel Smucler. "Robust elastic net estimators for variable selection and identification of proteomic biomarkers." Ann. Appl. Stat. 13 (4) 2065 - 2090, December 2019. https://doi.org/10.1214/19-AOAS1269

Information

Received: 1 March 2018; Revised: 1 February 2019; Published: December 2019
First available in Project Euclid: 28 November 2019

zbMATH: 07160931
MathSciNet: MR4037422
Digital Object Identifier: 10.1214/19-AOAS1269

Keywords: elastic net penalty , penalized estimation , proteomics biomarkers , regularized estimation , robust estimation

Rights: Copyright © 2019 Institute of Mathematical Statistics

Vol.13 • No. 4 • December 2019
Back to Top