Open Access
2024 Trade-off between predictive performance and FDR control for high-dimensional Gaussian model selection
Perrine Lacroix, Marie-Laure Martin
Author Affiliations +
Electron. J. Statist. 18(2): 2886-2930 (2024). DOI: 10.1214/24-EJS2260

Abstract

In the context of high-dimensional Gaussian linear regression for ordered variables, we study the variable selection procedure via the minimization of the penalized least-squares criterion. We focus on model selection where the penalty function depends on an unknown multiplicative constant commonly calibrated for prediction. We propose a new proper calibration of this hyperparameter to simultaneously control predictive risk and false discovery rate. We obtain non-asymptotic bounds on the False Discovery Rate with respect to the hyperparameter and we provide an algorithm to calibrate it. This algorithm is based on quantities that can typically be observed in real data applications. The algorithm is validated in an extensive simulation study and is compared with several existing variable selection procedures. Finally, we study an extension of our approach to the case in which an ordering of the variables is not available.

Funding Statement

This research is supported in part by a public grant as part of the Investissement d’avenir project, reference ANR-11-LABX-0056-LMH, LabEx LMH. IPS2 benefits from the support of the LabEx Saclay Plant Sciences-SPS (ANR-17-EUR-0007).

Acknowledgments

The authors warmly thank and are grateful to Pascal Massart (Laboratoire de Mathématiques d’Orsay, Université Paris-Saclay) for helpful discussions and valuable comments. The authors would like to sincerely thank the Editor and the two anonymous referees for their valuable comments, suggestions and feedbacks which improved the paper.

Citation

Download Citation

Perrine Lacroix. Marie-Laure Martin. "Trade-off between predictive performance and FDR control for high-dimensional Gaussian model selection." Electron. J. Statist. 18 (2) 2886 - 2930, 2024. https://doi.org/10.1214/24-EJS2260

Information

Received: 1 July 2023; Published: 2024
First available in Project Euclid: 12 July 2024

Digital Object Identifier: 10.1214/24-EJS2260

Subjects:
Primary: 62J05 , 62P99
Secondary: 62J07

Keywords: FDR , Gaussian regression , high-dimension , hyperparameter calibration , Ordered variable selection , prediction

Vol.18 • No. 2 • 2024
Back to Top