Open Access
2016 Minimum Distance Lasso for robust high-dimensional regression
Aurélie C. Lozano, Nicolai Meinshausen, Eunho Yang
Electron. J. Statist. 10(1): 1296-1340 (2016). DOI: 10.1214/16-EJS1136

Abstract

We propose a minimum distance estimation method for robust regression in sparse high-dimensional settings. Likelihood-based estimators lack resilience against outliers and model misspecification, a critical issue when dealing with high-dimensional noisy data. Our method, Minimum Distance Lasso (MD-Lasso), combines minimum distance functionals customarily used in nonparametric estimation for robustness, with $\ell_{1}$-regularization. MD-Lasso is governed by a scaling parameter capping the influence of outliers: the loss is locally convex and close to quadratic for small squared residuals, and flattens for squared residuals larger than the scaling parameter. As the parameter approaches infinity the estimator becomes equivalent to least-squares Lasso. MD-Lasso is able to maintain the robustness of minimum distance functionals in sparse high-dimensional regression. The estimator achieves maximum breakdown point and enjoys consistency with fast convergence rates under mild conditions on the model error distribution. These hold for any solution in a convexity region around the true parameter and in certain cases for every solution. We provide an alternative set of results that do not require the solutions to lie within the convexity region but where the $\ell_{2}$-norm of the feasible solutions is constrained within a safety radius. Thanks to this constraint, a first-order optimization method is able to produce local optima that are consistent. A connection is established with re-weighted least-squares that intuitively explains MD-Lasso robustness. The merits of our method are demonstrated through simulation and eQTL analysis.

Citation

Download Citation

Aurélie C. Lozano. Nicolai Meinshausen. Eunho Yang. "Minimum Distance Lasso for robust high-dimensional regression." Electron. J. Statist. 10 (1) 1296 - 1340, 2016. https://doi.org/10.1214/16-EJS1136

Information

Received: 1 August 2014; Published: 2016
First available in Project Euclid: 19 May 2016

zbMATH: 1349.62322
MathSciNet: MR3504182
Digital Object Identifier: 10.1214/16-EJS1136

Keywords: high-dimensional variable selection , Lasso , robust estimation , Sparse learning

Rights: Copyright © 2016 The Institute of Mathematical Statistics and the Bernoulli Society

Vol.10 • No. 1 • 2016
Back to Top