Open Access
2020 On the distribution, model selection properties and uniqueness of the Lasso estimator in low and high dimensions
Karl Ewald, Ulrike Schneider
Electron. J. Statist. 14(1): 944-969 (2020). DOI: 10.1214/20-EJS1687

Abstract

We derive expressions for the finite-sample distribution of the Lasso estimator in the context of a linear regression model in low as well as in high dimensions by exploiting the structure of the optimization problem defining the estimator. In low dimensions, we assume full rank of the regressor matrix and present expressions for the cumulative distribution function as well as the densities of the absolutely continuous parts of the estimator. Our results are presented for the case of normally distributed errors, but do not hinge on this assumption and can easily be generalized. Additionally, we establish an explicit formula for the correspondence between the Lasso and the least-squares estimator. We derive analogous results for the distribution in less explicit form in high dimensions where we make no assumptions on the regressor matrix at all. In this setting, we also investigate the model selection properties of the Lasso and show that possibly only a subset of models might be selected by the estimator, completely independently of the observed response vector. Finally, we present a condition for uniqueness of the estimator that is necessary as well as sufficient.

Citation

Download Citation

Karl Ewald. Ulrike Schneider. "On the distribution, model selection properties and uniqueness of the Lasso estimator in low and high dimensions." Electron. J. Statist. 14 (1) 944 - 969, 2020. https://doi.org/10.1214/20-EJS1687

Information

Received: 1 April 2019; Published: 2020
First available in Project Euclid: 18 February 2020

zbMATH: 07200222
MathSciNet: MR4065178
Digital Object Identifier: 10.1214/20-EJS1687

Subjects:
Primary: 62E15
Secondary: 62J05 , 62J07

Keywords: distribution , Lasso , Model selection , uniqueness

Vol.14 • No. 1 • 2020
Back to Top