Open Access
2023 A convex-nonconvex strategy for grouped variable selection
Xiaoqian Liu, Aaron J. Molstad, Eric C. Chi
Author Affiliations +
Electron. J. Statist. 17(2): 2912-2961 (2023). DOI: 10.1214/23-EJS2167

Abstract

This paper deals with the grouped variable selection problem. A widely used strategy is to augment the negative log-likelihood function with a sparsity-promoting penalty. Existing methods include the group Lasso, group SCAD, and group MCP. The group Lasso solves a convex optimization problem but suffers from underestimation bias. The group SCAD and group MCP avoid this estimation bias but require solving a nonconvex optimization problem that may be plagued by suboptimal local optima. In this work, we propose an alternative method based on the generalized minimax concave (GMC) penalty, which is a folded concave penalty that maintains the convexity of the objective function. We develop a new method for grouped variable selection in linear regression, the group GMC, that generalizes the strategy of the original GMC estimator. We present a primal-dual algorithm for computing the group GMC estimator and also prove properties of the solution path to guide its numerical computation and tuning parameter selection in practice. We establish error bounds for both the group GMC and original GMC estimators. A rich set of simulation studies and a real data application indicate that the proposed group GMC approach outperforms existing methods in several different aspects under a wide array of scenarios.

Funding Statement

Aaron J. Molstad’s research was supported by NSF DMS-2113589. Eric C. Chi’s research was supported by NSF DMS-2201136.

Acknowledgments

The authors are grateful to the editor, the associate editor, and the two referees for their helpful comments and suggestions.

Citation

Download Citation

Xiaoqian Liu. Aaron J. Molstad. Eric C. Chi. "A convex-nonconvex strategy for grouped variable selection." Electron. J. Statist. 17 (2) 2912 - 2961, 2023. https://doi.org/10.1214/23-EJS2167

Information

Received: 1 February 2022; Published: 2023
First available in Project Euclid: 10 November 2023

Digital Object Identifier: 10.1214/23-EJS2167

Keywords: Convex optimization , convex-nonconvex penalization , high-dimensional data analysis , sparse linear regression

Vol.17 • No. 2 • 2023
Back to Top