Open Access
Logistic Regression: From Art to Science
Dimitris Bertsimas, Angela King
Statist. Sci. 32(3): 367-384 (August 2017). DOI: 10.1214/16-STS602

Abstract

A high-quality logistic regression model has various desirable properties: predictive power, interpretability, significance, robustness to error in data and sparsity, among others. To achieve these competing goals, modelers incorporate these properties iteratively as they home in on a final model. In the period 1991–2015, algorithmic advances in Mixed-Integer Linear Optimization (MILO) coupled with hardware improvements have resulted in an astonishing 450-billion-fold speedup in solving MILO problems. Motivated by this speedup, we propose modeling logistic regression problems algorithmically with a mixed-integer nonlinear optimization (MINLO) approach in order to explicitly incorporate these properties in a joint, rather than sequential, fashion. The resulting MINLO is flexible and can be adjusted based on the needs of the modeler. Using both real and synthetic data, we demonstrate that the overall approach is generally applicable and provides high-quality solutions in realistic timelines as well as a guarantee on suboptimality. When the MINLO is infeasible, we obtain a guarantee that the requested statistical properties cannot all be imposed simultaneously.
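To make the idea of imposing a property jointly rather than sequentially concrete, the following is a minimal sketch of one such property, sparsity, handled as a hard cardinality constraint: at most k features may enter the model. It is not the authors' MINLO formulation. It replaces the optimization solver with brute-force enumeration of supports, which is practical only for a handful of features. The function names (`penalized_nll`, `fit_logistic`, `best_subset_logistic`), the small ridge term added for numerical stability, and the synthetic data are all assumptions made for illustration.

```python
import itertools
import numpy as np


def penalized_nll(X, y, beta, ridge):
    """Logistic negative log-likelihood plus a small ridge term (an assumption)."""
    z = X @ beta
    return np.sum(np.logaddexp(0.0, z) - y * z) + 0.5 * ridge * (beta @ beta)


def fit_logistic(X, y, ridge=1e-3, iters=100, tol=1e-10):
    """Damped Newton's method; returns (coefficients, objective value)."""
    p = X.shape[1]
    beta = np.zeros(p)
    f = penalized_nll(X, y, beta, ridge)
    for _ in range(iters):
        mu = 0.5 * (1.0 + np.tanh(0.5 * (X @ beta)))  # numerically stable sigmoid
        grad = X.T @ (mu - y) + ridge * beta
        hess = (X * (mu * (1.0 - mu))[:, None]).T @ X + ridge * np.eye(p)
        step = np.linalg.solve(hess, grad)
        t = 1.0
        cand = beta - step
        f_cand = penalized_nll(X, y, cand, ridge)
        # Halve the step until the objective no longer increases.
        while f_cand > f and t > 1e-8:
            t *= 0.5
            cand = beta - t * step
            f_cand = penalized_nll(X, y, cand, ridge)
        if f_cand > f:
            break  # no acceptable step was found
        converged = f - f_cand < tol
        beta, f = cand, f_cand
        if converged:
            break
    return beta, f


def best_subset_logistic(X, y, k, ridge=1e-3):
    """Enumerate every support of size k and keep the best-fitting one.

    An intercept column is always included and is not counted towards k.
    """
    n, p = X.shape
    intercept = np.ones((n, 1))
    best_f, best_support, best_beta = np.inf, None, None
    for support in itertools.combinations(range(p), k):
        Xs = np.hstack([intercept, X[:, list(support)]])
        beta, f = fit_logistic(Xs, y, ridge)
        if f < best_f:
            best_f, best_support, best_beta = f, support, beta
    return best_support, best_beta, best_f


if __name__ == "__main__":
    # Synthetic data (hypothetical): only features 0 and 3 carry signal.
    rng = np.random.default_rng(0)
    X = rng.standard_normal((400, 6))
    true_beta = np.array([3.0, 0.0, 0.0, -3.0, 0.0, 0.0])
    prob = 0.5 * (1.0 + np.tanh(0.5 * (X @ true_beta)))
    y = (rng.random(400) < prob).astype(float)

    support, beta, f = best_subset_logistic(X, y, k=2)
    print("selected features:", support)
    print("coefficients (intercept first):", np.round(beta, 2))
```

The number of supports grows combinatorially with the number of features, which is why enumeration does not scale. A mixed-integer solver searches the same space but can prune large parts of it and report a bound on how far the current solution is from optimal.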

Citation

Dimitris Bertsimas, Angela King. "Logistic Regression: From Art to Science." Statist. Sci. 32 (3) 367-384, August 2017. https://doi.org/10.1214/16-STS602

Information

Published: August 2017
First available in Project Euclid: 1 September 2017

zbMATH: 06870251
MathSciNet: MR3696001
Digital Object Identifier: 10.1214/16-STS602

Keywords: computational statistics, logistic regression, mixed integer nonlinear optimization

Rights: Copyright © 2017 Institute of Mathematical Statistics

Vol. 32 • No. 3 • August 2017