Abstract
We propose a theoretical framework for the problem of learning a real-valued function which meets fairness requirements. This framework is built upon the notion of α-relative (fairness) improvement of the regression function which we introduce using the theory of optimal transport. Setting corresponds to the regression problem under the Demographic Parity constraint, while corresponds to the classical regression problem without any constraints. For the proposed framework allows to continuously interpolate between these two extreme cases and to study partially fair predictors. Within this framework, we precisely quantify the cost in risk induced by the introduction of the fairness constraint. We put forward a statistical minimax setup and derive a general problem-dependent lower bound on the risk of any estimator satisfying α-relative improvement constraint. We illustrate our framework on a model of linear regression with Gaussian design and systematic group-dependent bias, deriving matching (up to absolute constants) upper and lower bounds on the minimax risk under the introduced constraint. We provide a general post-processing strategy which enjoys fairness, risk guarantees and can be applied on top of any black-box algorithm. Finally, we perform a simulation study of the linear model and numerical experiments of benchmark data, validating our theoretical contributions.
Funding Statement
EC acknowledges the support of ANR-11-LABX-0056-LMH (Labex LMH) and of ANR PIA funding: ANR-20-IDEES-0002.
Acknowledgments
During the preparation of the main part of this work NS was affiliated with Institut Polytechnique de Paris, ENSAE, CREST.
The authors would like to thank Arnak Dalalyan and Mohamed Hebiri for their helpful comments.
Citation
Evgenii Chzhen. Nicolas Schreuder. "A minimax framework for quantifying risk-fairness trade-off in regression." Ann. Statist. 50 (4) 2416 - 2442, August 2022. https://doi.org/10.1214/22-AOS2198
Information