Abstract
In this paper, we propose differentially private algorithms for robust (multivariate) mean estimation and inference under heavy-tailed distributions, with a focus on Gaussian differential privacy. First, we provide a comprehensive analysis of the Huber mean estimator with increasing dimensions, including non-asymptotic deviation bound, Bahadur representation, and (uniform) Gaussian approximations. Secondly, we privatize the Huber mean estimator via noisy gradient descent, which is proven to achieve near-optimal statistical guarantees. The key is to characterize quantitatively the trade-off between statistical accuracy, degree of robustness and privacy level, governed by a carefully chosen robustification parameter. Finally, we construct private confidence intervals for the proposed estimator by incorporating a private and robust covariance estimator. Our findings are demonstrated by simulation studies.
Funding Statement
MY and WZ are supported in part by the NSF Grants DMS-2113409 and DMS-2401268. ZR is supported in part by the NSF Grant DMS-2113568 and the Central Research Development Fund at the University of Pittsburgh.
Acknowledgments
We thank the Editor, an Associate Editor, and three anonymous reviewers for their constructive comments and valuable suggestions, which have significantly helped us improve the quality of this work.
Citation
Myeonghun Yu. Zhao Ren. Wen-Xin Zhou. "Gaussian differentially private robust mean estimation and inference." Bernoulli 30 (4) 3059 - 3088, November 2024. https://doi.org/10.3150/23-BEJ1706
Information