Open Access
October 2022 Testing goodness-of-fit and conditional independence with approximate co-sufficient sampling
Rina Foygel Barber, Lucas Janson
Author Affiliations +
Ann. Statist. 50(5): 2514-2544 (October 2022). DOI: 10.1214/22-AOS2187

Abstract

Goodness-of-fit (GoF) testing is ubiquitous in statistics, with direct ties to model selection, confidence interval construction, conditional independence testing, and multiple testing, just to name a few applications. While testing the GoF of a simple (point) null hypothesis provides an analyst great flexibility in the choice of test statistic while still ensuring validity, most GoF tests for composite null hypotheses are far more constrained, as the test statistic must have a tractable distribution over the entire null model space. A notable exception is co-sufficient sampling (CSS): resampling the data conditional on a sufficient statistic for the null model guarantees valid GoF testing using any test statistic the analyst chooses. But CSS testing requires the null model to have a compact (in an information-theoretic sense) sufficient statistic, which only holds for a very limited class of models; even for a null model as simple as logistic regression, CSS testing is powerless. In this paper, we leverage the concept of approximate sufficiency to generalize CSS testing to essentially any parametric model with an asymptotically efficient estimator; we call our extension “approximate CSS” (aCSS) testing. We quantify the finite-sample Type I error inflation of aCSS testing and show that it is vanishing under standard maximum likelihood asymptotics, for any choice of test statistic. We apply our proposed procedure both theoretically and in simulation to a number of models of interest to demonstrate its finite-sample Type I error and power.

Funding Statement

The first author was supported by the National Science Foundation via grant DMS-1654076, and by the Office of Naval Research via grant N00014-20-1-2337.

Acknowledgments

The authors would like to thank Michael Bian for help with some of the computation.

Citation

Download Citation

Rina Foygel Barber. Lucas Janson. "Testing goodness-of-fit and conditional independence with approximate co-sufficient sampling." Ann. Statist. 50 (5) 2514 - 2544, October 2022. https://doi.org/10.1214/22-AOS2187

Information

Received: 1 January 2021; Revised: 1 February 2022; Published: October 2022
First available in Project Euclid: 27 October 2022

MathSciNet: MR4500617
zbMATH: 07628830
Digital Object Identifier: 10.1214/22-AOS2187

Subjects:
Primary: 62F03
Secondary: 62B05

Keywords: approximate sufficiency , conditional independence testing , conditional randomization test , co-sufficiency , Goodness-of-fit test , high-dimensional inference , model-X

Rights: Copyright © 2022 Institute of Mathematical Statistics

Vol.50 • No. 5 • October 2022
Back to Top