Open Access
June 2024 Reconciling model-X and doubly robust approaches to conditional independence testing
Ziang Niu, Abhinav Chakraborty, Oliver Dukes, Eugene Katsevich
Author Affiliations +
Ann. Statist. 52(3): 895-921 (June 2024). DOI: 10.1214/24-AOS2372

Abstract

Model-X approaches to testing conditional independence between a predictor and an outcome variable given a vector of covariates usually assume exact knowledge of the conditional distribution of the predictor given the covariates. Nevertheless, model-X methodologies are often deployed with this conditional distribution learned in sample. We investigate the consequences of this choice through the lens of the distilled conditional randomization test (dCRT). We find that Type-I error control is still possible, but only if the mean of the outcome variable given the covariates is estimated well enough. This demonstrates that the dCRT is doubly robust, and motivates a comparison to the generalized covariance measure (GCM) test, another doubly robust conditional independence test. We prove that these two tests are asymptotically equivalent, and show that the GCM test is optimal against (generalized) partially linear alternatives by leveraging semiparametric efficiency theory. In an extensive simulation study, we compare the dCRT to the GCM test. These two tests have broadly similar Type-I error and power, though dCRT can have somewhat better Type-I error control but somewhat worse power in small samples or when the response is discrete. We also find that post-lasso based test statistics (as compared to lasso based statistics) can dramatically improve Type-I error control for both methods.

Funding Statement

ZN was partially supported by the grant “Statistical Software for Single Cell CRISPR Screens” awarded to EK by Analytics at Wharton.
OD was partially supported by FWO Grant 1222522N and NIH Grant AG065276.
EK was partially supported by NSF Grants DMS-2113072 and DMS-2310654.

Acknowledgments

We acknowledge help from Timothy Barry with our simulation studies and the underlying computational infrastructure, including his simulatr R package and Nextflow pipeline.

We acknowledge dedicated support from the staff at the Wharton High Performance Computing Cluster. We acknowledge Lucas Janson for providing details about the simulation setting in Candès et al. (2018).

We acknowledge Eric Tchetgen Tchetgen for helpful discussions on hypothesis testing in the semiparametric models.

Finally, we acknowledge two referees and an Associate Editor for their insightful comments and suggestions, which helped improve this work.

The first and the second authors contributed equally to this work.

Citation

Download Citation

Ziang Niu. Abhinav Chakraborty. Oliver Dukes. Eugene Katsevich. "Reconciling model-X and doubly robust approaches to conditional independence testing." Ann. Statist. 52 (3) 895 - 921, June 2024. https://doi.org/10.1214/24-AOS2372

Information

Received: 1 February 2023; Revised: 1 February 2024; Published: June 2024
First available in Project Euclid: 11 August 2024

Digital Object Identifier: 10.1214/24-AOS2372

Subjects:
Primary: 62G09 , 62G10 , 62J07

Keywords: conditional independence testing , conditional randomization test , double robustness , model-X

Rights: Copyright © 2024 Institute of Mathematical Statistics

Vol.52 • No. 3 • June 2024
Back to Top