Abstract
We consider estimation of average treatment effects given observational data with high-dimensional pretreatment variables. Existing methods for this problem typically assume some form of sparsity for the regression functions. In this work, we introduce a debiased inverse propensity score weighting (DIPW) scheme for average treatment effect estimation that delivers -consistent estimates when the propensity score follows a sparse logistic regression model; the outcome regression functions are permitted to be arbitrarily complex. We further demonstrate how confidence intervals centred on our estimates may be constructed. Our theoretical results quantify the price to pay for permitting the regression functions to be unestimable, which shows up as an inflation of the variance of the estimator compared to the semiparametric efficient variance by a constant factor, under mild conditions. We also show that when outcome regressions can be estimated consistently, our estimator achieves semiparametric efficiency. As our results accommodate arbitrary outcome regression functions, averages of transformed responses under each treatment may also be estimated at the rate. Thus, for example, the variances of the potential outcomes may be estimated. We discuss extensions to estimating linear projections of the heterogeneous treatment effect function and explain how propensity score models with more general link functions may be handled within our framework. An R package implementing our methodology is available on CRAN.
Funding Statement
The research of both authors was supported by an Engineering and Physical Sciences Research Council (EPSRC) ‘First’ grant of RDS (EP/R013381/1).
The research of YW was additionally supported by National Key R & D Program (2022YFA1008100), the 2030 Innovation Megaprojects of China (Programme on New Generation Artificial Intelligence) Grant No. 2021AAA0150000, and the grant of National Natural Science Foundation of China (NSFC) 12201341.
Acknowledgments
The authors would like to thank the Editor, the Associate Editor and three anonymous referees for helpful comments that improved the paper. The authors would also like to thank Lin Liu for his help on the lower bound analysis (presented in Section S9).
YW is also affiliated with Shanghai Artificial Intelligence Laboratory and Shanghai Qi Zhi Institute.
Part of this work was done while YW was at the University of Cambridge, UK.
Citation
Yuhao Wang. Rajen D. Shah. "Debiased inverse propensity score weighting for estimation of average treatment effects with high-dimensional confounders." Ann. Statist. 52 (5) 1978 - 2003, October 2024. https://doi.org/10.1214/24-AOS2409
Information