Open Access
June 2019
Causal Dantzig: Fast inference in linear structural equation models with hidden variables under additive interventions
Dominik Rothenhäusler, Peter Bühlmann, Nicolai Meinshausen
Ann. Statist. 47(3): 1688-1722 (June 2019). DOI: 10.1214/18-AOS1732


Causal inference is known to be very challenging when only observational data are available. Randomized experiments are often costly and impractical, and in instrumental variable regression the number of instruments has to exceed the number of causal predictors. It was recently shown in Peters, Bühlmann and Meinshausen (2016) (J. R. Stat. Soc. Ser. B. Stat. Methodol. 78 947–1012) that causal inference for the full model is possible when data from distinct observational environments are available, exploiting the fact that the conditional distribution of a response variable is invariant under the correct causal model. Two shortcomings of such an approach are the high computational effort for large-scale data and the assumed absence of hidden confounders. Here, we show that these two shortcomings can be addressed if one is willing to make a more restrictive assumption on the type of interventions that generate different environments. To this end, we look at a different notion of invariance, namely inner-product invariance. By avoiding a computationally cumbersome reverse-engineering approach such as in Peters, Bühlmann and Meinshausen (2016), this notion allows for large-scale causal inference in linear structural equation models. We discuss identifiability conditions for the causal parameter and derive asymptotic confidence intervals in the low-dimensional setting. In the case of nonidentifiability, we show that the solution set of causal Dantzig has predictive guarantees under certain interventions. We derive finite-sample bounds in the high-dimensional setting and investigate the method's performance on simulated datasets.
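The abstract does not spell out the estimator, so the following is only an illustrative sketch under stated assumptions. It assumes two environments, reads inner-product invariance as the requirement that the average of X(Y - X·beta) is the same in both, and solves the resulting linear system without regularization. The function names, the simulation design, and all constants are hypothetical choices made for this example and are not taken from the source.

```python
import numpy as np


def causal_dantzig_unregularized(X1, Y1, X2, Y2):
    """Solve (G1 - G2) beta = (Z1 - Z2), with G_e = X_e'X_e/n_e and Z_e = X_e'Y_e/n_e.

    Assumed two-environment, low-dimensional form; requires G1 - G2 to be invertible.
    """
    G1 = X1.T @ X1 / X1.shape[0]
    G2 = X2.T @ X2 / X2.shape[0]
    Z1 = X1.T @ Y1 / X1.shape[0]
    Z2 = X2.T @ Y2 / X2.shape[0]
    return np.linalg.solve(G1 - G2, Z1 - Z2)


def simulate(n, shift_scale, beta, rng):
    """Toy linear model with a hidden confounder H and an additive shift on X.

    The shift is independent of H and of the noise, which is the assumption
    that makes the inner product of X with the residual equal across environments.
    """
    p = beta.shape[0]
    H = rng.normal(size=(n, 1))                    # hidden confounder
    shift = shift_scale * rng.normal(size=(n, p))  # additive intervention
    X = H + rng.normal(size=(n, p)) + shift
    Y = X @ beta + 2.0 * H[:, 0] + rng.normal(size=n)
    return X, Y


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    beta = np.array([1.0, -0.5])
    X1, Y1 = simulate(200_000, 0.0, beta, rng)  # environment 1: no shift
    X2, Y2 = simulate(200_000, 2.0, beta, rng)  # environment 2: shifted X
    print("causal Dantzig sketch:", causal_dantzig_unregularized(X1, Y1, X2, Y2))
    print("OLS on environment 1: ", np.linalg.lstsq(X1, Y1, rcond=None)[0])
```

In this toy setup ordinary least squares on a single environment is biased by the hidden confounder, while the difference of the two environments' moments cancels the confounding term. The sketch needs the shift to change the second moments of X between environments; otherwise the matrix G1 - G2 is close to singular and the parameter is not identified by this system.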



Dominik Rothenhäusler, Peter Bühlmann, Nicolai Meinshausen. "Causal Dantzig: Fast inference in linear structural equation models with hidden variables under additive interventions." Ann. Statist. 47 (3) 1688-1722, June 2019.


Received: 1 June 2017; Revised: 1 April 2018; Published: June 2019
First available in Project Euclid: 13 February 2019

zbMATH: 07053523
MathSciNet: MR3911127
Digital Object Identifier: 10.1214/18-AOS1732

Primary: 62H99, 62J99
Secondary: 68T99

Keywords: Causal inference, high-dimensional consistency, structural equation models

Rights: Copyright © 2019 Institute of Mathematical Statistics

Vol. 47 • No. 3 • June 2019