June 2021 Causal discovery in heavy-tailed models
Nicola Gnecco, Nicolai Meinshausen, Jonas Peters, Sebastian Engelke
Author Affiliations +
Ann. Statist. 49(3): 1755-1778 (June 2021). DOI: 10.1214/20-AOS2021

Abstract

Causal questions are omnipresent in many scientific problems. While much progress has been made in the analysis of causal relationships between random variables, these methods are not well suited if the causal mechanisms only manifest themselves in extremes. This work aims to connect the two fields of causal inference and extreme value theory. We define the causal tail coefficient that captures asymmetries in the extremal dependence of two random variables. In the population case, the causal tail coefficient is shown to reveal the causal structure if the distribution follows a linear structural causal model. This holds even in the presence of latent common causes that have the same tail index as the observed variables. Based on a consistent estimator of the causal tail coefficient, we propose a computationally highly efficient algorithm that estimates the causal structure. We prove that our method consistently recovers the causal order and we compare it to other well-established and nonextremal approaches in causal discovery on synthetic and real data. The code is available as an open-access R package.

Funding Statement

JP was supported by research grants from VILLUM FONDEN and the Carlsberg Foundation, and SE was supported by the Swiss National Science Foundation.

Acknowledgments

We thank Cesare Miglioli, Stanislav Volgushev and Linbo Wang for helpful discussions. We are grateful to the editorial team and two anonymous referees for constructive comments that improved the paper.

Citation

Download Citation

Nicola Gnecco. Nicolai Meinshausen. Jonas Peters. Sebastian Engelke. "Causal discovery in heavy-tailed models." Ann. Statist. 49 (3) 1755 - 1778, June 2021. https://doi.org/10.1214/20-AOS2021

Information

Received: 1 November 2019; Revised: 1 September 2020; Published: June 2021
First available in Project Euclid: 9 August 2021

MathSciNet: MR4298880
zbMATH: 1475.62130
Digital Object Identifier: 10.1214/20-AOS2021

Subjects:
Primary: 62Hxx
Secondary: 62G08 , 62G32

Keywords: causality , Extreme value theory , heavy-tailed distributions , nonparametric estimation

Rights: Copyright © 2021 Institute of Mathematical Statistics

Vol.49 • No. 3 • June 2021
Back to Top