May 2024 Bayesian Sample Size Determination for Causal Discovery
Federico Castelletti, Guido Consonni
Author Affiliations +
Statist. Sci. 39(2): 305-321 (May 2024). DOI: 10.1214/23-STS905

Abstract

Graphical models based on Directed Acyclic Graphs (DAGs) are widely used to answer causal questions across a variety of scientific and social disciplines. However, observational data alone cannot distinguish in general between DAGs representing the same conditional independence assertions (Markov equivalent DAGs); as a consequence, the orientation of some edges in the graph remains indeterminate. Interventional data, produced by exogenous manipulations of variables in the network, enhance the process of structure learning because they allow to distinguish among equivalent DAGs, thus sharpening causal inference. Starting from an equivalence class of DAGs, a few procedures have been devised to produce a collection of variables to be manipulated in order to identify a causal DAG. Yet, these algorithmic approaches do not determine the sample size of the interventional data required to obtain a desired level of statistical accuracy. We tackle this problem from a Bayesian experimental design perspective, taking as input a sequence of target variables to be manipulated to identify edge orientation. We then propose a method to determine, at each intervention, the optimal sample size to produce an experiment which, with high assurance, will deliver an overall probability of decisive and correct evidence.

Funding Statement

Work partially supported by UCSC (D1 and 2019-D.3.2 research grants).

Acknowledgments

The authors would like to thank the Editor, an Associate Editor and two anonymous reviewers for their useful comments which helped improve the clarity of the paper.

Citation

Download Citation

Federico Castelletti. Guido Consonni. "Bayesian Sample Size Determination for Causal Discovery." Statist. Sci. 39 (2) 305 - 321, May 2024. https://doi.org/10.1214/23-STS905

Information

Published: May 2024
First available in Project Euclid: 5 May 2024

Digital Object Identifier: 10.1214/23-STS905

Keywords: Active learning , Bayes factor , Bayesian experimental design , Directed acyclic graph , intervention

Rights: Copyright © 2024 Institute of Mathematical Statistics

JOURNAL ARTICLE
17 PAGES

This article is only available to subscribers.
It is not available for individual sale.
+ SAVE TO MY LIBRARY

Vol.39 • No. 2 • May 2024
Back to Top