Adaptive Online Experimental Design for Causal Discovery (2405.11548v3)
Abstract: Causal discovery aims to uncover cause-and-effect relationships encoded in causal graphs by leveraging observational, interventional data, or their combination. The majority of existing causal discovery methods are developed assuming infinite interventional data. We focus on data interventional efficiency and formalize causal discovery from the perspective of online learning, inspired by pure exploration in bandit problems. A graph separating system, consisting of interventions that cut every edge of the graph at least once, is sufficient for learning causal graphs when infinite interventional data is available, even in the worst case. We propose a track-and-stop causal discovery algorithm that adaptively selects interventions from the graph separating system via allocation matching and learns the causal graph based on sampling history. Given any desired confidence value, the algorithm determines a termination condition and runs until it is met. We analyze the algorithm to establish a problem-dependent upper bound on the expected number of required interventional samples. Our proposed algorithm outperforms existing methods in simulations across various randomly generated causal graphs. It achieves higher accuracy, measured by the structural hamming distance (SHD) between the learned causal graph and the ground truth, with significantly fewer samples.
- A characterization of markov equivalence classes for acyclic digraphs. The Annals of Statistics, 25(2):505–541, 1997.
- Bayesdag: Gradient-based posterior sampling for causal discovery. arXiv preprint arXiv:2307.13917, 2023.
- Convex optimization. Cambridge university press, 2004.
- Adaptivity complexity for causal graph discovery. arXiv preprint arXiv:2306.05781, 2023.
- Unimodal bandits: Regret lower bounds and optimal algorithms. In International Conference on Machine Learning, pages 521–529. PMLR, 2014.
- Follow the leader if you can, hedge if you must. The Journal of Machine Learning Research, 15(1):1281–1316, 2014.
- Non-asymptotic pure exploration by solving games. Advances in Neural Information Processing Systems, 32, 2019.
- Optimal experiment design for causal discovery from fixed number of experiments. arXiv preprint arXiv:1702.08567, 2017.
- Sample efficient active learning of causal trees. Advances in Neural Information Processing Systems, 32, 2019.
- Characterization and greedy learning of interventional markov equivalence classes of directed acyclic graphs. The Journal of Machine Learning Research, 13(1):2409–2464, 2012.
- Two optimal strategies for active learning of causal models from interventional data. International Journal of Approximate Reasoning, 55(4):926–939, 2014.
- A bayesian approach to causal discovery. Technical report, Technical report msr-tr-97-05, Microsoft Research, 1997.
- Randomized experimental design for causal graph discovery. Advances in neural information processing systems, 27, 2014.
- Causal discovery toolbox: Uncovering causal relationships in python. The Journal of Machine Learning Research, 21(1):1406–1410, 2020.
- Gyula Katona. On separating systems of a finite set. Journal of Combinatorial Theory, 1(2):174–194, 1966.
- On the complexity of best arm identification in multi-armed bandit models. Journal of Machine Learning Research, 17:1–42, 2016.
- Cost-optimal learning of causal graphs. In International Conference on Machine Learning, pages 1875–1884. PMLR, 2017.
- Probabilistic graphical models: principles and techniques. MIT press, 2009.
- Daniel Král’. Coloring powers of chordal graphs. SIAM Journal on Discrete Mathematics, 18(3):451–461, 2004.
- Bandit algorithms. Cambridge University Press, 2020.
- Christopher Meek. Causal inference and causal explanation with background knowledge. In Proceedings of the Eleventh conference on Uncertainty in artificial intelligence, pages 403–410, 1995.
- Judea Pearl. Causality. Cambridge university press, 2009.
- Emilija Perkovic. Identifying causal effects in maximally oriented partially directed acyclic graphs. In Conference on Uncertainty in Artificial Intelligence, pages 530–539. PMLR, 2020.
- Elements of causal inference: foundations and learning algorithms. The MIT Press, 2017.
- Causal protein-signaling networks derived from multiparameter single-cell data. Science, 308(5721):523–529, 2005.
- Marco Scutari. Learning bayesian networks with the bnlearn r package. arXiv preprint arXiv:0908.3817, 2009.
- Learning causal graphs with small interventions. Advances in Neural Information Processing Systems, 28, 2015.
- David Siegmund. Sequential analysis: tests and confidence intervals. Springer Science & Business Media, 1985.
- Causation, prediction, and search. MIT press, 2000.
- Active structure learning of causal dags via directed clique trees. Advances in Neural Information Processing Systems, 33:21500–21511, 2020.
- M Subramani and GF Cooper. Causal discovery from medical textual data, 1999.
- Active bayesian causal inference. Advances in Neural Information Processing Systems, 35:16261–16275, 2022.
- Optimal learning for structured bandits. arXiv preprint arXiv:2007.07302, 2020.
- Equivalence and synthesis of causal models. In Probabilistic and causal inference: The works of Judea Pearl, pages 221–236. 2022.
- Ingo Wegener. On separating systems whose elements are sets of at most k elements. Discrete Mathematics, 28(2):219–222, 1979.
- Strong faithfulness and uniform consistency in causal inference. arXiv preprint arXiv:1212.2506, 2012.
- Muhammad Qasim Elahi (6 papers)
- Lai Wei (68 papers)
- Murat Kocaoglu (27 papers)
- Mahsa Ghasemi (20 papers)