Neural Networks with Causal Graph Constraints: A New Approach for Treatment Effects Estimation (2404.12238v1)
Abstract: In recent years, there has been a growing interest in using machine learning techniques for the estimation of treatment effects. Most of the best-performing methods rely on representation learning strategies that encourage shared behavior among potential outcomes to increase the precision of treatment effect estimates. In this paper we discuss and classify these models in terms of their algorithmic inductive biases and present a new model, NN-CGC, that considers additional information from the causal graph. NN-CGC tackles bias resulting from spurious variable interactions by implementing novel constraints on models, and it can be integrated with other representation learning methods. We test the effectiveness of our method using three different base models on common benchmarks. Our results indicate that our model constraints lead to significant improvements, achieving new state-of-the-art results in treatment effects estimation. We also show that our method is robust to imperfect causal graphs and that using partial causal information is preferable to ignoring it.
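The abstract does not say how the constraints are implemented. As a rough illustration only, the sketch below shows one way a network layer could be prevented from mixing inputs that a causal graph says should not interact: a weight mask gives each hidden unit access to a single group of variables. The grouping, the layer shape, and every name here (`build_group_mask`, `masked_layer`, `groups`) are hypothetical and are not taken from the paper; this is not the NN-CGC architecture itself.

```python
import math
import random


def build_group_mask(groups, n_inputs, units_per_group):
    """Return a 0/1 mask with one row per hidden unit.

    Each hidden unit is allowed to read only the inputs of one group,
    so inputs from different groups cannot interact in this layer.
    """
    mask = []
    for group in groups:
        row = [1.0 if i in group else 0.0 for i in range(n_inputs)]
        mask.extend([list(row) for _ in range(units_per_group)])
    return mask


def masked_layer(x, weights, mask):
    """Dense layer with tanh activation; masked weights contribute nothing."""
    out = []
    for w_row, m_row in zip(weights, mask):
        s = sum(w * m * xi for w, m, xi in zip(w_row, m_row, x))
        out.append(math.tanh(s))
    return out


# Hypothetical grouping (an assumption for illustration): suppose the causal
# graph allows inputs 0 and 1 to interact, while input 2 acts on its own.
groups = [{0, 1}, {2}]
n_inputs = 3
units_per_group = 2

mask = build_group_mask(groups, n_inputs, units_per_group)

rng = random.Random(0)
weights = [[rng.uniform(-1, 1) for _ in range(n_inputs)] for _ in mask]

x = [0.5, -1.0, 2.0]
x_changed = [0.5, -1.0, -3.0]  # only input 2 differs from x

h = masked_layer(x, weights, mask)
h_changed = masked_layer(x_changed, weights, mask)

# Units 0-1 belong to group {0, 1}; units 2-3 belong to group {2}.
print("group {0,1} units unchanged:", h[:2] == h_changed[:2])
print("group {2} units changed:", h[2:] != h_changed[2:])
```

In this toy layer, changing input 2 leaves the hidden units assigned to inputs 0 and 1 untouched, which is the kind of separation a graph-derived constraint aims for. Outcome heads for the treated and control arms would sit on top of such representations in a full model.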