AcceleratedLiNGAM: Learning Causal DAGs at the speed of GPUs (2403.03772v1)
Abstract: Existing causal discovery methods based on combinatorial optimization or search are slow, which prohibits their application to large-scale datasets. More recent methods attempt to address this limitation by formulating causal discovery as structure learning with continuous optimization, but such approaches have so far provided no statistical guarantees. In this paper, we show that by efficiently parallelizing existing causal discovery methods, we can in fact scale them to thousands of dimensions, making them practical for substantially larger-scale problems. In particular, we parallelize the LiNGAM method, which is quadratic in the number of variables, obtaining up to a 32-fold speed-up on benchmark datasets compared with existing sequential implementations. Specifically, we focus on the causal ordering subprocedure in DirectLiNGAM and implement GPU kernels to accelerate it. This allows us to apply DirectLiNGAM to causal inference on large-scale gene expression data with genetic interventions, yielding results competitive with specialized continuous optimization methods, and to apply Var-LiNGAM to causal discovery on U.S. stock data.
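To make the causal ordering subprocedure concrete, the following is a minimal sequential sketch of one root-selection step. It assumes the pairwise likelihood-ratio measure with a maximum-entropy approximation of differential entropy, as commonly used in DirectLiNGAM implementations. It is not the paper's GPU kernel, and the function names (`standardize`, `residual`, `entropy`, `pair_score`, `find_root`) and the simulated three-variable chain are hypothetical, chosen only for illustration. The point to notice is that each `(i, j)` score depends only on columns `i` and `j`, so the quadratic number of pairs can in principle be evaluated independently of one another.

```python
import math
import random


def standardize(x):
    """Return x shifted to zero mean and scaled to unit variance."""
    n = len(x)
    mean = sum(x) / n
    std = math.sqrt(sum((v - mean) ** 2 for v in x) / n)
    return [(v - mean) / std for v in x]


def residual(xi, xj):
    """Residual of regressing xi on xj; both inputs must be standardized."""
    slope = sum(a * b for a, b in zip(xi, xj)) / len(xi)  # var(xj) == 1
    return [a - slope * b for a, b in zip(xi, xj)]


def entropy(u):
    """Maximum-entropy approximation of differential entropy (u standardized)."""
    k1, k2, gamma = 79.047, 7.4129, 0.37457
    n = len(u)
    t1 = sum(math.log(math.cosh(v)) for v in u) / n
    t2 = sum(v * math.exp(-v * v / 2) for v in u) / n
    return (1 + math.log(2 * math.pi)) / 2 - k1 * (t1 - gamma) ** 2 - k2 * t2 ** 2


def pair_score(xi, xj):
    """Penalty for treating xi as exogenous relative to xj (0 if consistent)."""
    ri_j = standardize(residual(xi, xj))
    rj_i = standardize(residual(xj, xi))
    diff = (entropy(xj) + entropy(ri_j)) - (entropy(xi) + entropy(rj_i))
    return min(0.0, diff) ** 2


def find_root(columns):
    """Pick the variable with the smallest summed penalty over all other variables."""
    cols = [standardize(c) for c in columns]
    p = len(cols)
    # Every (i, j) term is independent of the others: this double loop is
    # the part a parallel implementation would distribute.
    scores = [
        sum(pair_score(cols[i], cols[j]) for j in range(p) if j != i)
        for i in range(p)
    ]
    return min(range(p), key=scores.__getitem__)


# Hypothetical test data: a linear chain x0 -> x1 -> x2 with uniform noise.
rng = random.Random(0)
n_samples = 20000
x0 = [rng.uniform(-1, 1) for _ in range(n_samples)]
x1 = [a + rng.uniform(-1, 1) for a in x0]
x2 = [b + rng.uniform(-1, 1) for b in x1]

root = find_root([x0, x1, x2])
print("estimated root variable:", root)
```

In a full DirectLiNGAM run, the selected root would be regressed out of the remaining variables and the step repeated until all variables are ordered; that outer loop is omitted here for brevity.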