
Learning Directed Acyclic Graphs from Partial Orderings (2403.16031v1)

Published 24 Mar 2024 in stat.ML, cs.LG, and stat.ME

Abstract: Directed acyclic graphs (DAGs) are commonly used to model causal relationships among random variables. In general, learning the DAG structure is both computationally and statistically challenging. Moreover, without additional information, the direction of edges may not be estimable from observational data. In contrast, given a complete causal ordering of the variables, the problem can be solved efficiently, even in high dimensions. In this paper, we consider the intermediate problem of learning DAGs when a partial causal ordering of variables is available. We propose a general estimation framework for leveraging the partial ordering and present efficient estimation algorithms for low- and high-dimensional problems. The advantages of the proposed framework are illustrated via numerical studies.
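To make the abstract's remark about a complete causal ordering concrete, here is a minimal sketch in Python. It is not the paper's algorithm: it assumes a linear model, uses ordinary least squares, and prunes edges with a hard cutoff. The function name `dag_from_full_ordering`, the `threshold` parameter, and the simulated three-variable chain are all hypothetical choices made for illustration. The point is only that, once the ordering is known, each variable can be regressed on the variables that precede it, so the search over edge directions disappears.

```python
import numpy as np

def dag_from_full_ordering(X, order, threshold=0.1):
    """Estimate a weighted adjacency matrix B, where B[j, k] != 0 means j -> k.

    Illustrative assumptions (not taken from the paper): linear relationships,
    ordinary least squares, and a hard threshold to drop small coefficients.
    """
    n, p = X.shape
    Xc = X - X.mean(axis=0)  # centre columns so no intercept is needed
    B = np.zeros((p, p))
    for pos in range(1, p):
        child = order[pos]
        candidates = list(order[:pos])  # only earlier variables can be parents
        coef, *_ = np.linalg.lstsq(Xc[:, candidates], Xc[:, child], rcond=None)
        B[candidates, child] = coef
    B[np.abs(B) < threshold] = 0.0  # prune weak edges
    return B

# Simulated chain 0 -> 1 -> 2 with a known ordering (hypothetical data).
rng = np.random.default_rng(0)
n = 5000
x0 = rng.normal(size=n)
x1 = 0.8 * x0 + rng.normal(size=n)
x2 = -0.6 * x1 + rng.normal(size=n)
X = np.column_stack([x0, x1, x2])

B_hat = dag_from_full_ordering(X, order=[0, 1, 2])
print(np.round(B_hat, 2))
```

Because every regression only uses predecessors in the ordering, the estimated graph is acyclic by construction. With only a partial ordering, some pairs of variables have no known precedence, which is the intermediate setting the paper addresses.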

