Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
97 tokens/sec
GPT-4o
53 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
5 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Generalized Causal Tree for Uplift Modeling (2202.02416v2)

Published 4 Feb 2022 in stat.ME, stat.AP, and stat.ML

Abstract: Uplift modeling is crucial in various applications ranging from marketing and policy-making to personalized recommendations. The main objective is to learn optimal treatment allocations for a heterogeneous population. A primary line of existing work modifies the loss function of the decision tree algorithm to identify cohorts with heterogeneous treatment effects. Another line of work estimates the individual treatment effects separately for the treatment group and the control group using off-the-shelf supervised learning algorithms. The former approach that directly models the heterogeneous treatment effect is known to outperform the latter in practice. However, the existing tree-based methods are mostly limited to a single treatment and a single control use case, except for a handful of extensions to multiple discrete treatments. In this paper, we propose a generalization of tree-based approaches to tackle multiple discrete and continuous-valued treatments. We focus on a generalization of the well-known causal tree algorithm due to its desirable statistical properties, but our generalization technique can be applied to other tree-based approaches as well. The efficacy of our proposed method is demonstrated using experiments and real data examples.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (36)
  1. Online parameter selection for web-based ranking problems. In Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pages 23–32.
  2. Recursive partitioning for heterogeneous causal effects. Proceedings of the National Academy of Sciences, 113(27):7353–7360.
  3. Estimating the effects of continuous-valued interventions using generative adversarial networks. Advances in Neural Information Processing Systems, 33:16434–16445.
  4. Estimating the effects of continuous-valued interventions using generative adversarial networks. In Proceedings of the 34th International Conference on Neural Information Processing Systems. Curran Associates Inc.
  5. twang: Toolkit for Weighting and Analysis of Nonequivalent Groups. R package version 2.4.
  6. CausalML: Python package for causal machine learning. arXiv preprint arXiv:2002.11631.
  7. Nonparametric estimation of heterogeneous treatment effects: From theory to learning algorithms. In Proceedings of the 24th International Conference on Artificial Intelligence and Statistics.
  8. Modeling Heterogeneous Treatment Effects in Survey Experiments with Bayesian Additive Regression Trees. Public Opinion Quarterly, 76(3):491–511.
  9. Optimal personalized treatment rules for marketing interventions: A review of methods, a new proposal, and an insurance case study.
  10. A survey of learning causality with data: Problems and methods. ACM Comput.Surv., 53(4).
  11. Email volume optimization at linkedin. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pages 97–106.
  12. Causal inference and uplift modelling: A review of the literature. In International Conference on Predictive Applications and APIs, pages 1–13. PMLR.
  13. Incremental value modeling. Journal of Interactive Marketing, 16:35–46.
  14. Hill, J. L. (2011). Bayesian nonparametric modeling for causal inference. J Comput Graphical Stat, 20:217–240.
  15. Causal inference with general treatment regimes: Generalizing the propensity score. Journal of the American Statistical Association, 99(467):854–866.
  16. Imbens, G. (2000). The role of the propensity score in estimating dose-response functions. Biometrika, 87(3):706–710.
  17. Principal stratification for causal inference with extended partial compliance. Journal of the American Statistical Association, 103(481):101–111.
  18. Machine learning estimation of heterogeneous causal effects: Empirical Monte Carlo evidence. The Econometrics Journal, 24(1):134–161.
  19. Metalearners for estimating heterogeneous treatment effects using machine learning. Proceedings of the National Academy of Sciences, 116:4156–4165.
  20. Quantifying the effectiveness of advertising: A bootstrap proportion test for brand lift testing. In CIKM’23.
  21. A survey and benchmarking study of multitreatment uplift modeling. Data Mining and Knowledge Discovery, 34(2):273–308.
  22. Orthogonal random forest for causal inference. In International Conference on Machine Learning, pages 4932–4941. PMLR.
  23. Real-world uplift modelling with significance-based uplift trees. Stochastic Solutions White Paper.
  24. Rubin, D. B. (1974). Estimating causal effects of treatments in randomized and nonrandomized studies. Journal of Educational Psychology, 66(5):688.
  25. Decision trees for uplift modeling with single and multiple treatments. Knowledge and Information Systems, 32(2):303–327.
  26. Reliable decision support using counterfactual models. In Proceedings of the 31st International Conference on Neural Information Processing Systems, NIPS’17, page 1696–1706, Red Hook, NY, USA. Curran Associates Inc.
  27. Therneau, T. M. (2021). A Package for Survival Analysis in R. R package version 3.2-13.
  28. Personalized treatment selection using causal heterogeneity. In Proceedings of The Web Conference 2021, WWW ’21.
  29. van der Schaar, Y. Z. A. B. M. (2020). Learning overlapping representations for the estimation of individualized treatment effects. In Proceedings of the 23rd International Conference on Artificial Intelligence and Statistics.
  30. Estimation and inference of heterogeneous treatment effects using random forests. Journal of the American Statistical Association, 113(523):1228–1242.
  31. Gcf: Generalized causal forest for heterogeneous treatment effect estimation in online marketplace. arXiv preprint arXiv:2203.10975.
  32. Support vector machines for uplift modeling. In 13th International Conference on Data Mining Workshops, pages 131–138.
  33. A unified survey of treatment effect heterogeneity modelling and uplift modelling. ACM Computing Surveys (CSUR), 54(8):1–36.
  34. Uplift modeling with multiple treatments and general response types. In Proceedings of the 2017 SIAM International Conference on Data Mining, pages 588–596. SIAM.
  35. Uplift modeling for multiple treatments with cost optimization. In 2019 IEEE International Conference on Data Science and Advanced Analytics (DSAA), pages 422–431. IEEE.
  36. Offline multi-action policy learning: Generalization and optimization. Operations Research, 71(1):148–183.
User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (6)
  1. Preetam Nandy (17 papers)
  2. Xiufan Yu (9 papers)
  3. Wanjun Liu (3 papers)
  4. Ye Tu (5 papers)
  5. Kinjal Basu (49 papers)
  6. Shaunak Chatterjee (11 papers)
Citations (3)

Summary

We haven't generated a summary for this paper yet.