Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
139 tokens/sec
GPT-4o
47 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Semiparametric Efficient Inference in Adaptive Experiments (2311.18274v3)

Published 30 Nov 2023 in stat.ML, cs.LG, and stat.ME

Abstract: We consider the problem of efficient inference of the Average Treatment Effect in a sequential experiment where the policy governing the assignment of subjects to treatment or control can change over time. We first provide a central limit theorem for the Adaptive Augmented Inverse-Probability Weighted estimator, which is semiparametric efficient, under weaker assumptions than those previously made in the literature. This central limit theorem enables efficient inference at fixed sample sizes. We then consider a sequential inference setting, deriving both asymptotic and nonasymptotic confidence sequences that are considerably tighter than previous methods. These anytime-valid methods enable inference under data-dependent stopping times (sample sizes). Additionally, we use propensity score truncation techniques from the recent off-policy estimation literature to reduce the finite sample variance of our estimator without affecting the asymptotic variance. Empirical results demonstrate that our methods yield narrower confidence sequences than those previously developed in the literature while maintaining time-uniform error control.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (24)
  1. Timothy B. Armstrong. Asymptotic efficiency bounds for a class of experimental designs. arXiv preprint 2205.02726, 2022.
  2. Akshay Balsubramani. Sharp finite-time iterated-logarithm martingale concentration. arXiv preprint 1405.2639, 2015.
  3. Sequential nonparametric testing with the law of the iterated logarithm. In Proceedings of the Thirty-Second Conference on Uncertainty in Artificial Intelligence, UAI’16, 2016.
  4. Double/debiased machine learning for treatment and structural parameters. The Econometrics Journal, 21(1):C1–C68, 2018.
  5. CLIP-OGD: An experimental design for adaptive Neyman allocation in sequential experiments. In Advances in Neural Information Processing Systems, volume 37, 2023.
  6. Aryeh Dvoretzky. Asymptotic normality for sums of dependent random variables. In Proceedings of the Sixth Berkeley Symposium on Mathematical Statistics and Probability, volume 2, 1972.
  7. Confidence intervals for policy evaluation in adaptive experiments. Proceedings of the National Academy of Sciences, 118(15), 2021.
  8. Adaptive experimental design using the propensity score. Journal of Business & Economic Statistics, 29(1):96–108, 2011.
  9. Time-uniform, nonparametric, nonasymptotic confidence sequences. The Annals of Statistics, 49(2), 2021.
  10. Peeking at A/B Tests: Why it matters, and what to do about it. In Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2017.
  11. Efficient adaptive experimental design for average treatment effect estimation. arXiv preprint 2002.05308, 2021.
  12. M. Loève. Probability Theory. Graduate texts in mathematics. Springer, 1977.
  13. Game-theoretic statistics and safe anytime-valid inference. Statistical Science, 2023.
  14. Herbert Robbins. Some aspects of the sequential design of experiments. Bulletin of the American Mathematical Society, 58(5):527 – 535, 1952.
  15. Estimation of regression coefficients when some regressors are not always observed. Journal of the American Statistical Association, 89(427):846–866, 1994.
  16. Donald B. Rubin. Randomization analysis of experimental data: The Fisher randomization test comment. Journal of the American Statistical Association, 75(371):591–593, 1980.
  17. Donald B. Rubin. Comment: Which ifs have causal answers. Journal of the American Statistical Association, 81:961–962, 1986.
  18. On the near-optimality of betting confidence sets for bounded means. arXiv preprint 2310.01547, 2023.
  19. Multi-armed bandit experimental design: Online decision-making and adaptive inference. In Proceedings of The 26th International Conference on Artificial Intelligence and Statistics, volume 206, 2023.
  20. Jean Ville. Étude critique de la notion de collectif. 1939.
  21. Estimating means of bounded random variables by betting. Journal of the Royal Statistical Society Series B: Statistical Methodology, 2023.
  22. Time-uniform central limit theory and asymptotic confidence sequences. arXiv preprint 2103.06476, 2023.
  23. Anytime-valid off-policy inference for contextual bandits. ACM/IMS Journal of Data Science, 2024.
  24. Statistical inference with M-estimators on adaptively collected data. In Advances in Neural Information Processing Systems, volume 34, 2021.
Citations (6)

Summary

We haven't generated a summary for this paper yet.