Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
97 tokens/sec
GPT-4o
53 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Combining Experimental and Historical Data for Policy Evaluation (2406.00317v1)

Published 1 Jun 2024 in stat.ML, cs.LG, and stat.ME

Abstract: This paper studies policy evaluation with multiple data sources, especially in scenarios that involve one experimental dataset with two arms, complemented by a historical dataset generated under a single control arm. We propose novel data integration methods that linearly integrate base policy value estimators constructed based on the experimental and historical data, with weights optimized to minimize the mean square error (MSE) of the resulting combined estimator. We further apply the pessimistic principle to obtain more robust estimators, and extend these developments to sequential decision making. Theoretically, we establish non-asymptotic error bounds for the MSEs of our proposed estimators, and derive their oracle, efficiency and robustness properties across a broad spectrum of reward shift scenarios. Numerical experiments and real-data-based analyses from a ridesharing company demonstrate the superior performance of the proposed estimators.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (7)
  1. Ting Li (129 papers)
  2. Chengchun Shi (57 papers)
  3. Qianglin Wen (2 papers)
  4. Yang Sui (30 papers)
  5. Yongli Qin (1 paper)
  6. Chunbo Lai (1 paper)
  7. Hongtu Zhu (81 papers)

Summary

We haven't generated a summary for this paper yet.