No-Regret Algorithms for Time-Varying Bayesian Optimization (2102.06296v2)

Published 11 Feb 2021 in cs.LG

Abstract: In this paper, we consider the time-varying Bayesian optimization problem. The unknown function at each time is assumed to lie in an RKHS (reproducing kernel Hilbert space) with a bounded norm. We adopt the general variation budget model to capture the time-varying environment, and the variation is characterized by the change of the RKHS norm. We adapt the restart and sliding window mechanisms to introduce two GP-UCB type algorithms: R-GP-UCB and SW-GP-UCB, respectively. We derive the first (frequentist) regret guarantee on the dynamic regret for both algorithms. Our results not only recover previous linear bandit results when a linear kernel is used, but also complement the previous regret analysis of time-varying Gaussian process bandits under a Bayesian-type regularity assumption, i.e., each function is a sample from a Gaussian process.
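The sliding window idea named in the abstract can be illustrated with a minimal sketch: a GP-UCB style rule that fits its posterior only to the most recent observations. This is not the paper's SW-GP-UCB. The function names (`rbf`, `solve`, `posterior`, `sw_gp_ucb`), the RBF kernel, the regularizer `lam`, the fixed confidence multiplier `beta`, and the finite candidate grid are all assumptions made for illustration; the paper's actual confidence width and window size are not reproduced here.

```python
import math
from collections import deque


def rbf(a, b, lengthscale=0.2):
    """Squared-exponential kernel on scalars (an assumed kernel choice)."""
    return math.exp(-((a - b) ** 2) / (2.0 * lengthscale ** 2))


def solve(A, b):
    """Solve A x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    M = [row[:] + [b[i]] for i, row in enumerate(A)]  # augmented matrix
    for c in range(n):
        p = max(range(c, n), key=lambda r: abs(M[r][c]))
        M[c], M[p] = M[p], M[c]
        for r in range(c + 1, n):
            factor = M[r][c] / M[c][c]
            for j in range(c, n + 1):
                M[r][j] -= factor * M[c][j]
    x = [0.0] * n
    for i in range(n - 1, -1, -1):
        s = M[i][n] - sum(M[i][j] * x[j] for j in range(i + 1, n))
        x[i] = s / M[i][i]
    return x


def posterior(xs, ys, x, lam=0.1):
    """Kernel ridge mean and variance at x given observations (xs, ys)."""
    if not xs:
        return 0.0, rbf(x, x)  # no data inside the window: prior only
    K = [[rbf(a, b) + (lam if i == j else 0.0) for j, b in enumerate(xs)]
         for i, a in enumerate(xs)]
    kx = [rbf(a, x) for a in xs]
    alpha = solve(K, ys)   # (K + lam I)^{-1} y
    v = solve(K, kx)       # (K + lam I)^{-1} k(x)
    mean = sum(k * a for k, a in zip(kx, alpha))
    var = rbf(x, x) - sum(k * w for k, w in zip(kx, v))
    return mean, max(var, 0.0)  # clip tiny negative values from round-off


def sw_gp_ucb(f, candidates, horizon, window, beta=2.0):
    """Pick the UCB maximizer each round using only the last `window` points.

    f(t, x) is the (hypothetical) time-varying objective; beta is a fixed
    exploration weight assumed here for simplicity.
    """
    history = deque(maxlen=window)  # old observations fall out automatically
    picks = []
    for t in range(horizon):
        xs = [h[0] for h in history]
        ys = [h[1] for h in history]

        def ucb(x):
            m, var = posterior(xs, ys, x)
            return m + beta * math.sqrt(var)

        x_t = max(candidates, key=ucb)
        history.append((x_t, f(t, x_t)))
        picks.append(x_t)
    return picks, list(history)
```

A call such as `sw_gp_ucb(lambda t, x: -(x - 0.02 * t) ** 2, [i / 10 for i in range(11)], horizon=30, window=10)` runs the rule on a slowly drifting toy objective. A restart variant in the same spirit would clear `history` every fixed number of rounds instead of dropping one old point per round.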

Citations (16)

Related Papers

Bayesian Analysis of Combinatorial Gaussian Process Bandits (2023)
On the Sublinear Regret of GP-UCB (2023)
Weighted Gaussian Process Bandits for Non-stationary Environments (2021)
On Kernelized Multi-armed Bandits (2017)
Time-Varying Gaussian Process Bandit Optimization (2016)