Last Round Convergence and No-Instant Regret in Repeated Games with Asymmetric Information (2003.11727v2)

Published 26 Mar 2020 in cs.GT

Abstract: This paper considers repeated games in which one player has more information about the game than the other players. In particular, we investigate repeated two-player zero-sum games where only the column player knows the payoff matrix A of the game. Suppose that while repeatedly playing this game, the row player chooses her strategy at each round by using a no-regret algorithm to minimize her (pseudo) regret. We develop a no-instant-regret algorithm for the column player to exhibit last round convergence to a minimax equilibrium. We show that our algorithm is efficient against a large set of popular no-regret algorithms of the row player, including the multiplicative weight update algorithm, the online mirror descent method/follow-the-regularized-leader, the linear multiplicative weight update algorithm, and the optimistic multiplicative weight update.

Summary

We haven't generated a summary for this paper yet.

Summarize Now

Last Round Convergence and No-Instant Regret in Repeated Games with Asymmetric Information (2003.11727v2)

Summary

Related Papers