Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
97 tokens/sec
GPT-4o
53 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Generalization Bounds of Nonconvex-(Strongly)-Concave Stochastic Minimax Optimization (2205.14278v2)

Published 28 May 2022 in math.OC, cs.LG, and stat.ML

Abstract: This paper takes an initial step to systematically investigate the generalization bounds of algorithms for solving nonconvex-(strongly)-concave (NC-SC/NC-C) stochastic minimax optimization measured by the stationarity of primal functions. We first establish algorithm-agnostic generalization bounds via uniform convergence between the empirical minimax problem and the population minimax problem. The sample complexities for achieving $\epsilon$-generalization are $\tilde{\mathcal{O}}(d\kappa2\epsilon{-2})$ and $\tilde{\mathcal{O}}(d\epsilon{-4})$ for NC-SC and NC-C settings, respectively, where $d$ is the dimension and $\kappa$ is the condition number. We further study the algorithm-dependent generalization bounds via stability arguments of algorithms. In particular, we introduce a novel stability notion for minimax problems and build a connection between generalization bounds and the stability notion. As a result, we establish algorithm-dependent generalization bounds for stochastic gradient descent ascent (SGDA) algorithm and the more general sampling-determined algorithms.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (4)
  1. Siqi Zhang (30 papers)
  2. Yifan Hu (89 papers)
  3. Liang Zhang (359 papers)
  4. Niao He (91 papers)
Citations (2)

Summary

We haven't generated a summary for this paper yet.