Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
97 tokens/sec
GPT-4o
53 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
5 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Finding Second-Order Stationary Points in Nonconvex-Strongly-Concave Minimax Optimization (2110.04814v3)

Published 10 Oct 2021 in math.OC and cs.LG

Abstract: We study the smooth minimax optimization problem $\min_{\bf x}\max_{\bf y} f({\bf x},{\bf y})$, where $f$ is $\ell$-smooth, strongly-concave in ${\bf y}$ but possibly nonconvex in ${\bf x}$. Most of existing works focus on finding the first-order stationary points of the function $f({\bf x},{\bf y})$ or its primal function $P({\bf x})\triangleq \max_{\bf y} f({\bf x},{\bf y})$, but few of them focus on achieving second-order stationary points. In this paper, we propose a novel approach for minimax optimization, called Minimax Cubic Newton (MCN), which could find an $\big(\varepsilon,\kappa{1.5}\sqrt{\rho\varepsilon}\,\big)$-second-order stationary point of $P({\bf x})$ with calling ${\mathcal O}\big(\kappa{1.5}\sqrt{\rho}\varepsilon{-1.5}\big)$ times of second-order oracles and $\tilde{\mathcal O}\big(\kappa{2}\sqrt{\rho}\varepsilon{-1.5}\big)$ times of first-order oracles, where $\kappa$ is the condition number and $\rho$ is the Lipschitz continuous constant for the Hessian of $f({\bf x},{\bf y})$. In addition, we propose an inexact variant of MCN for high-dimensional problems to avoid calling expensive second-order oracles. Instead, our method solves the cubic sub-problem inexactly via gradient descent and matrix Chebyshev expansion. This strategy still obtains the desired approximate second-order stationary point with high probability but only requires $\tilde{\mathcal O}\big(\kappa{1.5}\ell\varepsilon{-2}\big)$ Hessian-vector oracle calls and $\tilde{\mathcal O}\big(\kappa{2}\sqrt{\rho}\varepsilon{-1.5}\big)$ first-order oracle calls. To the best of our knowledge, this is the first work that considers the non-asymptotic convergence behavior of finding second-order stationary points for minimax problems without the convex-concave assumptions.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (3)
  1. Luo Luo (36 papers)
  2. Yujun Li (17 papers)
  3. Cheng Chen (262 papers)
Citations (14)

Summary

We haven't generated a summary for this paper yet.