Comparisons Are All You Need for Optimizing Smooth Functions (2405.11454v1)

Published 19 May 2024 in cs.LG, cs.DS, and math.OC

Abstract: When optimizing machine learning models, there are various scenarios where gradient computations are challenging or even infeasible. Furthermore, in reinforcement learning (RL), preference-based RL that only compares between options has wide applications, including reinforcement learning with human feedback in LLMs. In this paper, we systematically study optimization of a smooth function $f\colon\mathbb{R}^{n\to\mathbb{R}$} only assuming an oracle that compares function values at two points and tells which is larger. When $f$ is convex, we give two algorithms using $\tilde{O}(n/\epsilon)$ and $\tilde{O}(n^{2})$ comparison queries to find an $\epsilon$-optimal solution, respectively. When $f$ is nonconvex, our algorithm uses $\tilde{O}(n/\epsilon^2)$ comparison queries to find an $\epsilon$-approximate stationary point. All these results match the best-known zeroth-order algorithms with function evaluation queries in $n$ dependence, thus suggest that \emph{comparisons are all you need for optimizing smooth functions using derivative-free methods}. In addition, we also give an algorithm for escaping saddle points and reaching an $\epsilon$-second order stationary point of a nonconvex $f$, using $\tilde{O}(n^{{1.5}/\epsilon^{2.5})$} comparison queries.

Citations (1)

View on Semantic Scholar

Summary

We haven't generated a summary for this paper yet.

Summarize Now

Tweets

https://twitter.com/AlgorithmPapers/status/1792804491789480218

Comparisons Are All You Need for Optimizing Smooth Functions (2405.11454v1)

Summary

Related Papers

Tweets