Private Fine-tuning of Large Language Models with Zeroth-order Optimization

(2401.04343)
Published Jan 9, 2024 in cs.LG, cs.CL, and cs.CR

Abstract

Fine-tuning large pretrained models on private datasets may run the risk of violating privacy. Differential privacy is a framework for mitigating privacy risks by enforcing algorithmic stability. DP-SGD enables training models with private data in a privacy-preserving manner, but raises new obstacles in the form of performance loss and significant engineering challenges. We introduce DP-ZO, a new method for fine-tuning LLMs that preserves the privacy of training data by privatizing zeroth-order optimization. A key insight into the design of our method is that the direction of the gradient in SPSA, the zeroth-order algorithm we use, is always random, and the only information that depends on private data is the step size, i.e., a scalar. Therefore, we only need to privatize the scalar step size, which is memory-efficient. DP-ZO, which can be instantiated with either Laplace or Gaussian noise, provides a strong privacy-utility trade-off across different tasks and model sizes under conservative privacy budgets. One noteworthy result is that DP-ZO exhibits just $1.86\%$ performance degradation due to privacy at $(1,10^{-5})$-DP when fine-tuning OPT-66B on 1000 training samples from SQuAD.
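The following is a minimal NumPy sketch of the idea described in the abstract: an SPSA-style update where the random perturbation direction is data-independent and only the scalar finite-difference step is clipped and noised. The function name `dp_zo_step`, the `loss_fn(theta, example)` signature, and all hyperparameter choices are illustrative assumptions, not the paper's actual implementation.

```python
import numpy as np

def dp_zo_step(theta, loss_fn, batch, lr=1e-3, eps=1e-3,
               clip=1.0, noise_scale=1.0, rng=np.random.default_rng()):
    """One hypothetical DP-ZO update: SPSA with a privatized scalar step.

    loss_fn(theta, example) -> per-example loss (assumed signature).
    noise_scale plays the role of the Gaussian mechanism's sigma.
    """
    # Shared random perturbation direction; does not depend on private data.
    z = rng.standard_normal(theta.shape)

    # Per-example finite-difference scalars: the only data-dependent quantity.
    diffs = []
    for example in batch:
        d = (loss_fn(theta + eps * z, example) -
             loss_fn(theta - eps * z, example)) / (2 * eps)
        # Clip each scalar to bound its sensitivity.
        diffs.append(np.clip(d, -clip, clip))

    # Privatize the aggregated scalar (Gaussian noise shown; the paper also
    # describes a Laplace instantiation).
    noisy = (np.sum(diffs) + rng.normal(0.0, noise_scale * clip)) / len(batch)

    # Update along the public random direction z, scaled by the
    # privatized scalar step size.
    return theta - lr * noisy * z
```

Because the noise is added to a single scalar per step rather than to a full gradient vector, this scheme avoids storing per-example gradients, which is the memory advantage the abstract highlights over DP-SGD.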
