Bayesian Low-rank Adaptation for Large Language Models (2308.13111v5)

Published 24 Aug 2023 in cs.LG

Abstract: Low-rank adaptation (LoRA) has emerged as a new paradigm for cost-efficient fine-tuning of LLMs. However, fine-tuned LLMs often become overconfident especially when fine-tuned on small datasets. Bayesian methods, with their inherent ability to estimate uncertainty, serve as potent tools to mitigate overconfidence and enhance calibration. In this work, we introduce Laplace-LoRA, which applies a Bayesian approach to the LoRA parameters. Specifically, Laplace-LoRA applies a Laplace approximation to the posterior over the LoRA parameters, considerably improving the calibration of fine-tuned LLMs.

References (72)

Citations (30)

View on Semantic Scholar

Summary

We haven't generated a summary for this paper yet.

Summarize Now

Tweets

https://twitter.com/adam_x_yang/status/1757135327708282915

https://twitter.com/3rp3l/status/1785803593720664161

https://twitter.com/laurence_ai/status/1799006131077198226

https://twitter.com/maxime_robeyns/status/1787049759334605177

https://twitter.com/maxime_robeyns/status/1757318592327024889

Bayesian Low-rank Adaptation for Large Language Models (2308.13111v5)

Summary

Related Papers

Tweets