Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
110 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Improving Instruction Following in Language Models through Proxy-Based Uncertainty Estimation (2405.06424v2)

Published 10 May 2024 in cs.CL, cs.AI, and cs.LG

Abstract: Assessing response quality to instructions in LLMs is vital but challenging due to the complexity of human language across different contexts. This complexity often results in ambiguous or inconsistent interpretations, making accurate assessment difficult. To address this issue, we propose a novel Uncertainty-aware Reward Model (URM) that introduces a robust uncertainty estimation for the quality of paired responses based on Bayesian approximation. Trained with preference datasets, our uncertainty-enabled proxy not only scores rewards for responses but also evaluates their inherent uncertainty. Empirical results demonstrate significant benefits of incorporating the proposed proxy into LLM training. Our method boosts the instruction following capability of LLMs by refining data curation for training and improving policy optimization objectives, thereby surpassing existing methods by a large margin on benchmarks such as Vicuna and MT-bench. These findings highlight that our proposed approach substantially advances LLM training and paves a new way of harnessing uncertainty within LLMs.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (14)
  1. Jae Oh Woo (13 papers)
  2. Juree Seok (2 papers)
  3. Parisa Hassanzadeh (19 papers)
  4. Wooseok Jang (12 papers)
  5. JuYoun Son (2 papers)
  6. Sima Didari (6 papers)
  7. Baruch Gutow (1 paper)
  8. Heng Hao (14 papers)
  9. Hankyu Moon (6 papers)
  10. Wenjun Hu (14 papers)
  11. Yeong-Dae Kwon (11 papers)
  12. Taehee Lee (6 papers)
  13. Seungjai Min (7 papers)
  14. Joonho Lee (104 papers)
Citations (1)

Summary

We haven't generated a summary for this paper yet.

X Twitter Logo Streamline Icon: https://streamlinehq.com

Tweets