Aligning Agents like Large Language Models (2406.04208v1)

Published 6 Jun 2024 in cs.LG and cs.AI

Abstract: Training agents to behave as desired in complex 3D environments from high-dimensional sensory information is challenging. Imitation learning from diverse human behavior provides a scalable approach for training an agent with a sensible behavioral prior, but such an agent may not perform the specific behaviors of interest when deployed. To address this issue, we draw an analogy between the undesirable behaviors of imitation learning agents and the unhelpful responses of unaligned LLMs. We then investigate how the procedure for aligning LLMs can be applied to aligning agents in a 3D environment from pixels. For our analysis, we utilize an academically illustrative part of a modern console game in which the human behavior distribution is multi-modal, but we want our agent to imitate a single mode of this behavior. We demonstrate that we can align our agent to consistently perform the desired mode, while providing insights and advice for successfully applying this approach to training agents. Project webpage at https://adamjelley.github.io/aligning-agents-like-LLMs .

Summary

We haven't generated a summary for this paper yet.

Summarize Now

GitHub

Aligning Agents like LLMs | Adam Jelley¹², Yuhan Cao¹, David Bignell¹,</br>Sam Devlin¹, Tabish Rashid¹</br> ¹Microsoft Research Cambridge, ²University of Edinburgh</br></br> Website for “Aligning Agents like Large Language Models”. ArXiv: https://arxiv.org/abs/2406.04208

Tweets

https://twitter.com/rakeshgohel01/status/1876282310636810430

Aligning Agents like Large Language Models (2406.04208v1)

Summary

Related Papers

GitHub

Tweets