Aligning LLM Agents by Learning Latent Preference from User Edits

(2404.15269)
Published Apr 23, 2024 in cs.CL, cs.AI, cs.IR, and cs.LG

Abstract

We study interactive learning of language agents based on user edits made to the agent's output. In a typical setting such as writing assistants, the user interacts with a language agent to generate a response given a context, and may optionally edit the agent's response to personalize it based on their latent preference, in addition to improving its correctness. The edit feedback is naturally generated, making it a suitable candidate for improving the agent's alignment with the user's preference, and for reducing the cost of user edits over time. We propose a learning framework, PRELUDE, that infers a description of the user's latent preference from historic edit data and uses it to define a prompt policy that drives future response generation. This avoids fine-tuning the agent, which is costly, challenging to scale with the number of users, and may even degrade its performance on other tasks. Furthermore, learning a descriptive preference improves interpretability, allowing the user to view and modify the learned preference. However, user preference can be complex and vary with context, making it challenging to learn. To address this, we propose a simple yet effective algorithm named CIPHER that leverages an LLM to infer the user's preference for a given context based on user edits. On future contexts, CIPHER retrieves inferred preferences from the k-closest contexts in the history and forms an aggregate preference for response generation. We introduce two interactive environments -- summarization and email writing -- for evaluation using a GPT-4 simulated user. We compare with algorithms that directly retrieve user edits but do not learn a descriptive preference, and algorithms that learn a context-agnostic preference. On both tasks, CIPHER achieves the lowest edit distance cost and learns preferences that show significant similarity to the ground truth preferences.

Interactive learning process from user edits; agent uses plain text revisions as feedback.

Overview

  • This paper introduces a learning framework named PRELUDE, which utilizes edits made by users to improve and personalize future responses from LLMs without retraining the entire model.

  • A key component, CIPHER, infers user preferences by analyzing historical edits and retrieving them for similar contexts, improving response accuracy and reducing the need for user corrections over time.

  • Empirical tests show that CIPHER outperforms baseline methods in simulated environments for tasks like summarization and email writing, demonstrating practical effectiveness in improving user satisfaction without massive computational costs.

Exploring Preference Learning through User Edits in Language Models

Introduction

Language agents, especially those powered by LLMs, are becoming increasingly integral to applications ranging from writing assistants to customer support. While LLMs exhibit robust zero-shot capabilities, their generic responses often lack personalization, which can be crucial for user-specific tasks. A natural and frequent form of user feedback in these applications is the edits users make to the responses generated by these agents. This paper introduces a novel learning framework called PRELUDE, which stands for PREference Learning from User's Direct Edits, focusing on harnessing these user edits not just to adjust responses on the fly but to understand and adapt to user-specific preferences over time.

The Mechanics of PRELUDE and CIPHER

PRELUDE does not fine-tune the underlying LLM, sidestepping the scalability and cost issues of per-user model adjustments. Instead, it learns a 'prompt policy' that predicts user preferences from previously observed edits and incorporates them into the model's generation process. This mechanism infers preferences without the computational and logistical overhead of model retraining.
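The interaction loop can be sketched as follows. This is an illustrative toy, not the authors' code: `generate`, `user_edit`, and `infer_preference` stand in for LLM and user calls, and `memory` is a simple context-to-preference map.

```python
def prelude_round(context, generate, user_edit, infer_preference, memory):
    """One round of a PRELUDE-style interaction (sketch).

    generate(prompt)                     -> agent's draft response
    user_edit(response)                  -> user's (possibly unchanged) revision
    infer_preference(ctx, resp, edited)  -> textual preference description
    memory: dict mapping contexts to previously learned preferences
    """
    preference = memory.get(context)  # learned on earlier rounds, if any
    prompt = context if preference is None else f"[preference: {preference}] {context}"
    response = generate(prompt)
    edited = user_edit(response)      # the user may revise the draft
    if edited != response:            # an edit reveals the latent preference
        memory[context] = infer_preference(context, response, edited)
    return response, edited
```

Over repeated rounds the memory accumulates preference descriptions, so later prompts are conditioned on what earlier edits revealed, which is how the framework reduces edit cost over time.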

A critical component of PRELUDE is an algorithm called CIPHER (Consolidates Induced Preferences based on Historical Edits with Retrieval). CIPHER operates by retrieving the user's past edits and preferences, identifying patterns, and using these to predict future preferences in similar contexts. Notably, this process typically requires shorter prompts compared to methods that use longer contextual retrievals, thus reducing the overhead on the model.
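The retrieve-and-aggregate step described above can be sketched as a small class. This is a minimal illustration under stated assumptions: `record` and `aggregate_preference` are hypothetical names, context embeddings are plain vectors, and the aggregation is a simple concatenation, whereas the paper has an LLM form the aggregate preference.

```python
from dataclasses import dataclass, field

@dataclass
class CipherMemory:
    """Toy sketch of CIPHER's k-nearest preference retrieval."""
    history: list = field(default_factory=list)  # (embedding, preference) pairs

    def record(self, context_embedding, inferred_preference):
        self.history.append((context_embedding, inferred_preference))

    def aggregate_preference(self, query_embedding, k=3):
        if not self.history:
            return None  # nothing learned yet; the agent answers generically

        def cosine(u, v):
            dot = sum(x * y for x, y in zip(u, v))
            nu = sum(x * x for x in u) ** 0.5
            nv = sum(x * x for x in v) ** 0.5
            return dot / (nu * nv) if nu and nv else 0.0

        # Rank stored contexts by similarity to the new context and
        # aggregate the k nearest preferences (here: concatenation).
        ranked = sorted(self.history,
                        key=lambda pair: cosine(query_embedding, pair[0]),
                        reverse=True)
        return "; ".join(pref for _, pref in ranked[:k])
```

Because only short preference descriptions are retrieved, the resulting prompt stays compact relative to retrieving full past edits, which is the overhead reduction noted above.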

Empirical Evaluations and Findings

The authors tested CIPHER within two simulated interactive environments representing common use cases of language models: summarization and email writing tasks. The simulated users in these environments interacted with the agent, providing naturalistic edits based on latent preferences specific to different document types. For instance, a movie review might elicit preferences for a concise, point-wise summary, while an academic abstract might require detailed explanations.

In both environments, CIPHER demonstrated a stronger ability to reduce the edit distance (a metric quantifying the discrepancy between the generated text and the user-edited version) compared to several baseline methods. Specifically, it outperformed the no-learning baseline by adapting to user preferences over time, as indicated by the progressively decreasing requirement for user edits.
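For concreteness, the edit distance here is the standard Levenshtein distance. Below is a character-level dynamic-programming sketch; the paper's exact cost may be computed at a different granularity (e.g. over tokens), so treat this as an illustration of the metric rather than the authors' implementation.

```python
def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance: minimum number of single-character
    insertions, deletions, and substitutions turning a into b."""
    prev = list(range(len(b) + 1))  # distances from "" to each prefix of b
    for i, ca in enumerate(a, start=1):
        curr = [i]  # distance from a[:i] to ""
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # delete ca
                            curr[j - 1] + 1,      # insert cb
                            prev[j - 1] + cost))  # substitute ca -> cb
        prev = curr
    return prev[-1]
```

A falling cumulative edit distance across rounds is the signal that the agent is converging on the user's latent preference.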

Theoretical Contributions and Practical Implications

Beyond empirical performance, the theoretical nuances of this work lie in its approach to aggregate and refine user preferences effectively. By leveraging historical data and avoiding the retraining of base models, PRELUDE and CIPHER bring an economically feasible personalization to language models without compromising the model's core capabilities.

For practical application, embedding a system like CIPHER into consumer-facing LLM applications could significantly enhance user satisfaction by reducing the need for constant corrections and providing responses that feel more intuitively aligned with individual user styles and preferences.

Future Prospects

Looking ahead, this approach could pave the way for more nuanced user-model interactions where the model not only responds accurately but also evolves in line with user preferences seamlessly. Further research might explore the limits of preference learning, particularly in how detailed and subtle preferences can be captured without explicit user feedback.

Furthermore, investigating the robustness of such systems in diverse real-world scenarios, outside of controlled experimental conditions, would be crucial to understanding their utility and areas for enhancement.

Conclusion

The development of CIPHER within the PRELUDE framework marks a significant step towards more personalized, user-aware applications of LLMs. By intelligently leveraging user edits, not merely as feedback but as a window into user preferences, this research contributes both to the enhancement of user experience and to the operational efficiency of deploying language models in personalized applications.
