MemoChat: Tuning LLMs to Use Memos for Consistent Long-Range Open-Domain Conversation (2308.08239v2)

Published 16 Aug 2023 in cs.CL

Abstract: We propose MemoChat, a pipeline for refining instructions that enables LLMs to effectively employ self-composed memos for maintaining consistent long-range open-domain conversations. We demonstrate a long-range open-domain conversation through iterative "memorization-retrieval-response" cycles. This requires us to carefully design tailored tuning instructions for each distinct stage. The instructions are reconstructed from a collection of public datasets to teach the LLMs to memorize and retrieve past dialogues with structured memos, leading to enhanced consistency when participating in future conversations. We invite experts to manually annotate a test set designed to evaluate the consistency of answers to long-range conversation questions. Experiments on three testing scenarios involving both open-source and API-accessible chatbots at scale verify the efficacy of MemoChat, which outperforms strong baselines. Our code, data, and models are available here: https://github.com/LuJunru/MemoChat.
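
The "memorization-retrieval-response" cycle described in the abstract can be pictured with a minimal sketch. This is an illustration under stated assumptions, not the authors' implementation: the class and method names (`MemoBot`, `MemoEntry`, `memorize`, `retrieve`, `respond`), the memo fields, and the `buffer_limit` parameter are all hypothetical, and the LLM calls that MemoChat would make at each stage are replaced by simple string operations so the example runs on its own.

```python
from dataclasses import dataclass, field


@dataclass
class MemoEntry:
    # Assumed memo structure for illustration: a topic, a summary, and the raw turns.
    topic: str
    summary: str
    dialogue: list


@dataclass
class MemoBot:
    # Hypothetical names throughout; each LLM step is replaced by a simple stand-in.
    buffer_limit: int = 4
    memo: list = field(default_factory=list)
    recent: list = field(default_factory=list)

    def memorize(self):
        """Fold the buffered turns into one memo entry (stand-in for LLM memo writing)."""
        if not self.recent:
            return
        topic = self.recent[0].split(":", 1)[-1].strip()[:40]
        summary = " | ".join(self.recent)
        self.memo.append(MemoEntry(topic, summary, list(self.recent)))
        self.recent.clear()

    def retrieve(self, query):
        """Return memo entries sharing a word with the query (stand-in for LLM retrieval)."""
        words = set(query.lower().split())
        return [m for m in self.memo if words & set(m.summary.lower().split())]

    def respond(self, query):
        """Run one cycle: memorize if the buffer is full, retrieve evidence, then reply."""
        if len(self.recent) >= self.buffer_limit:
            self.memorize()
        evidence = self.retrieve(query)
        reply = f"(reply using {len(evidence)} memo entries)"  # stand-in for generation
        self.recent += [f"user: {query}", f"bot: {reply}"]
        return reply
```

With `buffer_limit=2`, a first call to `respond` fills the buffer, and the second call folds those turns into a memo entry before retrieving from it. In a real system each of the three steps would be an instructed LLM call rather than keyword matching.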

Citations (22)

GitHub

GitHub - LuJunru/MemoChat: MemoChat: Tuning LLMs to Use Memos for Consistent Long-Range Open-Domain Conversation (15 stars)
