Unsupervised Information Refinement Training of Large Language Models for Retrieval-Augmented Generation (2402.18150v2)

Published 28 Feb 2024 in cs.CL, cs.AI, and cs.IR

Abstract: Retrieval-augmented generation (RAG) enhances LLMs by incorporating additional information from retrieval. However, studies have shown that LLMs still struggle to use the retrieved information effectively, sometimes ignoring it or being misled by it. The key reason is that the training of LLMs does not explicitly teach them how to utilize input retrieved texts of varied quality. In this paper, we propose a novel perspective that considers the role of LLMs in RAG as "Information Refiner", which means that regardless of the correctness, completeness, or usefulness of the retrieved texts, LLMs can consistently integrate knowledge within the retrieved texts and model parameters to generate texts that are more concise, accurate, and complete than the retrieved texts. To this end, we propose an information refinement training method named InFO-RAG that optimizes LLMs for RAG in an unsupervised manner. InFO-RAG is low-cost and general across various tasks. Extensive experiments on zero-shot prediction of 11 datasets in diverse tasks including Question Answering, Slot-Filling, Language Modeling, Dialogue, and Code Generation show that InFO-RAG improves the performance of LLaMA2 by an average of 9.39% relative points. InFO-RAG also shows advantages in in-context learning and robustness of RAG.

Authors

Shicheng Xu
Liang Pang
Mo Yu
Fandong Meng
Huawei Shen
Xueqi Cheng
Jie Zhou
