Awakening Augmented Generation: Learning to Awaken Internal Knowledge of Large Language Models for Question Answering (2403.15268v5)

Published 22 Mar 2024 in cs.CL

Abstract: Retrieval-Augmented-Generation and Generation-Augmented-Generation have been proposed to enhance the knowledge required for question answering with LLMs by leveraging richer context. However, the former relies on external resources, and both require incorporating explicit documents into the context, which increases execution costs and susceptibility to noise data during inference. Recent works indicate that LLMs model rich knowledge, but it is often not effectively activated and awakened. Inspired by this, we propose a novel knowledge-augmented framework, $\textbf{Awakening-Augmented-Generation}$ (AAG), which mimics the human ability to answer questions using only thinking and recalling to compensate for knowledge gaps, thereby awaking relevant knowledge in LLMs without relying on external resources. AAG consists of two key components for awakening richer context. Explicit awakening fine-tunes a context generator to create a synthetic, compressed document that functions as symbolic context. Implicit awakening utilizes a hypernetwork to generate adapters based on the question and synthetic document, which are inserted into LLMs to serve as parameter context. Experimental results on three datasets demonstrate that AAG exhibits significant advantages in both open-domain and closed-book settings, as well as in out-of-distribution generalization. Our code will be available at \url{https://github.com/Xnhyacinth/IAG}.

Summary

We haven't generated a summary for this paper yet.

Summarize Now

Tweets

https://twitter.com/arjunaaqa/status/1772155057615589666

YouTube

Show All Videos

Awakening Augmented Generation: Learning to Awaken Internal Knowledge of Large Language Models for Question Answering (2403.15268v5)

Summary

Related Papers

Tweets

YouTube