Chain-of-Knowledge: Grounding Large Language Models via Dynamic Knowledge Adapting over Heterogeneous Sources (2305.13269v4)
Abstract: We present chain-of-knowledge (CoK), a novel framework that augments LLMs by dynamically incorporating grounding information from heterogeneous sources, yielding more factual rationales and reduced hallucination in generation. Specifically, CoK consists of three stages: reasoning preparation, dynamic knowledge adapting, and answer consolidation. Given a knowledge-intensive question, CoK first prepares several preliminary rationales and answers while identifying the relevant knowledge domains. If there is no majority consensus among the sampled answers, CoK corrects the rationales step by step by adapting knowledge from the identified domains. These corrected rationales can plausibly serve as a better foundation for the final answer consolidation. Unlike prior studies that primarily use unstructured data, CoK also leverages structured knowledge sources such as Wikidata and tables, which provide more reliable factual information. To access both unstructured and structured knowledge sources in the dynamic knowledge adapting stage, we propose an adaptive query generator that produces queries in various query languages, including SPARQL, SQL, and natural-language sentences. Moreover, to minimize error propagation between rationales, CoK corrects the rationales progressively, using preceding corrected rationales when generating and correcting subsequent ones. Extensive experiments show that CoK consistently improves the performance of LLMs on knowledge-intensive tasks across different domains.
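To make the three-stage pipeline concrete, the sketch below gives one possible reading of the abstract in Python. It is a minimal sketch, not the authors' implementation: the `llm` and `retrieve` callables, the prompt strings, the candidate-domain list, and the sampling/voting details are all illustrative assumptions.

```python
# Minimal sketch of the three CoK stages described in the abstract.
# `llm` and `retrieve` are hypothetical black-box callables supplied by the caller.
from collections import Counter
from typing import Callable, List


def chain_of_knowledge(
    question: str,
    llm: Callable[[str], str],            # black-box LLM call (assumed interface)
    retrieve: Callable[[str, str], str],  # (domain, query) -> supporting facts (assumed interface)
    candidate_domains: List[str],         # e.g. ["Wikidata", "tables", "Wikipedia text"]
    n_samples: int = 5,
) -> str:
    # --- Stage 1: reasoning preparation ---------------------------------
    # Sample several preliminary chain-of-thought rationales and answers,
    # and let the model identify the relevant knowledge domains.
    samples = [
        llm(f"Q: {question}\nThink step by step, then give the answer on the last line.")
        for _ in range(n_samples)
    ]
    answers = [s.splitlines()[-1].strip() for s in samples]
    answer, votes = Counter(answers).most_common(1)[0]

    # If the sampled answers already reach a majority consensus,
    # accept that answer without knowledge adapting.
    if votes > n_samples // 2:
        return answer

    domains = [
        d.strip()
        for d in llm(f"Which of {candidate_domains} are relevant to: {question}?").split(",")
    ]

    # --- Stage 2: dynamic knowledge adapting -----------------------------
    # Correct the rationale steps one by one; each corrected step is fed back
    # so later steps are generated from already-corrected ones, limiting
    # error propagation between rationales.
    rationale_steps = samples[0].splitlines()[:-1]
    corrected: List[str] = []
    for step in rationale_steps:
        for domain in domains:
            # Adaptive query generator: produce a query in the language of the
            # domain's source (e.g. SPARQL for Wikidata, SQL for tables,
            # a natural-language sentence for a text corpus).
            query = llm(f"Write a query for the {domain} knowledge source to verify: {step}")
            facts = retrieve(domain, query)
            step = llm(
                "Previously corrected steps:\n" + "\n".join(corrected)
                + f"\nRevise the step below so it is consistent with the facts."
                + f"\nFacts: {facts}\nStep: {step}"
            )
        corrected.append(step)

    # --- Stage 3: answer consolidation ------------------------------------
    # Generate the final answer from the corrected rationales.
    return llm(
        f"Q: {question}\nRationale:\n" + "\n".join(corrected) + "\nFinal answer:"
    )
```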