Language Models are Open Knowledge Graphs (2010.11967v1)

Published 22 Oct 2020 in cs.CL, cs.AI, and cs.LG

Abstract: This paper shows how to construct knowledge graphs (KGs) from pre-trained language models (e.g., BERT, GPT-2/3), without human supervision. Popular KGs (e.g., Wikidata, NELL) are built in either a supervised or semi-supervised manner, requiring humans to create knowledge. Recent deep language models automatically acquire knowledge from large-scale corpora via pre-training. The stored knowledge has enabled the language models to improve downstream NLP tasks, e.g., answering questions, and writing code and articles. In this paper, we propose an unsupervised method to cast the knowledge contained within language models into KGs. We show that KGs are constructed with a single forward pass of the pre-trained language models (without fine-tuning) over the corpora. We demonstrate the quality of the constructed KGs by comparing to two KGs (Wikidata, TAC KBP) created by humans. Our KGs also provide open factual knowledge that is new in the existing KGs. Our code and KGs will be made publicly available.

Citations (127)


Summary

The paper introduces the unsupervised 'Match and Map' technique that extracts triplet facts from language models using attention mechanisms.
It demonstrates that the approach achieves over 60% precision in aligning candidate facts with existing knowledge graphs.
The study highlights potential improvements in recall and scalability by leveraging larger language models for more comprehensive knowledge extraction.

Overview of "Language Models are Open Knowledge Graphs"

The paper "Language Models are Open Knowledge Graphs" presents an approach to constructing knowledge graphs (KGs) from pre-trained language models (LMs) such as BERT and GPT-2/3. Its central proposition is to extract knowledge from these LMs without human supervision, a departure from traditional supervised and semi-supervised KG construction methods.

Methodology and Contributions

The methodology centers around the introduction of an unsupervised technique termed "Match and Map" (MaMa). This strategy involves two fundamental stages:

Match Stage: Candidate facts are generated by matching text from a corpus with pre-trained LM knowledge. The technique leverages the attention mechanisms inherent in LMs to identify relationships between entities expressed as triplets (head, relation, tail). Beam search is employed to capture the most probable relationships without any fine-tuning.
Map Stage: These candidate facts are mapped onto structured knowledge graphs. If candidate facts align with existing KG schemas (e.g., Wikidata), they are incorporated directly. Otherwise, they are added to an open schema allowing for knowledge expansion beyond current schemas.
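
The Match stage can be illustrated with a small sketch. This is not the authors' implementation: it assumes the attention weights have already been reduced to a single token-by-token matrix, that the search moves left to right from the head entity to the tail entity, and that a step from one token to the next is scored by the attention the next token pays to the current one. The function name `match_candidates` and the toy attention values are hypothetical.

```python
from typing import List, Tuple


def match_candidates(
    tokens: List[str],
    attention: List[List[float]],
    head: int,
    tail: int,
    beam_size: int = 2,
) -> List[Tuple[float, List[str]]]:
    """Beam search from the head token to the tail token, left to right.

    attention[i][j] is assumed to be the weight token i places on token j,
    already aggregated over layers and heads (an assumption of this sketch).
    Returns (score, relation_tokens) pairs, best score first.
    """
    beams = [(0.0, [head])]  # (accumulated score, path of token indices)
    finished = []
    while beams:
        expanded = []
        for score, path in beams:
            last = path[-1]
            for nxt in range(last + 1, tail + 1):
                # Score the step by the attention `nxt` pays to `last`.
                candidate = (score + attention[nxt][last], path + [nxt])
                if nxt == tail:
                    finished.append(candidate)  # reached the tail entity
                else:
                    expanded.append(candidate)
        # Keep only the best `beam_size` partial paths.
        expanded.sort(key=lambda b: b[0], reverse=True)
        beams = expanded[:beam_size]
    finished.sort(key=lambda b: b[0], reverse=True)
    # Tokens strictly between head and tail form the candidate relation.
    return [(s, [tokens[i] for i in p[1:-1]]) for s, p in finished]


# Toy example with made-up attention weights (not from any real model).
tokens = ["Dylan", "is", "a", "songwriter"]
attention = [
    [0.0, 0.0, 0.0, 0.0],
    [0.3, 0.0, 0.0, 0.0],
    [0.1, 0.2, 0.0, 0.0],
    [0.1, 0.5, 0.2, 0.0],
]
results = match_candidates(tokens, attention, head=0, tail=3)
best_score, best_relation = results[0]
print(("Dylan", " ".join(best_relation), "songwriter"))
```

Because every step moves to a later token, the search always terminates. Summing step scores favors longer paths, so a real system would likely need length handling and filtering that this sketch leaves out.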

A distinguishing feature of the proposed system is its ability to generate "open KGs." These incorporate both facts within existing KG structures and entirely new facts in open schemas, which are not covered by the existing knowledge bases.
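
The split between the fixed schema and the open schema can be sketched as follows. This is a simplified illustration under stated assumptions: entity linking and relation mapping are reduced to dictionary lookups, and the function `map_facts`, both lookup tables, and the identifiers in them are hypothetical placeholders rather than the paper's actual components.

```python
from typing import Dict, List, Tuple

Fact = Tuple[str, str, str]


def map_facts(
    candidates: List[Fact],
    entity_index: Dict[str, str],
    relation_index: Dict[str, str],
) -> Tuple[List[Fact], List[Fact]]:
    """Split candidate facts into fully mapped facts and open-schema facts.

    A fact is "mapped" only when head, relation, and tail all resolve to
    identifiers in the existing KG schema. Anything else goes to the open
    schema, keeping whichever parts did resolve.
    """
    mapped: List[Fact] = []
    open_schema: List[Fact] = []
    for head, relation, tail in candidates:
        h = entity_index.get(head.lower())
        r = relation_index.get(relation.lower())
        t = entity_index.get(tail.lower())
        if h is not None and r is not None and t is not None:
            mapped.append((h, r, t))
        else:
            # Keep surface text for the parts that could not be mapped.
            open_schema.append((h or head, r or relation, t or tail))
    return mapped, open_schema


# Illustrative lookup tables; the identifiers are placeholders.
entity_index = {"dylan": "Q392", "songwriter": "Q753110"}
relation_index = {"is a": "P106"}

candidates = [
    ("Dylan", "is a", "songwriter"),
    ("Dylan", "signed", "Albert Grossman"),
]
mapped, open_schema = map_facts(candidates, entity_index, relation_index)
print(mapped)
print(open_schema)
```

In this sketch the second fact is only partially mapped (its head resolves, its relation and tail do not), so it lands in the open schema with the surface text preserved.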

Results and Evaluation

The paper evaluates MaMa against two human-created knowledge graphs, TAC KBP and Wikidata. Precision exceeded 60% in several settings, and MaMa outperformed existing open information extraction systems such as Stanford OpenIE and OpenIE 5.1. Recall was lower, which leaves room for future improvement, especially with larger and more diverse language models.
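
For readers unfamiliar with the metrics, precision and recall against a reference KG can be computed as in the sketch below. It assumes exact matching of mapped triplets against a gold set. The function name `precision_recall_f1` and the toy facts are hypothetical and do not reproduce the paper's data or scoring protocol.

```python
from typing import Set, Tuple

Fact = Tuple[str, str, str]


def precision_recall_f1(predicted: Set[Fact], gold: Set[Fact]) -> Tuple[float, float, float]:
    """Exact-match precision, recall, and F1 of predicted facts against gold facts."""
    correct = len(predicted & gold)
    precision = correct / len(predicted) if predicted else 0.0
    recall = correct / len(gold) if gold else 0.0
    if precision + recall == 0.0:
        return precision, recall, 0.0
    f1 = 2 * precision * recall / (precision + recall)
    return precision, recall, f1


# Toy facts for illustration only.
predicted = {("a", "r1", "b"), ("a", "r2", "c"), ("d", "r1", "e")}
gold = {("a", "r1", "b"), ("a", "r2", "c"), ("f", "r3", "g"), ("h", "r1", "i")}

precision, recall, f1 = precision_recall_f1(predicted, gold)
print(f"precision={precision:.3f} recall={recall:.3f} f1={f1:.3f}")
```

Two of the three predicted facts appear in the gold set, and two of the four gold facts are recovered, which is the high-precision, lower-recall pattern described above.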

Moreover, deeper and larger LMs like GPT-2 XL exhibit enhanced performance, suggesting that larger models embed richer and more complex knowledge structures. Notably, BERT-based models demonstrated higher recall than their similarly sized GPT-2 counterparts, suggesting that BERT’s masked language modeling objective is effective for this task.

Implications and Future Directions

The implications of extracting open KGs from LMs extend into various domains, primarily for knowledge expansion in AI applications, KG construction, and enhancing deep neural network interpretability. As demonstrated, MaMa can uncover "new-in-the-existing-KG" factual knowledge, broadening the useful scope of KGs.

Looking ahead, advancing this work involves improving recall and exploring the potential of even larger LMs, such as GPT-3 or Megatron-LM. There is also room for refining the alignment algorithms for better entity detection and relation mapping. Furthermore, integrating crowdsourcing evaluations or more sophisticated methods, such as graph neural networks, could enhance both the extraction precision and the understanding of the nuanced knowledge embedded in LMs.

In conclusion, this paper offers a compelling perspective on using the knowledge stored in language models to construct and expand knowledge graphs, and it gives insight into how much knowledge LMs acquire without supervision. The approach could help bridge the gap between deep learning models and structured knowledge representation systems.
