From Discrimination to Generation: Knowledge Graph Completion with Generative Transformer (2202.02113v7)

Published 4 Feb 2022 in cs.CL, cs.AI, cs.DB, cs.IR, and cs.LG

Abstract: Knowledge graph completion aims to address the problem of extending a KG with missing triples. In this paper, we provide an approach GenKGC, which converts knowledge graph completion to sequence-to-sequence generation task with the pre-trained language model. We further introduce relation-guided demonstration and entity-aware hierarchical decoding for better representation learning and fast inference. Experimental results on three datasets show that our approach can obtain better or comparable performance than baselines and achieve faster inference speed compared with previous methods with pre-trained language models. We also release a new large-scale Chinese knowledge graph dataset AliopenKG500 for research purpose. Code and datasets are available in https://github.com/zjunlp/PromptKG/tree/main/GenKGC.

Citations (71)

Summary

  • The paper introduces GenKGC, reframing knowledge graph completion as a generative seq2seq task using BART.
  • It employs relation-guided demonstration and entity-aware hierarchical decoding to enhance few-shot performance and optimize inference.
  • Experiments on three datasets show performance better than or comparable to baselines, with faster inference than previous methods based on pre-trained language models.

Knowledge Graph Completion with GenKGC: A Generative Approach

The paper "From Discrimination to Generation: Knowledge Graph Completion with Generative Transformer" presents GenKGC, a method for knowledge graph completion (KGC) that generates the missing entity with a sequence-to-sequence (seq2seq) model. It is intended to improve on traditional discriminative techniques, which rely heavily on pre-defined scoring functions and expensive negative sampling.

Methodological Overview

The authors propose the GenKGC framework, which models the KGC task as a seq2seq generation problem using the pre-trained language model BART. Entities and relations are represented as text sequences, so the model predicts a missing triple by generating the target entity as its output sequence. This contrasts with prior discriminative models such as TransE, ComplEx, and RotatE, which embed entities and relations in vector spaces and score candidate triples with geometric operations.
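To make the seq2seq framing concrete, the following is a minimal sketch of how a triple could be turned into a (source, target) text pair. The `Triple` type, the `to_seq2seq_example` helper, and the `[SEP]` separator are illustrative assumptions, not the paper's actual preprocessing.

```python
from typing import NamedTuple, Tuple


class Triple(NamedTuple):
    head: str
    relation: str
    tail: str


# Assumed separator for illustration; the real input format may differ.
SEP = " [SEP] "


def to_seq2seq_example(triple: Triple) -> Tuple[str, str]:
    """Turn the query (head, relation, ?) into a (source, target) text pair.

    The source is the linearized head and relation; the target is the
    textual name of the tail entity that the model should generate.
    """
    source = f"{triple.head}{SEP}{triple.relation}"
    target = triple.tail
    return source, target


source, target = to_seq2seq_example(
    Triple("Steve Jobs", "founder of", "Apple Inc.")
)
print(source)  # Steve Jobs [SEP] founder of
print(target)  # Apple Inc.
```

In such a setup, a seq2seq model would be fine-tuned to map `source` to `target`, so no explicit scoring function over triples is needed.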

Innovative Framework Components

  1. Relation-Guided Demonstration: Drawing on prompt-based learning, the paper adds triples that share the query's relation to the input sequence as demonstrations. These examples are intended to improve few-shot performance and help the model learn the relation.
  2. Entity-Aware Hierarchical Decoding: To avoid the cost of scoring every candidate entity, the authors use beam search under entity-aware hierarchical constraints. A prefix tree and type-driven constraints restrict what the decoder may generate, which significantly reduces inference time.
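Both components can be illustrated with a small, self-contained sketch. It is a toy under stated assumptions, not the authors' implementation: `build_input`, `EntityTrie`, and `constrained_greedy_decode` are hypothetical names, the `[SEP]` and `</s>` markers are assumed, tokens are whole words, greedy search stands in for beam search, and the scores come from a lookup table rather than a model.

```python
from typing import Callable, Dict, List, Sequence, Tuple

SEP = " [SEP] "  # assumed separator, not necessarily the paper's
END = "</s>"     # assumed end-of-entity marker


def build_input(head: str, relation: str,
                demonstrations: Sequence[Tuple[str, str, str]]) -> str:
    """Query text followed by demonstration triples sharing its relation."""
    same_relation = [t for t in demonstrations if t[1] == relation]
    parts = [f"{head}{SEP}{relation}"]
    parts += [f"{h}{SEP}{r}{SEP}{t}" for h, r, t in same_relation]
    return SEP.join(parts)


class EntityTrie:
    """Prefix tree over tokenized entity names."""

    def __init__(self, entities: Sequence[Sequence[str]]) -> None:
        self.root: Dict[str, dict] = {}
        for tokens in entities:
            node = self.root
            for tok in tokens:
                node = node.setdefault(tok, {})
            node[END] = {}  # marks the end of a complete entity name

    def allowed_next(self, prefix: Sequence[str]) -> List[str]:
        """Tokens that keep the prefix on a path to some valid entity."""
        node = self.root
        for tok in prefix:
            if tok not in node:
                return []
            node = node[tok]
        return sorted(node)


def constrained_greedy_decode(
        trie: EntityTrie,
        score: Callable[[List[str], str], float]) -> List[str]:
    """Pick the best-scoring allowed token at each step until END."""
    prefix: List[str] = []
    while True:
        options = trie.allowed_next(prefix)
        if not options:  # only reachable with an empty trie
            return prefix
        best = max(options, key=lambda tok: score(prefix, tok))
        if best == END:
            return prefix
        prefix.append(best)


trie = EntityTrie([["New", "York"], ["New", "Jersey"], ["Paris"]])
toy_scores = {"New": 1.0, "York": 2.0, "Jersey": 0.5, "Paris": 0.2, END: 3.0}
decoded = constrained_greedy_decode(
    trie, lambda prefix, tok: toy_scores.get(tok, 0.0))
model_input = build_input(
    "Steve Jobs", "founder of",
    [("Bill Gates", "founder of", "Microsoft"),
     ("Paris", "capital of", "France")])
print(decoded)      # ['New', 'York']
print(model_input)
```

Because each step only considers tokens returned by `allowed_next`, the decoder can never produce a string that is not an entity name. One way to approximate type-driven constraints in this sketch (an assumption, not necessarily how the paper does it) would be to build the trie only from entities whose type is compatible with the query relation.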

Experimental Results

The framework was evaluated on three datasets: WN18RR, FB15k-237, and OpenBG500, a newly introduced large-scale dataset (apparently the one the abstract calls AliopenKG500).

  • Performance Metrics: GenKGC achieved performance comparable to existing models with a notable reduction in inference time. In particular, inference on OpenBG500, which has 269,658 entities, was drastically faster than with previous methods based on pre-trained language models, such as KG-BERT.
  • Efficiency: The approach reduces the memory and computational demands typically associated with large-scale knowledge graphs, making it viable for real-world applications.
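A rough back-of-the-envelope sketch shows why generation can be cheaper than scoring every candidate. The beam size and maximum target length below are assumed values chosen for illustration, and a decoder step does not cost the same as a full forward pass, so the ratio only indicates the order of magnitude.

```python
def candidate_scoring_passes(num_entities: int) -> int:
    """Forward passes per query when each candidate entity is scored separately."""
    return num_entities


def generation_decoder_steps(beam_size: int, max_target_len: int) -> int:
    """Upper bound on decoder steps per query: every beam extended once per position."""
    return beam_size * max_target_len


num_entities = 269_658  # entity count reported for OpenBG500
passes = candidate_scoring_passes(num_entities)
# beam_size and max_target_len are illustrative assumptions.
steps = generation_decoder_steps(beam_size=10, max_target_len=16)

print(passes)                 # 269658
print(steps)                  # 160
print(round(passes / steps))  # 1685
```

The key point is that the cost of candidate scoring grows with the number of entities, while the number of decoding steps depends only on the beam size and the length of the entity name.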

Implications and Future Directions

The GenKGC framework signifies a promising shift towards generative models within the KGC domain, highlighting:

  • Practical Implications: Enhanced efficiency and scalability of KGC processes, crucial for industrial applications where large datasets and rapid inference are paramount.
  • Theoretical Implications: Encouragement for further exploration of generative paradigms in knowledge representation tasks, potentially uncovering new insights and efficiencies by exploiting the seq2seq capabilities.

The paper anticipates future work on modeling entity relationships at a finer granularity and on additional mechanisms for refining the hierarchical decoding process.
