Graph Neural Prompting with Large Language Models (2309.15427v2)
Abstract: LLMs have shown remarkable generalization capability with exceptional performance in various language modeling tasks. However, they still exhibit inherent limitations in precisely capturing and returning grounded knowledge. While existing work has explored utilizing knowledge graphs (KGs) to enhance language modeling via joint training and customized model architectures, applying this to LLMs is problematic owing to their large number of parameters and high computational cost. Therefore, how to enhance pre-trained LLMs using grounded knowledge, e.g., retrieval-augmented generation, remains an open question. In this work, we propose Graph Neural Prompting (GNP), a novel plug-and-play method to assist pre-trained LLMs in learning beneficial knowledge from KGs. GNP encompasses various designs, including a standard graph neural network encoder, a cross-modality pooling module, a domain projector, and a self-supervised link prediction objective. Extensive experiments on multiple datasets demonstrate the superiority of GNP on both commonsense and biomedical reasoning tasks across different LLM sizes and settings. Code is available at https://github.com/meettyj/GNP.
- Palm 2 technical report. arXiv preprint arXiv:2305.10403.
- Knowledge-Augmented Language Model Prompting for Zero-Shot Knowledge Graph Question Answering. In ACL Workshop on Matching Entities.
- A multitask, multilingual, multimodal evaluation of chatgpt on reasoning, hallucination, and interactivity. arXiv preprint arXiv:2302.04023.
- Piqa: Reasoning about physical commonsense in natural language. In AAAI.
- Bodenreider, O. 2004. The unified medical language system (UMLS): integrating biomedical terminology. Nucleic acids research.
- Translating embeddings for modeling multi-relational data. In NeurIPS.
- Language models are few-shot learners. In NeurIPS.
- Sparks of artificial general intelligence: Early experiments with gpt-4. arXiv preprint arXiv:2303.12712.
- MinPrompt: Graph-based Minimal Prompt Data Augmentation for Few-shot Question Answering. arXiv preprint arXiv:2310.05007.
- Reason out Your Layout: Evoking the Layout Master from Large Language Models for Text-to-Image Synthesis. arXiv preprint arXiv:2311.17126.
- Scaling instruction-finetuned language models. arXiv preprint arXiv:2210.11416.
- Think you have solved question answering? try arc, the ai2 reasoning challenge. arXiv preprint arXiv:1803.05457.
- Scalable multi-hop relational reasoning for knowledge-aware question answering. In EMNLP.
- DP-OPT: Make Large Language Model Your Privacy-Preserving Prompt Engineer. arXiv preprint arXiv:2312.03724.
- Universal language model fine-tuning for text classification. In ACL.
- LoRA: Low-Rank Adaptation of Large Language Models. In ICLR.
- A survey on knowledge graphs: Representation, acquisition, and applications. IEEE transactions on neural networks and learning systems.
- Survey of hallucination in natural language generation. ACM Computing Surveys.
- Pubmedqa: A dataset for biomedical research question answering. In EMNLP.
- Jointgt: Graph-text joint representation learning for text generation from knowledge graphs. In ACL-IJCNLP.
- CrowdGraph: A Crowdsourcing Multi-Modal Knowledge Graph Approach to Explainable Fauxtography Detection. Proceedings of the ACM on Human-Computer Interaction.
- The power of scale for parameter-efficient prompt tuning. In EMNLP.
- Retrieval-augmented generation for knowledge-intensive nlp tasks. In NeurIPS.
- Prefix-tuning: Optimizing continuous prompts for generation. In ACL-IJCNLP.
- Kagnet: Knowledge-aware graph networks for commonsense reasoning. In EMNLP.
- RiddleSense: Reasoning about Riddle Questions Featuring Linguistic Creativity and Commonsense Knowledge. In ACL-IJCNLP.
- Pre-train, prompt, and predict: A systematic survey of prompting methods in natural language processing. ACM Computing Surveys.
- Learn to explain: Multimodal reasoning via thought chains for science question answering. In NeurIPS.
- Graph-based reasoning over heterogeneous external knowledge for commonsense question answering. In AAAI.
- Can a suit of armor conduct electricity? a new dataset for open book question answering. In EMNLP.
- Knowledgeable reader: Enhancing cloze-style reading comprehension with external commonsense knowledge. In ACL.
- OpenAI. 2023. GPT-4 Technical Report. arXiv preprint arXiv:2303.08774.
- Unifying Large Language Models and Knowledge Graphs: A Roadmap. arXiv preprint arXiv:2306.08302.
- Lego: Latent execution-guided reasoning for multi-hop question answering on knowledge graphs. In ICML.
- Leveraging large language models for multiple choice question answering. In ICLR.
- Bloom: A 176b-parameter open-access multilingual language model. arXiv preprint arXiv:2211.05100.
- Mededit: Model editing for medical question answering with external knowledge bases. arXiv preprint arXiv:2309.16035.
- Using deepspeed and megatron to train megatron-turing nlg 530b, a large-scale generative language model. arXiv preprint arXiv:2201.11990.
- Conceptnet 5.5: An open multilingual graph of general knowledge. In AAAI.
- Ernie 3.0: Large-scale knowledge enhanced pre-training for language understanding and generation. arXiv preprint arXiv:2107.02137.
- Positive-unlabeled learning with adversarial data augmentation for knowledge graph completion. In IJCAI.
- Heterogeneous Graph Masked Autoencoders. In AAAI.
- Learning MLPs on Graphs: A Unified View of Effectiveness, Robustness, and Efficiency. In ICLR.
- Llama: Open and efficient foundation language models. arXiv preprint arXiv:2302.13971.
- An overview of the BIOASQ large-scale biomedical semantic indexing and question answering competition. BMC bioinformatics.
- Graph attention networks. In ICLR.
- Improving natural language inference using external knowledge in the science questions domain. In AAAI.
- Graph neural networks: Self-supervised learning. Graph Neural Networks: Foundations, Frontiers, and Applications.
- Knowledge Graph Prompting for Multi-Document Question Answering. arXiv preprint arXiv:2308.11730.
- Finetuned language models are zero-shot learners. In ICLR.
- Emergent abilities of large language models. Transactions on Machine Learning Research.
- LLMRec: Large Language Models with Graph Augmentation for Recommendation. In WSDM.
- A learning algorithm for continually running fully recurrent neural networks. Neural computation.
- A Reusable Model-agnostic Framework for Faithfully Explainable Recommendation and System Scrutability. ACM Transactions on Information Systems.
- Embedding entities and relations for learning and inference in knowledge bases. In ICLR.
- Deep bidirectional language-knowledge graph pretraining. In NeurIPS.
- Linkbert: Pretraining language models with document links. In ACL.
- QA-GNN: Reasoning with language models and knowledge graphs for question answering. In NAACL.
- Jaket: Joint pre-training of knowledge graph and language understanding. In AAAI.
- Benchmarking large language models for news summarization. arXiv preprint arXiv:2301.13848.
- GreaseLM: Graph REASoning Enhanced Language Models for Question Answering. In ICLR.
- A survey of large language models. arXiv preprint arXiv:2303.18223.
- Retrieving and reading: A comprehensive survey on open-domain question answering. arXiv preprint arXiv:2101.00774.
Paper Prompts
Sign up for free to create and run prompts on this paper using GPT-5.
Top Community Prompts
Collections
Sign up for free to add this paper to one or more collections.