Large Language Models Are Neurosymbolic Reasoners (2401.09334v1)
Abstract: A wide range of real-world applications is characterized by their symbolic nature, necessitating a strong capability for symbolic reasoning. This paper investigates the potential application of LLMs as symbolic reasoners. We focus on text-based games, significant benchmarks for agents with natural language capabilities, particularly in symbolic tasks like math, map reading, sorting, and applying common sense in text-based worlds. To facilitate these agents, we propose an LLM agent designed to tackle symbolic challenges and achieve in-game objectives. We begin by initializing the LLM agent and informing it of its role. The agent then receives observations and a set of valid actions from the text-based games, along with a specific symbolic module. With these inputs, the LLM agent chooses an action and interacts with the game environments. Our experimental results demonstrate that our method significantly enhances the capability of LLMs as automated agents for symbolic reasoning, and our LLM agent is effective in text-based games involving symbolic tasks, achieving an average performance of 88% across all tasks.
- Learning dynamic belief graphs to generalize on text-based games. In Advances in Neural Information Processing Systems.
- Graph constrained reinforcement learning for natural language action spaces. In International Conference on Learning Representations.
- How to Motivate Your Dragon: Teaching Goal-Driven Agents to Speak and Act in Fantasy Worlds. In Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 807–833.
- A hybrid neuro-symbolic approach for text-based games using inductive logic programming. In Combining Learning and Reasoning: Programming Languages, Formalisms, and Representations.
- In-context policy iteration. arXiv preprint arXiv:2210.03821.
- Language models are few-shot learners. In Advances in Neural Information Processing Systems.
- SenticNet 7: A commonsense-based neurosymbolic AI framework for explainable sentiment analysis. In Proceedings of the Thirteenth Language Resources and Evaluation Conference, 3829–3839.
- Decision transformer: Reinforcement learning via sequence modeling. In Advances in Neural Information Processing Systems.
- Textworld: A learning environment for text-based games. In Computer Games: 7th Workshop, CGW 2018, Held in Conjunction with the 27th International Conference on Artificial Intelligence.
- Deep Reinforcement Learning with a Combinatorial Action Space for Predicting Popular Reddit Threads. In Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 1838–1848.
- What would jiminy cricket do? Towards agents that behave morally. In Advances in Neural Information Processing Systems.
- Language models as zero-shot planners: Extracting actionable knowledge for embodied agents. In International Conference on Machine Learning.
- Inner monologue: Embodied reasoning through planning with language models. arXiv preprint arXiv:2207.05608.
- TextWorldExpress: Simulating Text Games at One Million Steps Per Second. In Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics: System Demonstrations, 169–177.
- LOA: Logical Optimal Actions for Text-based Interaction Games. In Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing: System Demonstrations, 227–231.
- Neuro-Symbolic Reinforcement Learning with First-Order Logic. In Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 3505–3511.
- Reward design with language models. In International Conference on Learning Representations.
- Deep Learning For Symbolic Mathematics. In International Conference on Learning Representations.
- Code as policies: Language model programs for embodied control. In 2023 IEEE International Conference on Robotics and Automation, 9493–9500.
- Learning object-oriented dynamics for planning from text. In International Conference on Learning Representations.
- Self-refine: Iterative refinement with self-feedback. arXiv preprint arXiv:2303.17651.
- MetaICL: Learning to Learn In Context. In Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2791–2809.
- Playing atari with deep reinforcement learning. arXiv preprint arXiv:1312.5602.
- Text-based rl agents with commonsense knowledge: New challenges, environments and baselines. In Proceedings of the AAAI Conference on Artificial Intelligence, volume 35, 9018–9027.
- Language Understanding for Text-based Games using Deep Reinforcement Learning. In Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, 1–11.
- OpenAI. 2023. GPT-4 Technical Report. ArXiv, abs/2303.08774.
- A survey of text games for reinforcement learning informed by natural language. Transactions of the Association for Computational Linguistics, 873–887.
- Contrastive reinforcement learning of symbolic reasoning domains. In Advances in Neural Information Processing Systems.
- Limitations of Language Models in Arithmetic and Symbolic Induction. In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 9285–9298.
- Exploring the limits of transfer learning with a unified text-to-text transformer. The Journal of machine learning Research, 21(1): 5485–5551.
- Fire Burns, Sword Cuts: Commonsense Inductive Bias for Exploration in Text-based Games. In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 515–522.
- Stay moral and explore: Learn to behave morally in text-based games. In International Conference on Learning Representations.
- Reflexion: an autonomous agent with dynamic memory and self-reflection. arXiv preprint arXiv:2303.11366.
- Progprompt: Generating situated robot task plans using large language models. In 2023 IEEE International Conference on Robotics and Automation, 11523–11530.
- Reinforcement learning: An introduction. MIT press.
- ChatGPT for Robotics: Design Principles and Model Abilities. Technical Report MSR-TR-2023-8, Microsoft.
- ScienceWorld: Is your Agent Smarter than a 5th Grader? In Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 11279–11298.
- Behavior Cloned Transformers are Neurosymbolic Reasoners. In Conference of the European Chapter of the Association for Computational Linguistics, 2777–2788.
- NLProlog: Reasoning with Weak Unification for Question Answering in Natural Language. In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, 6151–6161.
- Chain-of-Thought Prompting Elicits Reasoning in Large Language Models. In Advances in Neural Information Processing Systems.
- Generalization in Text-based Games via Hierarchical Reinforcement Learning. In Findings of the Association for Computational Linguistics: EMNLP 2021, 1343–1353.
- Perceiving the World: Question-guided Reinforcement Learning for Text-based Games. In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics, 538–560.
- Deep reinforcement learning with stacked hierarchical attention for text-based games. In Advances in Neural Information Processing Systems.
- Foundation models for decision making: Problems, methods, and opportunities. arXiv preprint arXiv:2303.04129.
- Keep CALM and Explore: Language Models for Action Generation in Text-based Games. In Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 8736–8754.
- React: Synergizing reasoning and acting in language models. arXiv preprint arXiv:2210.03629.
- Comprehensible context-driven text game playing. In IEEE Conference on Games, 1–8.
- Counting to explore and generalize in text-based games. arXiv preprint arXiv:1806.11525.
- PIGLeT: Language Grounding Through Neuro-Symbolic Interaction in a 3D World. In Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers), 2040–2050.