Recover: A Neuro-Symbolic Framework for Failure Detection and Recovery (2404.00756v1)

Published 31 Mar 2024 in cs.AI, cs.LG, cs.LO, and cs.RO

Abstract: Recognizing failures during task execution and implementing recovery procedures is challenging in robotics. Traditional approaches rely on the availability of extensive data or a tight set of constraints, while more recent approaches leverage LLMs to verify task steps and replan accordingly. However, these methods often operate offline, necessitating scene resets and incurring high costs. This paper introduces Recover, a neuro-symbolic framework for online failure identification and recovery. By integrating ontologies, logical rules, and LLM-based planners, Recover exploits symbolic information to enhance the ability of LLMs to generate recovery plans and to decrease the associated costs. To demonstrate the capabilities of our method in a simulated kitchen environment, we introduce OntoThor, an ontology describing the AI2Thor simulator setting. Empirical evaluation shows that OntoThor's logical rules accurately detect all failures in the analyzed tasks, and that Recover considerably outperforms, for both failure detection and recovery, a baseline method reliant solely on LLMs.
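
The idea in the abstract of checking symbolic facts against logical rules before consulting an LLM planner can be illustrated with a toy sketch. It is not the paper's implementation: the fact triples, the two rules, and the helper names `detect_failures` and `recovery_prompt` are hypothetical, and the real system uses the OntoThor ontology rather than hand-written Python predicates. No LLM is called here; the sketch only builds the text a planner might receive.

```python
from typing import Callable, List, Optional, Set, Tuple

# Hypothetical symbolic state: a set of (subject, predicate, object) triples.
Fact = Tuple[str, str, str]
# A rule inspects the state and the attempted action and returns a
# failure message, or None when the rule does not fire.
Rule = Callable[[Set[Fact], str], Optional[str]]


def put_with_empty_hand(facts: Set[Fact], action: str) -> Optional[str]:
    """Illustrative rule: a 'put' action needs a held object."""
    holding = any(pred == "holds" for _, pred, _ in facts)
    if action.startswith("put ") and not holding:
        return "robot is not holding any object"
    return None


def slice_without_knife(facts: Set[Fact], action: str) -> Optional[str]:
    """Illustrative rule: a 'slice' action needs a held knife."""
    if action.startswith("slice ") and ("robot", "holds", "knife") not in facts:
        return "slicing requires holding a knife"
    return None


# Assumed rule set; a real ontology would supply these as logical rules.
RULES: List[Tuple[str, Rule]] = [
    ("put_with_empty_hand", put_with_empty_hand),
    ("slice_without_knife", slice_without_knife),
]


def detect_failures(facts: Set[Fact], action: str) -> List[Tuple[str, str]]:
    """Return (rule name, message) for every rule that fires."""
    failures = []
    for name, rule in RULES:
        message = rule(facts, action)
        if message is not None:
            failures.append((name, message))
    return failures


def recovery_prompt(action: str, failures: List[Tuple[str, str]]) -> str:
    """Build the text a planner could receive; only detected failures are included."""
    lines = [f"Action '{action}' failed for these reasons:"]
    lines += [f"- {message}" for _, message in failures]
    lines.append("Propose a short recovery plan.")
    return "\n".join(lines)


if __name__ == "__main__":
    state = {("robot", "holds", "apple")}
    found = detect_failures(state, "slice apple")
    if found:
        print(recovery_prompt("slice apple", found))
```

In this sketch the planner prompt is built only when a rule fires, which loosely mirrors how symbolic detection could reduce the number of LLM calls.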
