Papers
Topics
Authors
Recent
Assistant
AI Research Assistant
Well-researched responses based on relevant abstracts and paper content.
Custom Instructions Pro
Preferences or requirements that you'd like Emergent Mind to consider when generating responses.
Gemini 2.5 Flash
Gemini 2.5 Flash 134 tok/s
Gemini 2.5 Pro 41 tok/s Pro
GPT-5 Medium 29 tok/s Pro
GPT-5 High 31 tok/s Pro
GPT-4o 124 tok/s Pro
Kimi K2 204 tok/s Pro
GPT OSS 120B 432 tok/s Pro
Claude Sonnet 4.5 37 tok/s Pro
2000 character limit reached

Opening Articulated Structures in the Real World (2402.17767v3)

Published 27 Feb 2024 in cs.RO, cs.AI, cs.CV, and cs.LG

Abstract: What does it take to build mobile manipulation systems that can competently operate on previously unseen objects in previously unseen environments? This work answers this question using opening of articulated structures as a mobile manipulation testbed. Specifically, our focus is on the end-to-end performance on this task without any privileged information, i.e. the robot starts at a location with the novel target articulated object in view, and has to approach the object and successfully open it. We first develop a system for this task, and then conduct 100+ end-to-end system tests across 13 real world test sites. Our large-scale study reveals a number of surprising findings: a) modular systems outperform end-to-end learned systems for this task, even when the end-to-end learned systems are trained on 1000+ demonstrations, b) perception, and not precise end-effector control, is the primary bottleneck to task success, and c) state-of-the-art articulation parameter estimation models developed in isolation struggle when faced with robot-centric viewpoints. Overall, our findings highlight the limitations of developing components of the pipeline in isolation and underscore the need for system-level research, providing a pragmatic roadmap for building generalizable mobile manipulation systems. Videos, code, and models are available on the project website: https://arjung128.github.io/opening-articulated-structures/

Definition Search Book Streamline Icon: https://streamlinehq.com
References (41)
  1. Do as i can and not as i say: Grounding language in robotic affordances. In arXiv preprint arXiv:2204.01691, 2022.
  2. Demonstrating mobile manipulation in the wild: A metrics-driven approach. In Robotics: Science and Systems XIX, RSS2023. Robotics: Science and Systems Foundation, July 2023. doi: 10.15607/rss.2023.xix.055. URL http://dx.doi.org/10.15607/RSS.2023.XIX.055.
  3. Dmitry Berenson. Obeying Constraints During Motion Planning, pages 1–32. Springer Netherlands, 2018.
  4. Task space regions: A framework for pose-constrained manipulation planning. IJRR, 30(12):1435–1460, 2011.
  5. Whole-body motion planning for manipulation of articulated objects. In ICRA, pages 1656–1662, 2013. ISBN 9781467356411. doi: 10.1109/ICRA.2013.6630792.
  6. Planning for autonomous door opening with a mobile manipulator. In 2010 IEEE International Conference on Robotics and Automation, pages 1799–1806. IEEE, 2010.
  7. Manipulathor: A framework for visual object manipulation. In CVPR, pages 4497–4506, 2021.
  8. Real-time motion planning of legged robots: A model predictive control approach. In ICHR, pages 577–584, 2017.
  9. Deep whole-body control: Learning a unified policy for manipulation and locomotion. In Conference on Robot Learning (CoRL), 2022.
  10. Mobile aloha: Learning bimanual mobile manipulation with low-cost whole-body teleoperation. arXiv preprint arXiv:2401.02117, 2024.
  11. Threedworld: A platform for interactive multi-modal physical simulation. In Thirty-fifth Conference on Neural Information Processing Systems Datasets and Benchmarks Track, 2021.
  12. Predicting motion plans for articulating everyday objects. In International Conference on Robotics and Automation (ICRA). IEEE, 2023.
  13. Mask r-cnn. In ICCV, pages 2961–2969, 2017.
  14. Pulling open doors and drawers: Coordinating an omni-directional base and a compliant arm with equilibrium point control. In 2010 IEEE International Conference on Robotics and Automation, pages 1807–1814. IEEE, 2010.
  15. Opd: Single-view 3d openable part detection. In Shai Avidan, Gabriel Brostow, Moustapha Cissé, Giovanni Maria Farinella, and Tal Hassner, editors, Computer Vision – ECCV 2022, pages 410–426, Cham, 2022. Springer Nature Switzerland. ISBN 978-3-031-19842-7.
  16. An adaptive control approach for opening doors and drawers under uncertainties. IEEE Transactions on Robotics, 32(1):161–175, 2016.
  17. Probabilistic roadmaps for path planning in high-dimensional configuration spaces. IEEE transactions on Robotics and Automation, 12(4):566–580, 1996.
  18. Sampling-based methods for motion planning with constraints. Annual review of control, robotics, and autonomous systems, 1:159–185, 2018.
  19. AI2-THOR: An Interactive 3D Environment for Visual AI. arXiv, 2017.
  20. RRT-connect: An efficient approach to single-query path planning. In ICRA, 2000.
  21. Paris: Part-level reconstruction and motion analysis for articulated objects. In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), pages 352–363, October 2023.
  22. Autonomous door opening and plugging in with a personal robot. In 2010 IEEE International Conference on Robotics and Automation, pages 729–736. IEEE, 2010.
  23. Articulated object interaction in unknown scenes with whole-body mobile manipulation. In IROS, 2022.
  24. Where2act: From pixels to actions for articulated 3d objects. In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), pages 6813–6823, October 2021.
  25. Where2explore: Few-shot affordance learning for unseen novel categories of articulated objects. In Advances in Neural Information Processing Systems, 2023.
  26. Perceptive model predictive control for continuous mobile manipulation. IEEE RA-L, pages 6177–6184, 2020.
  27. High-level control of a mobile manipulator for door opening. In Proceedings. 2000 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2000)(Cat. No. 00CH37113), volume 3, pages 2333–2338. IEEE, 2000.
  28. Understanding 3d object interaction from a single image. arXiv preprint arXiv:2305.09664, 2023.
  29. Habitat-matterport 3d dataset (HM3D): 1000 large-scale 3d environments for embodied AI. In Thirty-fifth Conference on Neural Information Processing Systems Datasets and Benchmarks Track (Round 2), 2021. URL https://openreview.net/forum?id=-v4OuqNs5P.
  30. A generalized framework for opening doors and drawers in kitchen environments. In ICRA, pages 3852–3858, 2012. doi: 10.1109/ICRA.2012.6224929.
  31. Motion planning with sequential convex optimization and convex collision checking. The International Journal of Robotics Research, 33(9):1251–1270, 2014.
  32. On bringing robots home, 2023.
  33. A unified mpc framework for whole-body dynamic locomotion and manipulation. IEEE RA-L, pages 4688–4695, 2021.
  34. Versatile multicontact planning and control for legged loco-manipulation. Science Robotics, 8(81), August 2023. ISSN 2470-9476. doi: 10.1126/scirobotics.adg5014. URL http://dx.doi.org/10.1126/scirobotics.adg5014.
  35. Opdmulti: Openable part detection for multiple objects, 2023.
  36. Robot placement based on reachability inversion. In ICRA, pages 1970–1975, 2013. doi: 10.1109/ICRA.2013.6630839.
  37. VAT-mart: Learning visual action trajectory proposals for manipulating 3d ARTiculated objects. In International Conference on Learning Representations, 2022. URL https://openreview.net/forum?id=iEx3PiooLy.
  38. Sapien: A simulated part-based interactive environment. In CVPR, pages 11097–11107, 2020.
  39. Harmonic mobile manipulation. arXiv preprint arXiv:2312.06639, 2023.
  40. Homerobot: Open-vocabulary mobile manipulation. arXiv preprint arXiv:2306.11565, 2023.
  41. Chomp: Covariant hamiltonian optimization for motion planning. The International Journal of Robotics Research, 32(9-10):1164–1193, 2013.
Citations (2)

Summary

We haven't generated a summary for this paper yet.

Dice Question Streamline Icon: https://streamlinehq.com

Open Problems

We haven't generated a list of open problems mentioned in this paper yet.

Lightbulb Streamline Icon: https://streamlinehq.com

Continue Learning

We haven't generated follow-up questions for this paper yet.

List To Do Tasks Checklist Streamline Icon: https://streamlinehq.com

Collections

Sign up for free to add this paper to one or more collections.

X Twitter Logo Streamline Icon: https://streamlinehq.com

Tweets

This paper has been mentioned in 1 tweet and received 8 likes.

Upgrade to Pro to view all of the tweets about this paper:

Don't miss out on important new AI/ML research

See which papers are being discussed right now on X, Reddit, and more:

“Emergent Mind helps me see which AI papers have caught fire online.”

Philip

Philip

Creator, AI Explained on YouTube