DextrAH-G: Pixels-to-Action Dexterous Arm-Hand Grasping with Geometric Fabrics (2407.02274v3)
Abstract: A pivotal challenge in robotics is achieving fast, safe, and robust dexterous grasping across a diverse range of objects, an important goal within industrial applications. However, existing methods often have very limited speed, dexterity, and generality, along with limited or no hardware safety guarantees. In this work, we introduce DextrAH-G, a depth-based dexterous grasping policy trained entirely in simulation that combines reinforcement learning, geometric fabrics, and teacher-student distillation. We address key challenges in joint arm-hand policy learning, such as high-dimensional observation and action spaces, the sim2real gap, collision avoidance, and hardware constraints. DextrAH-G enables a 23 motor arm-hand robot to safely and continuously grasp and transport a large variety of objects at high speed using multi-modal inputs including depth images, allowing generalization across object geometry. Videos at https://sites.google.com/view/dextrah-g.
- M. Paré and C. Dugas. Developmental changes in prehension during childhood. Experimental brain research, 125:239–247, 1999.
- A. Miller and P. Allen. Graspit! a versatile simulator for robotic grasping. IEEE Robotics & Automation Magazine, 11(4):110–122, 2004. doi:10.1109/MRA.2004.1371616.
- Dexgraspnet: A large-scale robotic dexterous grasp dataset for general objects based on simulation. arXiv preprint arXiv:2210.02697, 2022.
- Learning diverse and physically feasible dexterous grasps with generative model and bilevel optimization. In K. Liu, D. Kulic, and J. Ichnowski, editors, Conference on Robot Learning, CoRL 2022, 14-18 December 2022, Auckland, New Zealand, volume 205 of Proceedings of Machine Learning Research, pages 1938–1948. PMLR, 2022. URL https://proceedings.mlr.press/v205/wu23b.html.
- Frogger: Fast robust grasp generation via the min-weight metric. ArXiv, abs/2302.13687, 2023. URL https://api.semanticscholar.org/CorpusID:257220021.
- C. Ferrari and J. Canny. Planning optimal grasps. In Proceedings 1992 IEEE International Conference on Robotics and Automation, pages 2290–2295 vol.3, 1992. doi:10.1109/ROBOT.1992.219918.
- Gendexgrasp: Generalizable dexterous grasping. arXiv preprint arXiv:2210.00722, 2022.
- Unidexgrasp: Universal robotic dexterous grasping via learning diverse proposal generation and goal-conditioned policy. arXiv preprint arXiv:2303.00938, 2023.
- Unidexgrasp++: Improving dexterous grasping policy learning via geometry-aware curriculum and iterative generalist-specialist learning. arXiv preprint arXiv:2304.00464, 2023.
- Dexrepnet: Learning dexterous robotic grasping network with geometric and spatial hand-object representations. 2023 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), pages 3153–3160, 2023. URL https://api.semanticscholar.org/CorpusID:257622709.
- Dexterous functional grasping. In 7th Annual Conference on Robot Learning, 2023. URL https://openreview.net/forum?id=93qz1k6_6h.
- Dexpoint: Generalizable point cloud reinforcement learning for sim-to-real dexterous manipulation. Conference on Robot Learning (CoRL), 2022.
- Geometric fabrics: a safe guiding medium for policy learning. arXiv preprint arXiv:2405.02250, 2024.
- Rmp2: A structured composable policy class for robot learning. arXiv preprint arXiv:2103.05922, 2021.
- Controlling contact-rich manipulation under partial observability. In Robotics: Science and Systems, 2020.
- Geometric fabrics: Generalizing classical mechanics to capture the physics of behavior. IEEE Robotics and Automation Letters, 7(2):3202–3209, 2022.
- Neural geometric fabrics: Efficiently learning high-dimensional policies from demonstration. In Conference on Robot Learning, pages 1355–1367. PMLR, 2023.
- On the utility of koopman operator theory in learning dexterous manipulation skills. In Conference on Robot Learning, pages 106–126. PMLR, 2023.
- Generalized nonlinear and finsler geometry for robotics. In 2021 IEEE International Conference on Robotics and Automation (ICRA), pages 10206–10212. IEEE, 2021.
- N. Ratliff and K. Van Wyk. Fabrics: A foundationally stable medium for encoding prior experience. arXiv preprint arXiv:2309.07368, 2023.
- M. R. Cutkosky et al. On grasp choice, grasp models, and the design of hands for manufacturing tasks. IEEE Transactions on robotics and automation, 5(3):269–279, 1989.
- Asymmetric actor critic for image-based robot learning. In H. Kress-Gazit, S. S. Srinivasa, T. Howard, and N. Atanasov, editors, Robotics: Science and Systems XIV, Carnegie Mellon University, Pittsburgh, Pennsylvania, USA, June 26-30, 2018, 2018. doi:10.15607/RSS.2018.XIV.008. URL http://www.roboticsproceedings.org/rss14/p08.html.
- A reduction of imitation learning and structured prediction to no-regret online learning. In G. Gordon, D. Dunson, and M. Dudík, editors, Proceedings of the Fourteenth International Conference on Artificial Intelligence and Statistics, volume 15 of Proceedings of Machine Learning Research, pages 627–635, Fort Lauderdale, FL, USA, 11–13 Apr 2011. PMLR. URL https://proceedings.mlr.press/v15/ross11a.html.
- Benchmarking in manipulation research: The ycb object and model set and benchmarking protocols. arXiv preprint arXiv:1502.03143, 2015.
- Dexdiffuser: Generating dexterous grasps with diffusion models. arXiv preprint arXiv:2402.02989, 2024.
- Learning robust real-world dexterous grasping policies via implicit shape augmentation. arXiv preprint arXiv:2210.13638, 2022.
- M. Matak and T. Hermans. Planning visual-tactile precision grasps via complementary use of vision and touch. IEEE Robotics and Automation Letters, 8(2):768–775, 2022.
- Product design for manufacture and assembly. CRC press, 2010.
- Daydreamer: World models for physical robot learning. In K. Liu, D. Kulic, and J. Ichnowski, editors, Conference on Robot Learning, volume 205 of Proceedings of Machine Learning Research, pages 2226–2240, 14–18 Dec 2023.
- Reinforcement learning enables real-time planning and control of agile maneuvers for soft robot arms. 2023.
- Sim-to-real: Learning agile locomotion for quadruped robots. In Proceedings of Robotics: Science and Systems, 2018.
- A. Boeing and T. Bräunl. Leveraging multiple simulators for crossing the reality gap. In International Conference on Control Automation Robotics & Vision, pages 1113–1119, 2012.
- Crossing the reality gap in evolutionary robotics by promoting transferable controllers. In Conference on Genetic and Evolutionary Computation, GECCO ’10, page 119–126. Association for Computing Machinery, 2010. ISBN 9781450300728.
- Sim-to-real transfer of robotic control with dynamics randomization. In IEEE International Conference on Robotics and Automation, pages 3803–3810. IEEE, 2018.
- Learning to walk without dynamics randomization. In 2nd Workshop on Closing the Reality Gap in Sim2Real Transfer for Robotics (Robotics: Science and Systems), 2020.
- RMA: Rapid Motor Adaptation for Legged Robots. In Proceedings of Robotics: Science and Systems, 2021.
- Learning robust perceptive locomotion for quadrupedal robots in the wild. Science Robotics, 7(62):eabk2822, 2022.
- Learning to walk in minutes using massively parallel deep reinforcement learning. In Conference on Robot Learning, volume 164 of Proceedings of Machine Learning Research, pages 91–100, 2022.
- Blind bipedal stair traversal via sim-to-real reinforcement learning. In Proceedings of Robotics: Science and Systems, 2021.
- Learning dexterous in-hand manipulation. The International Journal of Robotics Research, 39(1):3–20, 2020.
- Solving Rubik’s cube with a robot hand. arXiv preprint arXiv:1910.07113, 2019.
- Dextreme: Transfer of agile in-hand manipulation from simulation to reality. arXiv preprint arXiv:2210.13702, 2022.
- Rl-cyclegan: Reinforcement learning aware simulation-to-real. In IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020.
- Adapting Deep Visuomotor Representations with Weak Pairwise Constraints, pages 688–703. Springer International Publishing, 2020. ISBN 978-3-030-43089-4.
- Visual dexterity: In-hand reorientation of novel and complex object shapes. Science Robotics, 8(84):eadc9244, 2023. doi:10.1126/scirobotics.adc9244. URL https://www.science.org/doi/abs/10.1126/scirobotics.adc9244.
- Legged locomotion in challenging terrains using egocentric vision. In Conference on robot learning, pages 403–415. PMLR, 2023.
- Dexycb: A benchmark for capturing hand grasping of objects. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 9044–9053, 2021.
- Dexpilot: Vision-based teleoperation of dexterous robotic hand-arm system. In 2020 IEEE International Conference on Robotics and Automation (ICRA), pages 9164–9170. IEEE, 2020.
- Dexpbt: Scaling up dexterous manipulation for hand-arm systems with population based training. CoRR, abs/2305.12127, 2023. URL https://doi.org/10.48550/arXiv.2305.12127.
- S. Haykin. Neural networks: a comprehensive foundation. Prentice Hall PTR, 1994.
- S. Hochreiter and J. Schmidhuber. Long short-term memory. Neural Comput., 9(8):1735–1780, nov 1997. ISSN 0899-7667. doi:10.1162/neco.1997.9.8.1735. URL https://doi.org/10.1162/neco.1997.9.8.1735.
- Dextreme: Transfer of agile in-hand manipulation from simulation to reality. In 2023 IEEE International Conference on Robotics and Automation (ICRA), pages 5977–5984. IEEE, 2023.
- trimesh. URL https://trimesh.org/.
- Isaac gym: High performance gpu-based physics simulation for robot learning. arXiv preprint arXiv:2108.10470, 2021.
- Proximal policy optimization algorithms. arXiv preprint arXiv:1707.06347, 2017.
- D. Makoviichuk and V. Makoviychuk. rl-games: A high-performance framework for reinforcement learning. https://github.com/Denys88/rl_games, 2022.
- A benchmark for rgb-d visual odometry, 3d reconstruction and slam. In 2014 IEEE international conference on Robotics and automation (ICRA), pages 1524–1531. IEEE, 2014.
- P. J. Werbos. Generalization of backpropagation with application to a recurrent gas market model. Neural Networks, 1(4):339–356, 1988. ISSN 0893-6080. doi:https://doi.org/10.1016/0893-6080(88)90007-X. URL https://www.sciencedirect.com/science/article/pii/089360808890007X.