Papers

Topics

Authors

Recent

View all

Assistant

AI Research Assistant

Well-researched responses based on relevant abstracts and paper content.

Custom Instructions Pro

Preferences or requirements that you'd like Emergent Mind to consider when generating responses.

Gemini 2.5 Flash

Gemini 2.5 Flash 155 tok/s

Gemini 2.5 Pro 43 tok/s Pro

GPT-5 Medium 20 tok/s Pro

GPT-5 High 20 tok/s Pro

GPT-4o 86 tok/s Pro

Kimi K2 184 tok/s Pro

GPT OSS 120B 446 tok/s Pro

Claude Sonnet 4.5 31 tok/s Pro

2000 character limit reached

Expressive Whole-Body Control for Humanoid Robots (2402.16796v2)

Published 26 Feb 2024 in cs.RO and cs.LG

Abstract: Can we enable humanoid robots to generate rich, diverse, and expressive motions in the real world? We propose to learn a whole-body control policy on a human-sized robot to mimic human motions as realistic as possible. To train such a policy, we leverage the large-scale human motion capture data from the graphics community in a Reinforcement Learning framework. However, directly performing imitation learning with the motion capture dataset would not work on the real humanoid robot, given the large gap in degrees of freedom and physical capabilities. Our method Expressive Whole-Body Control (Exbody) tackles this problem by encouraging the upper humanoid body to imitate a reference motion, while relaxing the imitation constraint on its two legs and only requiring them to follow a given velocity robustly. With training in simulation and Sim2Real transfer, our policy can control a humanoid robot to walk in different styles, shake hands with humans, and even dance with a human in the real world. We conduct extensive studies and comparisons on diverse motions in both simulation and the real world to show the effectiveness of our approach.

References (60)

Citations (40)

View on Semantic Scholar

Summary

The paper presents ExBody, a novel RL-based control policy trained on extensive human motion capture data to enable expressive robot movements.
It employs motion retargeting to adapt human motion data to robotic hardware, focusing on human-like upper body gestures.
Results demonstrate effective sim-to-real transfer, allowing humanoid robots to perform diverse tasks such as dancing and interactive movements.

Expressive Whole-Body Control for Humanoid Robots

Introduction

Recent advances in humanoid robotics have sought to bridge the gap between the mechanical movements traditionally associated with robots and the expressive, nuanced motions observed in living beings. In the presented work, a notable contribution towards this aim has been made through the development of an expressive whole-body control policy, referred to as ExBody, for humanoid robots. This initiative not only aims to enhance the robots' ability to perform a wider range of movements but is also designed to imbue them with a level of expressiveness that is closely akin to human motion.

Methodology

Learning from Human Motion Capture Data

The foundation of ExBody's approach lies in utilizing large-scale human motion capture data from the graphics community, integrating this with deep Reinforcement Learning (RL) in a simulated environment. This methodology enables the derivation of a control policy that can subsequently be transferred to a real humanoid robot, facilitating movements that are rich in expressivity.

Motion Retargeting

A key challenge in this process is the inherent difference in degrees of freedom (DoF) and physical capacity between the human models used in motion capture data and the actual robotic hardware. ExBody addresses this by employing motion retargeting techniques, adapting human motion data to fit the robotic model utilized in this paper, the Unitree H1 robot.

Reinforcement Learning Framework

The RL framework constitutes the core of the ExBody approach, where the humanoid robot is trained to follow command-conditioned locomotion alongside achieving specified expressive motion goals. These goals are derived from the human motion capture data, focusing on the upper body to ensure a practical level of expressivity without overly constraining the robot's movement capabilities. The policy is trained in highly randomized environments to ensure robustness and feasibility of sim-to-real transfer.

Results and Discussion

Policy Effectiveness

The paper demonstrates the effectiveness of ExBody in several key areas. The robot is capable of executing diverse movements with significant expressiveness, such as dancing and walking with varied gestures. The behavior of the robot in these instances showcases an unprecedented level of nuance in robotic movement that is both diverse and human-like.

Sim-to-Real Transfer

Crucial to the success of ExBody is its robust sim-to-real transfer capability. The control policy, once trained in simulation, exhibits high compliance and effectiveness when deployed in the real world, enabling the humanoid robot to interact dynamically with human partners and navigate various terrains with expressive motions.

Potential Implications and Future Directions

The implications of this research are significant for the future development of humanoid robots. By enabling robots to perform more expressive and varied movements, ExBody opens up new possibilities for human-robot interaction, entertainment, and assistive technologies. The approach also presents a scalable framework that could potentially be adapted for other robotic form factors.

However, the paper also recognizes the limitations inherent in the current implementation, such as the challenges related to motion retargeting and the risks associated with robot falls. Future work will need to address these challenges, further refining the control policy and exploring protective measures for the robots.

Conclusion

The presented work on ExBody represents a significant step forward in the quest for more expressive and human-like movement in humanoid robots. By leveraging extensive human motion capture data and employing a refined RL framework, the paper not only enhances the robots' movement capabilities but also enriches their ability to interact with the surrounding environment in a way that is both meaningful and engaging.