- The paper introduces a sensorimotor reinforcement learning framework using Gaussian processes that allows robots to learn physical collaboration strategies from raw sensorimotor data without high-level human modeling.
- Experiments with a PR2 robot on a joint plank/ball task show the framework efficiently learns adaptive collaboration strategies, significantly reducing human interaction forces and improving smoothness.
- This framework advances pHRI by challenging traditional leader-follower paradigms, enabling robots to learn intuitive joint tasks based purely on sensorimotor data for seamless human-robot integration.
Sensorimotor Reinforcement Learning for Human-Robot Interaction
The paper "A Sensorimotor Reinforcement Learning Framework for Physical Human-Robot Interaction" by Ghadirzadeh et al. introduces a novel approach to modeling and facilitating physical Human-Robot Interaction (pHRI). By combining a data-efficient reinforcement learning framework with sensorimotor learning, the authors address the inherent complexity of dynamic control tasks in which humans and robots collaborate toward joint objectives.
Key Methodological Contributions
The paper proposes a sensorimotor approach to reinforcement learning that operates directly on the robot's raw sensorimotor data, without requiring high-level symbolic task representations or explicit human models. The robot learns from its own experience to optimize interaction outcomes, using Gaussian processes (GPs) both for forward modeling and for learning the action-value (Q-)function. Because the GP models are probabilistic, prediction uncertainty can be handled explicitly via Bayesian optimization, enabling well-informed action selection despite unpredictable human behavior.
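To make the idea concrete, the pattern of a GP-approximated Q-function with uncertainty-aware action selection can be sketched as follows. This is a minimal illustration using scikit-learn, not the authors' implementation: the toy data, kernel choice, and the UCB-style acquisition rule are all assumptions made for the sketch.

```python
import numpy as np
from sklearn.gaussian_process import GaussianProcessRegressor
from sklearn.gaussian_process.kernels import RBF, ConstantKernel

# Hypothetical toy data: (state, action) pairs with observed returns.
rng = np.random.default_rng(0)
X = rng.uniform(-1, 1, size=(50, 3))   # columns: 2 state dims + 1 action dim
y = -np.sum(X**2, axis=1)              # stand-in return signal

# A GP over the joint (state, action) space approximates the Q-function.
gp_q = GaussianProcessRegressor(kernel=ConstantKernel() * RBF(),
                                normalize_y=True)
gp_q.fit(X, y)

def select_action(state, candidate_actions, kappa=1.0):
    """UCB-style selection: trade predicted value against GP uncertainty."""
    queries = np.hstack([np.tile(state, (len(candidate_actions), 1)),
                         candidate_actions])
    mean, std = gp_q.predict(queries, return_std=True)
    return candidate_actions[np.argmax(mean + kappa * std)]

state = np.array([0.2, -0.1])
actions = np.linspace(-1, 1, 21).reshape(-1, 1)
best = select_action(state, actions)
```

The key property the paper relies on is visible here: the GP returns both a mean and a standard deviation for each query, so action selection can favor actions that are either predicted to be good or insufficiently explored.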
Experimental Setup and Results
The experimental section focuses on a scenario in which a PR2 robot and a human collaborator jointly control the position of a ball on a plank they hold together. The robot uses measurements from vision and force/torque sensors to participate actively in positioning the ball, sharing control equally with the human. The results demonstrate that the proposed framework learns collaboration strategies efficiently: it adapts quickly from a limited number of samples and significantly reduces the interaction force experienced by the human, yielding smoother collaboration.
The authors report accurate predictions from their forward model, especially in dimensions such as the end-effector coordinates and the ball's position. The model captures sensorimotor contingencies, suggesting that the robot can implicitly model aspects of human behavior by predicting interaction forces. Additionally, the approach allows the Q-function to be updated incrementally using data simulated from the forward model, improving data efficiency.
Implications and Further Research Directions
From a theoretical perspective, this work advances the understanding of role distribution in pHRI scenarios, challenging traditional leader-follower paradigms and advocating for equal role sharing facilitated by reinforcement learning. The framework's adaptability and probabilistic components may enhance pHRI applications in areas demanding tight human-robot collaboration, such as manufacturing or remote medical operations.
Practically, this research paves the way toward more intuitive and seamless human-robot interactions. The ability to model joint tasks based solely on sensorimotor data can aid in the design of robots that integrate naturally into human workflows and adjust their actions dynamically based on human input and emergent conditions.
Looking ahead, integrating richer sensory data and refining the sensorimotor mapping could further enhance the robot's predictive capabilities and interpersonal coordination. Exploring more complex interactions and environments could also deepen the understanding of collaboration dynamics and test the robustness of this framework across varied pHRI scenarios.
In conclusion, Ghadirzadeh et al.'s paper provides significant insight into what sensorimotor reinforcement learning, free of explicit human modeling, can achieve in the field of pHRI. As the research community continues to explore such intelligent systems, the implications for human-robot collaboration continue to expand, fostering environments where synergistic interactions become commonplace.