Learning Fast Adaptation with Meta Strategy Optimization (1909.12995v2)

Published 28 Sep 2019 in cs.RO and cs.LG

Abstract: The ability to walk in new scenarios is a key milestone on the path toward real-world applications of legged robots. In this work, we introduce Meta Strategy Optimization, a meta-learning algorithm for training policies with latent variable inputs that can quickly adapt to new scenarios with a handful of trials in the target environment. The key idea behind MSO is to expose the same adaptation process, Strategy Optimization (SO), to both the training and testing phases. This allows MSO to effectively learn locomotion skills as well as a latent space that is suitable for fast adaptation. We evaluate our method on a real quadruped robot and demonstrate successful adaptation in various scenarios, including sim-to-real transfer, walking with a weakened motor, or climbing up a slope. Furthermore, we quantitatively analyze the generalization capability of the trained policy in simulated environments. Both real and simulated experiments show that our method outperforms previous methods in adaptation to novel tasks.

Authors (5)

Wenhao Yu (139 papers)
Jie Tan (85 papers)
Yunfei Bai (21 papers)
Erwin Coumans (17 papers)
Sehoon Ha (60 papers)

Citations (87)

View on Semantic Scholar

Summary

We haven't generated a summary for this paper yet.

Summarize Now

YouTube

Show All Videos

Learning Fast Adaptation with Meta Strategy Optimization (1909.12995v2)

Summary

Related Papers

YouTube