Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
97 tokens/sec
GPT-4o
53 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Leveraging Reward Gradients For Reinforcement Learning in Differentiable Physics Simulations (2203.02857v1)

Published 6 Mar 2022 in cs.LG, cs.RO, cs.SY, and eess.SY

Abstract: In recent years, fully differentiable rigid body physics simulators have been developed, which can be used to simulate a wide range of robotic systems. In the context of reinforcement learning for control, these simulators theoretically allow algorithms to be applied directly to analytic gradients of the reward function. However, to date, these gradients have proved extremely challenging to use, and are outclassed by algorithms using no gradient information at all. In this work we present a novel algorithm, cross entropy analytic policy gradients, that is able to leverage these gradients to outperform state of art deep reinforcement learning on a set of challenging nonlinear control problems.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (2)
  1. Sean Gillen (5 papers)
  2. Katie Byl (12 papers)
Citations (3)

Summary

We haven't generated a summary for this paper yet.