Thinking While Moving: Deep Reinforcement Learning with Concurrent Control (2004.06089v4)

Published 13 Apr 2020 in cs.LG, cs.AI, cs.RO, and stat.ML

Abstract: We study reinforcement learning in settings where sampling an action from the policy must be done concurrently with the time evolution of the controlled system, such as when a robot must decide on the next action while still performing the previous action. Much like a person or an animal, the robot must think and move at the same time, deciding on its next action before the previous one has completed. In order to develop an algorithmic framework for such concurrent control problems, we start with a continuous-time formulation of the Bellman equations, and then discretize them in a way that is aware of system delays. We instantiate this new class of approximate dynamic programming methods via a simple architectural extension to existing value-based deep reinforcement learning algorithms. We evaluate our methods on simulated benchmark tasks and a large-scale robotic grasping task where the robot must "think while moving".

Authors (7)
  1. Ted Xiao (40 papers)
  2. Eric Jang (19 papers)
  3. Dmitry Kalashnikov (34 papers)
  4. Sergey Levine (531 papers)
  5. Julian Ibarz (26 papers)
  6. Karol Hausman (56 papers)
  7. Alexander Herzog (32 papers)
Citations (36)
