
End-to-end Reinforcement Learning of Robotic Manipulation with Robust Keypoints Representation (2202.06027v1)

Published 12 Feb 2022 in cs.RO and cs.CV

Abstract: We present an end-to-end Reinforcement Learning (RL) framework for robotic manipulation tasks, using a robust and efficient keypoints representation. The proposed method learns keypoints from camera images as the state representation, through a self-supervised autoencoder architecture. The keypoints encode the geometric information, as well as the relationship between the tool and the target, in a compact representation to ensure efficient and robust learning. After the keypoints are learned, the RL step learns the robot motion from the extracted keypoints state representation. Both the keypoints learning and the RL training are done entirely in simulation. We demonstrate the effectiveness of the proposed method on robotic manipulation tasks, including grasping and pushing, in different scenarios. We also investigate the generalization capability of the trained model. In addition to the robust keypoints representation, we further apply domain randomization and adversarial training examples to achieve zero-shot sim-to-real transfer in real-world robotic manipulation tasks.
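
The abstract does not say how keypoints are read out of the autoencoder, so the following is only an illustrative sketch of one common readout, a spatial soft-argmax, and not necessarily the authors' method. The function name `spatial_soft_argmax`, the `temperature` parameter, and the `(K, H, W)` feature-map layout are assumptions introduced here for illustration.

```python
import numpy as np


def spatial_soft_argmax(feature_maps, temperature=1.0):
    """Turn K feature maps of shape (K, H, W) into K (x, y) keypoints.

    Hypothetical helper: coordinates are normalized to [-1, 1], with x
    running along the width and y along the height of each map.
    """
    k, h, w = feature_maps.shape
    flat = feature_maps.reshape(k, h * w) / temperature
    flat = flat - flat.max(axis=1, keepdims=True)  # subtract max for numerical stability
    probs = np.exp(flat)
    probs /= probs.sum(axis=1, keepdims=True)      # softmax over all pixels of each map
    probs = probs.reshape(k, h, w)

    xs = np.linspace(-1.0, 1.0, w)
    ys = np.linspace(-1.0, 1.0, h)
    # Expected coordinate under each map's probability distribution.
    x = (probs.sum(axis=1) * xs).sum(axis=1)  # marginal over rows, shape (K, W)
    y = (probs.sum(axis=2) * ys).sum(axis=1)  # marginal over columns, shape (K, H)
    return np.stack([x, y], axis=1)           # shape (K, 2)


# Usage: flatten the keypoints into a compact state vector for an RL policy.
maps = np.random.default_rng(0).normal(size=(4, 16, 16))
state = spatial_soft_argmax(maps).reshape(-1)  # shape (8,)
```

Under these assumptions, a policy would consume the short `state` vector instead of raw pixels, which is the kind of compact representation the abstract describes.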

Authors (5)
  1. Tianying Wang (15 papers)
  2. En Yen Puang (6 papers)
  3. Marcus Lee (3 papers)
  4. Yan Wu (109 papers)
  5. Wei Jing (33 papers)
Citations (5)
