Search this site
Embedded Files
Skip to main content
Skip to navigation
Relative Entropy Regularized Policy Iteration
Relative Entropy Regularized Policy Iteration
Videos on solving tasks from:
1- Real Robot Using Raw Pixels
2- Parkour Suite
3- DeepMind Control suite
4- OpenAI Gym
In order to solve all these tasks we use:
1- One single hyperparameter set
2- Limited compute: 1 GPU for learning and 1 CPU/Robot for data generation (1 actor).
Task specifications:
Action dimensions range: 1 (cartpole) to 56 (CMU humanoid).
State dimensions range: 5 (cartpole) to 534 (Parkour humanoid).
Reward range: 50 (Parkour 2d) to 400,000 (OpenAI Gym humanoid-standup).
Real Robot Learning From Raw Pixels:
Setup: 1 robot for generating data,
5 action dimensions and pixels / proprioception as state
Successfully learned from scratch within 100 hours of interaction to reliably lift the object using raw pixels
MPO3_med.mp4
Parkour Walker:
Setup: 1 GPU for learning and 1 CPU (actor) for generating data
6 action dimensions and 120 state dimensions
Less than one day of training
parkour2dbest.mp4
Parkour Humanoid Gaps:
Setup: 1 GPU for learning and 1 CPU (actor) for generating data
22 action dimensions and 539 state dimensions
Less than two days of training
rtm_gaps.mp4
Parkour Humanoid Walls:
Setup: 1 GPU for learning and 1 CPU (actor) for generating data
22 action dimensions and 539 state dimensions
Less than two days of training
rtm_walls.mp4
rtm_walls2.mp4
Humanoid CMU-Stand from DM-Control Suite:
Setup: 1 GPU for learning and 1 CPU (actor) for generating data
56 action dimensions
policy_3060105_cam_0_video_30400.mp4
Other Control Suite Tasks
humanoid_stand.mp4
Humanoid-Stand
humanoid_run.mp4
Humanoid-Run
acrobat.mp4
Double Inverted Pendulum
policy_3645919_cam_0_video_71000.mp4
CartPole- Two Poles
rtm_three_poles3.mp4
CartPole- Three Poles
rtm_bringball.mp4
Manipulator
cheetah.mp4
Cheetah
fastest_walker.mp4
Walker
policy_1758926_cam_0_video_18600.mp4
Swimmer- 15 Links
policy_720317_cam_0_video_13200.mp4
Finger-Turn
policy_336392_cam_0_video_5800.mp4
Finger-Spin
policy_319663_cam_0_video_5600.mp4
Pendulum
policy_2061996_cam_0_video_26600.mp4
Fish
policy_732863_cam_0_video_8800.mp4
Swimmer- 6 Links
hopper4.mp4
Hopper
policy_302064_cam_0_video_6200.mp4
Reacher
OpenAI Gym Tasks
stand_up_gym.mp4
Humanoid Stand
humanoid_run_gym.mp4
Humanoid Run
Ant_Gym.mp4
Ant
cheetah_gym.mp4
Cheetah
hopper_gym.mp4
Hopper
walker2d_gym.mp4
Walker2d
Google Sites
Report abuse
Google Sites
Report abuse