Each frame of these videos corresponds to solving a model-based control optimization problem of the form
Each frame then shows the following information:
We start with the cheetah.run task from the DeepMind control suite with a frame skip of 4 and show the videos of 10 random evaluation episodes.
cheetah-cem.mp4
cheetah-dcem.mp4
cheetah-dcem-ppo.mp4

Next we look at the walker.walk task with a frame skip of 2 and show the videos of 10 random evaluation episodes.
walker-cem.mp4
walker-dcem.mp4
walker-dcem-ppo.mp4
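Both tasks above are run with a frame skip (4 for cheetah.run, 2 for walker.walk). The page does not define the term, so the following is only a minimal sketch under the common convention that a frame skip of k repeats each chosen action for k simulator steps and sums the rewards. `ToyEnv` and `step_with_frame_skip` are hypothetical names made up for illustration; they are not part of the DeepMind control suite or of the code behind these videos.

```python
from dataclasses import dataclass


@dataclass
class ToyEnv:
    """Hypothetical stand-in environment whose state is a scalar position."""
    position: float = 0.0

    def step(self, action: float) -> float:
        # Move by the action and return a per-step reward (here, the action itself).
        self.position += action
        return action


def step_with_frame_skip(env: ToyEnv, action: float, frame_skip: int) -> float:
    """Apply the same action for `frame_skip` simulator steps and sum the rewards.

    Assumption: this is the usual "action repeat" reading of frame skip.
    """
    total_reward = 0.0
    for _ in range(frame_skip):
        total_reward += env.step(action)
    return total_reward


env = ToyEnv()
# A frame skip of 4, as in the cheetah.run setting described above.
reward = step_with_frame_skip(env, action=0.5, frame_skip=4)
```

Under this reading, a larger frame skip means the controller solves its optimization problem less often per unit of simulated time, at the cost of coarser control.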