In-Simulation Experiments

This section compares our synchronous and asynchronous implementations of ME-PPO in four MuJoCo environments. The wall-clock times corresponding to the checkpoint time-steps are reported in the table below each video.

HalfCheetah

halfcheetah.mp4

Ant

ant.mp4

Hopper

hopper.mp4

Walker2D

walker2d.mp4