The conducted simulation studies of the Ball in a Cup experiments. The
red ball is simulated using MuJoCo while the
yellow shows the planned optimal ball trajectory using the internal model. The green and red frames, highlight whether the transfer was successful or not. Each roll-out shows the optimal trajectory of a different reinforcement learning seed using the identical model. The data acquisition is identical to the real robot experiments. In simulation we do not average across ball-trajectories as they are mostly deterministic in simulation.