Imitation Learning

Init

Pixelwise

Perceptual

Ours-no-grad

Ours

Target

* GIF might be large, please be patient while loading

We reconstruct the action sequence of the rod by watching a target video in this experiment, whose values are randomly initialized.

Only ours-no-grad and ours converge to a reasonable action sequence.