Imitation Learning
Init
Pixelwise
Perceptual
Ours-no-grad
Ours
Target
* GIF might be large, please be patient while loading
We reconstruct the action sequence of the rod by watching a target video in this experiment, whose values are randomly initialized.
Only ours-no-grad and ours converge to a reasonable action sequence.