Learning High-level Representations from Demonstrations
Our Method after 0.9 million training steps
HDRL [Kulkarni et al. 2016] after 2.5 million training steps
DQN [Mnih et al. 2015] after 8 million training steps
Our Method after 2.5 million training steps
Ant Robot Learning with Discovered Subgoals