Goal-oriented Trajectories for Efficient Exploration
Exploration (episode 300)
DQN with epsilon-greedy exploration
DQN with Q-map exploration
First flag
DQN with epsilon-greedy exploration
DQN with Q-map exploration
Highest score (with flag)
DQN with epsilon-greedy exploration
DQN with Q-map exploration
Results
Random walk (red) and Q-map walk (green)
DQN with epsilon-greedy (red) and DQN with Q-map (green)
Performance comparison