The Y-axis is the percentage of different rooms that the robot arrives in.
The open circle denotes the goal, and line with a small solid circle at the end indicates a trajectory.
The line with a small solid circle at the end indicates the trajectory.
Push blue circle to [6.7, 8.0].
Push green square to [5.3, 3.9].
Push blue trangle to [6.3, 5,8].
Push red square to [8.7, 9.2].
Swimmer (Left: goal behavior; Middel: GPIM behavior; Right: stacked view of goal and GPIM behaviors.)
HalfCheetah(Left: goal behavior; Middel: GPIM behavior; Right: stacked view of goal and GPIM behaviors.)
Robot(Left: goal behavior; Middel: GPIM behavior; Right: stacked view of goal and GPIM behaviors.)
Montezuma Revenge(Left: goal behavior; Middel: GPIM behavior; Right: stacked view of goal and GPIM behaviors.)
Seaquest(Left: goal behavior; Middel: GPIM behavior; Right: stacked view of goal and GPIM behaviors.)
Berzerk(Left: goal behavior; Middel: GPIM behavior; Right: stacked view of goal and GPIM behaviors.)