Formulated as an MDP, the task is to guard danger zones using cameras (O) so that if an intruder (▶) moves to a danger zone, at least one camera is pointing at that location. The episode is finished after 1000 steps. The initial grids of cameras and intruders are highlighted with the same color code [code]