White and black signify high and low attention values, respectively. Attention is correctly paid to the agent and/or target in each domain, while distractors are suppressed. For JacoReach, attention is paid to every other link of the Kinova arm. As the system is constrained, the state of every link can be inferred by attending to alternating links. For Walker2D, attention is dynamic in object space and varies based on the state and stability of the walker. For the extrapolation domains with 4 or 8 additional distractors, APRiL's attention generalises favourably, suppressing the additional distractors, and the resulting policies perform well.