Explaining Recurrent Attention Models
Explaining Recurrent Attention Models
joint work with Kausik Sivakumar and Harsh Goel
MNIST image reconstruction
(leftmost original MNIST image, followed by reconstructed output from different glimpses)
MNIST cluttered image reconstruction
(leftmost original MNIST image, second cluttered image, followed by reconstructed output from different glimpses of cluttered image)
Comparison of Information gain between HardAttReshaped (ours) and HardAtt. Our method performs better on MNIST. On cluttered MNIST, our method's performance decreases with more glimpses. This might be because of more noisy information used for reward shaping.