HAAV: Hierarchical Aggregation of Augmented Views
for Image Captioning