Our workshop will feature these vision & language challenges
https://sites.google.com/site/describingmovies
https://www.robots.ox.ac.uk/~vgg/research/condensed-movies/challenge.html
https://value-benchmark.github.io
https://ltvrr.github.io/challenge