Dec 17, 2015
8:30am
8:40am-9:15am
9:25am-10:00am
10:00am-10:40am
10:40am-11:15am
11:20am-11:55am
12:00pm-1:00pm
1:00pm - 2:20pm
2:25pm-3:00pm
3:00pm-4:00pm
4:00pm-4:50pm
4:50pm-6:00pm
Opening
Invited talk: Sanja Fidler, U. of. Toronto
Towards understanding stories in movies
Invited talk: Tamara Berg, UNC Chapel Hill
Image Description Generation and Beyond...
Coffee Break - Posters
Invited talk: Devi Parikh, Virginia Tech
Visual Question Answering (VQA)
Invited talk: Svetlana Lazebnik, UIUC
Image Description: From Image-Sentence Embeddings to Region-Phrase Correspondence
Lunch break
Oral Session
Semantic Enrichment of Bag-of-Visual-Words
Anirudh Goyal, Shailesh Kumar, C.V Jawahar
extended abstract
A Multi-scale Multiple Instance Video Description Network,
Huijuan Xu,, Kate Saenko
extended abstract
Grounding of Textual Phrases in Images by Reconstruction
Anna Rohrbach, Marcus Rohrbach, Ronghang Hu, Trevor Darrell, Bernt Schiele
extended abstract
Relating Natural Language and Visual Recognition
Marcus Rohrbach, Jacob Andreas, Trevor Darrell, Lisa Anne Hendricks, Dan Klein, Ronghang Hu, Raymond Mooney. Anna Rohrbach, Kate Saenko, Bernt Schiele, Subhashini Venugopalan
extended abstract
Sherlock: Modeling Structured Knowledge in Images
Mohamed Elhoseiny, Scott Cohen, Walter Chang, Brian Price, Ahmed Elgammal
extended abstract
Joint Learning from Video and Caption
Tingran Wang, Nishant Shukla, Caiming Xiong
extended abstract (poster only)
Invited talk: Kate Saenko, UMass Lowell
Implicit and Explicit Attention in Neural Models of Language and Vision.
Posters Session - Coffee Break
Closing Keynote: Richard Socher, MetaMind
Deep Learning for Multimodal Text-Image Modeling
Panel Discussion
Panelist: Scott Cohen (Adobe), Svetlana Lazebnik(UIUC), Kate Saenko (UMass), Richard Socher (MetaMind)