Programme

Monday April 4, 2022

17:00 – 17:10 Introduction (D. Karatzas) [slides] [video]

17:10 17:50 Common blocks for multi-modal systems (E. Valveny) [slides] [video]

BREAK

18:00 – 18:40 Reading in the wild (S. Garcia) [slides] [video]

18:40 19:00 Scene text for Fine Grained Image Classification (A. Mafla) [slides] [video]


Wednesday April 6, 2022

17:00 – 17:20 Cross-modal retrieval (A. Mafla) [slides] [video]

17:20 – 17:50 Scene text Visual Question Answering (A. Biten) [slides] [video]

BREAK

18:00 – 18:30 Document Visual Question Answering (R. Perez) [slides] [video]

18:30 – 19:00 Demo session (L. Gomez) [GitHub] [video]

Day 1 Video

Day 2 Video