Deep learning technologies are at the core of the current revolution in artificial intelligence for multimedia data analysis. Architectures such as Convolutional Neural Networks (CNNs) and more recently the Transformer based on Attention mechanisms are solving many problem in the Computer Vision field. This course will cover the principles of deep learning applied to different Computer Vision applications as well as covering the basics of Multimodal Learning.
The course webpage can be found: [2019], [2018], [2017], [2016]
Learning Paradigms
Instructor: Xavier Giró-i-Nieto
Computer Vision
Instructor: Kevin McGuinness
Computer Vision
Instructor: Eva Mohedano
Computer Vision
Instructor: Laura Leal-Taixé
Computer Vision
Instructor: Laura Leal Taixé
Computer Vision
Instructor: Miriam Bellver
Computer Vision
Instructor: Elisa Sayrol
Computer Vision
Instructor: Miriam Bellver
Computer Vision
Instructor: Miriam Bellver
Computer Vision
Instructor: Elisa Sayrol
Computer Vision
Instructor: Victor Campos
Computer Vision
Instructor: Laura Leal Taixé
Computer Vision
Instructor: Eva Mohedano
Computer Vision
Instructor: Kevin McGuinness
Computer Vision
Instructor: Laura Leal Taixé
Computer Vision
Instructor: Javier Ruiz
Computer Vision
Instructor: Eduard Ramon
Generative models
Instructor: Kevin McGuinness
Multimodal Learning
Instructor: Xavier Giró-i-Nieto
Multimodal Learning
Instructor: Eva Mohedano
Multimodal Learning
Instructor: Xavier Giró-i-Nieto