Deep Learning for Computer Vision
Deep learning technologies are at the core of the current revolution in artificial intelligence for multimedia data analysis. Architectures such as Convolutional Neural Networks (CNNs) and more recently the Transformer based on Attention mechanisms are solving many problem in the Computer Vision field. This course will cover the principles of deep learning applied to different Computer Vision applications as well as covering the basics of Multimodal Learning.
The course webpage can be found: [2019], [2018], [2017], [2016]
DLCV Lectures (2018)
Lecture 1.1: Neural Network Zoo
Learning Paradigms
Instructor: Xavier Giró-i-Nieto
Lecture 1.2: Image Classification
Computer Vision
Instructor: Kevin McGuinness
Lecture 1.3: Image Retrieval
Computer Vision
Instructor: Eva Mohedano
Lecture 1.4: Visual Localization
Computer Vision
Instructor: Laura Leal-Taixé
Lecture 1.5: Video Object Segmentation
Computer Vision
Instructor: Laura Leal Taixé
Lecture 2.1: Object Detection
Computer Vision
Instructor: Miriam Bellver
Lecture 2.2: Face Detection & Recognition
Computer Vision
Instructor: Elisa Sayrol
Lecture 2.3: Semantic Segmentation
Computer Vision
Instructor: Miriam Bellver
Lecture 2.4: Instance Segmentation
Computer Vision
Instructor: Miriam Bellver
Lecture 2.5: Medical Imaging
Computer Vision
Instructor: Elisa Sayrol
Lecture 3.1 & 3.2: Video Analysis
Computer Vision
Instructor: Victor Campos
Lecture 3.3: Object Tracking
Computer Vision
Instructor: Laura Leal Taixé
Lecture 3.4: Interpretability
Computer Vision
Instructor: Eva Mohedano
Lecture 3.5: Saliency Prediction
Computer Vision
Instructor: Kevin McGuinness
Lecture 3.6: Set Learning
Computer Vision
Instructor: Laura Leal Taixé
Lecture 4.1: 3D Analysis
Computer Vision
Instructor: Javier Ruiz
Lecture 4.2: 3D Reconstruction
Computer Vision
Instructor: Eduard Ramon
Lecture 4.3: Generative models
Generative models
Instructor: Kevin McGuinness
Lecture 4.4: Language and Vision
Multimodal Learning
Instructor: Xavier Giró-i-Nieto
Lecture 4.5: Audio and Vision
Multimodal Learning
Instructor: Eva Mohedano
Lecture 4.6: Speech and Vision
Multimodal Learning
Instructor: Xavier Giró-i-Nieto