IFT 6135 - Representation Learning
Fall 2022, A Deep Learning Course offered by the Université de Montréal
IFT6135-A22 DIRO, Université de Montréal
Course Description:
This is a course on representation learning in general and deep learning in particular. Deep learning has recently been responsible for a large number of impressive empirical gains across a wide array of applications including most dramatically in object recognition and detection in images, natural language processing and speech recognition.
In this course we will explore both the fundamentals and recent advances in the area of deep learning. Our focus will be on neural network-type models including convolutional neural networks and recurrent neural networks such as LSTMs. We will also review recent work on attention mechanism and efforts to incorporate memory structures into neural network models. We will also consider some of the modern neural network-base generative models such as Generative Adversarial Networks and Variational Autoencoders.
Textbook for independent reading by students: Deep Learning by Ian Goodfellow, Yoshua Bengio and Aaron Courville (available for order at amazon or online for free here).
In particular topics covered by the course include the following:
Neural networks structure and training algorithms
Convolutional neural networks
Recurrent neural networks
Transformer Networks
Optimization methods for training neural networks
Regularization of neural networks
Normalization Methods (Batch Normalization, etc.)
Attention Mechanism
Applications to Computer Vision tasks (Image Segmentation, Bounding Box Detection, etc.)
Applications to Natural Language Processing tasks
Autoencoders
Variational Autoencoders
Generative Adversarial Networks (GANs)
Autoregressive Generative Models
Neural Flow-based Generative Models
Meta-Learning
Self-supervised Learning
Instruction style: The lecture time will be devoted to traditional lectures. Students are responsible for keeping up-to-date with the course material outside of class time, mainly by reading the textbook and other readings assigned for each lecture. The material to be reviewed for each class will be made available on this course website.
Evaluation:
The course will include a set of 3 assignments. Each assignment will consist of two components:
A problem set of theoretical questions
A practical component involving programming and running experiments.
See the Assignment Policy for more information.
In addition to these assignments there will be a final exam. Both the practical and theoretical component of the assignments is to be done individually.
The final grade will be composed as follows:
Assignments: 3 * 25% = 75% (For each assignment, 25% = 10% (theory) + 15% (practical)
Final Exam: 25%
Instructors:
Instructor: Prof. Aishwarya Agrawal <aishwarya.agrawal (at) mila.quebec>
Teacher Assistants:
Arian Khorasani <arian.khorasani (at) mila.quebec>
Nanda H Krishna <nanda.harishankar-krishna (at) mila.quebec>
Sarthak Mittal <mittalsa (at) mila.quebec>
Previous Exams:
Warning: material changes from year to year.