Links to slides will be posted on this page. Please fill this short form (2mins) if you are interested in being added to the mailing list of the course!
The plan is to spend one lecture on each topic (the below plan may change as the course progresses).
Note that the slides are intended as study material: clickable links in the slides will send you to more detailed references.
Part I: Introduction to Deep Learning:
Introduction and a brief history of Neural Networks. Overview of the course. (Slides) Video recordings (in Italian): part 1 - 6min and part 2 - 80min
Backpropagation, Stochastic Gradient Descent, Convergence. (Slides) (Video recording (in Italian) 100min)
Some very common Neural Network architectures and their motivations (Slides) (Video (Ita) 120min)
Neuromorphic Neural Networks (skipped)
Part II: Staples of classical Deep Learning theory
Generalization: some mathematical interpretations (Slides) ( Video (Ita) 110min)
PAC learning, VC dimension, and expressivity tests for DNN (Slides) (Video parte 2 (Ita) 50 min)
Introduction to Information Theory, and the Information Bottleneck Principle (Slides) (Video parte 1 (45 min), Parte 2 (35 min))
no class on Dec. 1st
Part III: Selected topics of research
Network pruning: the "Lottery ticket hypothesis", and sparsity (Slides) (Video in italiano (90 min))
no class on Dec. 8th
Hyperbolic Neural Networks (Slides)
Equivariant Neural Networks (Slides)
Persistence diagrams and Topological Data Analysis (guest lecture by Sara Scaramuccia) (Slides)
Final presentations by students:
Luca Falorsi: Renormalization Group and Restricted Boltzmann Machines
Alessio Oliviero: Physics Informed Neural Networks for Optimal Control (Slides)
Jacopo Ulivelli: Geometric interpretation of GANs and link with Optimal Transport (Link to exposed paper)
Andrea Pizzi: Graph Neural Networks generalized to simplicial complexes (Slides)
Lorenzo D'Arca: Teoremi di approssimazione in spazi funzionali (Slides)(Slides in English, paper1, paper2)
Some bibliography given before the course start, for the first 6 lectures (see precise references within each lecture's slides):
History began with AlexNet: A comprehensive survey on Deep Learning -- https://arxiv.org/pdf/1803.01164v2.pdf (useful for lectures 1 and 3)
Deep Learning Book -- https://www.deeplearningbook.org/ (useful for lectures 1-5, but especially for lectures 2 and 3)
Some famous Deep Learning architectures -- https://towardsdatascience.com/neural-network-architectures-156e5bad51ba
Foundations of Machine Learning -- https://cs.nyu.edu/~mohri/mlbook/ (useful for lectures 5-6)
The modern mathematics of deep learning -- https://arxiv.org/pdf/2105.04026.pdf