Towards AGI:
Scaling, Alignment & Emergent Behaviors in Neural Nets
IFT 6760A Winter 2023, Université de Montréal / Mila - Quebec AI Institute
see the links
This course is in a hybrid format (both in person at Mila and online)
Call-in link (4:30pm-6:30pm, EST Mon & Thu) Discussions: this channel (AGI discord) Video/slides: schedule
Course Description
This seminar-style course will focus on recent advances in the rapidly developing area of "foundation models", i.e. large-scale neural network models (e.g., GPT-3, CLIP, DALL-e, etc) pretrained on very large, diverse datasets. Such models often demonstrate significant improvement in their few-shot generalization abilities, as compared to their smaller-scale counterparts, across a wide range of downstream tasks - what one could call a "transformation of quantity into quality" or an "emergent behavior". This is an important step towards a long-standing objective of achieving Artificial General Intelligence (AGI). By AGI here we mean literally a "general", i.e. broad, versatile AI capable of quickly adapting to a wide range of situations and tasks, both novel and those encountered before - i.e. achieving a good stability (memory) vs plasticity (adaptation) trade-off, using the continual learning terminology. In this course, we will survey most recent advances in large-scale pretrained models, focusing specifically on empirical scaling laws of such systems' performance, with increasing compute, model size, and pretraining data (power laws, phase transitions). We will also explore the trade-off between the increasing AI capabilities and AI safety/alignment with human values, considering a range of evaluation metrics beyond the predictive performance. Finally, we will touch upon several related fields, including transfer-, continual- and meta-learning, as well as out-of-distribution generalization, robustness and invariant/causal predictive modeling.
In this course, besides several introductory and invited lectures by the instructor and guest speakers, respectively, we will survey and present recent papers listed in the "Topics & Papers" section from the menu on top of this page. If you have any suggestions about the papers to review, please contact the instructor and/or the TAs.
Course Outline
Class info
Instructor: Irina Rish (irina.rish at mila.quebec)
Course info on AGI discord
UdeM DIRO courses: Winter 2023
This course: IFT 676A Winter 2023
Neural Scaling Laws workshop series
Links
Dates: Jan-Apr 2023
Time: Mon 16:30 - 18:30, Thu 16:30 - 18:30
Dates: Jan 9 - Apr 13 (projects/presentations till Apr 30)
Mon Jan 9 - Feb 20 , Thu Jan 12 - Feb 23
break Feb 24 - Mar 5
Mon Mar 6 - Apr 3, Thu Mar 9 - Apr 13
Location: Auditorium 2 at Mila, 6650, boul. St-Urbain, Montréal
Streaming: call-in link
Evaluation Criteria
Paper presentations: 40%
Class project (report + poster presentation): 50%
Class participation: asking questions, participating in discussions (on slack/in class): 10%
Note: due to time zone differences, it may be difficult for all students to join all classes in person; the classes will be recorded, and questions regarding the papers to be discussed can be submitted on the course slack.