Workshop  Schedule
July 28th, 2023

Registration Schedule Workshop Series   Venue   References  




08:30 - 09:00 Arrival - coffee/tea


09:30 - 10:00 Irina Rish (University of Montreal/Mila):  An Overview video


10:00 - 10:30 Pascal Jr. Tikeng Notsawo (University of Montreal/Mila): Is grokking predictable? (remote talk)  video
paper: Predicting Grokking Long Before it Happens: A look into the loss landscape of models which grok


10:30 - 11:00 Eric Michaud (MIT): The Quantization Model of Neural Scaling  (remote talk) video

paper: The Quantization Model of Neural Scaling Interview with Eric AI Alignment forum


11:00 - 11:30 Elvis Dohmatob (Meta AI): Three Theoretical Insights for Robustness: The Bad, The Bad, and The Ugly! (remote talk) video


11:30 - 11:40  coffee break


11:40  - 12:10 Neel Nanda (Google DeepMind): Progress measures for grokking via mechanistic interpretability (remote talk) video


12:10 - 12:30 discussion


12:30 - 02:00 lunch (on your own)


02:15 - 02:45 Andrey  Gromov (University of Maryland): A solvable model for grokking modular arithmetic (remote talk) video

paper: Grokking modular arithmetic 


02:50 - 03:20 Rylan  Schaeffer (Stanford): Are emergent abilities of Large Language Models a mirage? video

paper: Are Emergent Abilities of Large Language Models a Mirage? 


03:20 - 03:40          discussion


03:40 - 04:00 coffee break


04:00 - 04:30 Ziming Liu (MIT/IAIFI): Intelligence from hunger video


04:30 - 05:00 Hugo  Cui (EPFL): Error scaling laws in kernel learning video


05:00 - 05:20 Kshitij Gupta  (University of Montreal/MilaAGI Collective video


05:20 - 05:40 discussion