This is a reading list intended to give a flavor of the types of problems in deep learning and AI that practicing (and erstwhile) physicists have worked on.
Papers bracketed by ** are required reading, and appear in the schedule.
Please note that this is a selective literature review. A large amount of high-quality science is not listed but can be found by following the citations in the papers below.
Part 1: Deep Learning Primer
Concepts in Deep Learning
Inspiration:
A call to arms for physicists to tackle just one of many existential risks to humanity
Review papers
Philosophy
Overparameterization and Double Descent
Feature vs Lazy Learning (how to scale deep nets)
Yang, Simon, & Bernstein, A spectral conditioning for feature learning (2024)
Karkada, The lazy (NTK) and rich (μP) regimes: a gentle tutorial (2024)
Pehlevan and Bordelon, Lecture Notes on Infinite-Width Limits, Sec. 4
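The lazy/rich distinction above can be made concrete with a toy sketch (my illustration, not taken from any listed paper): a random width-n readout under NTK-style normalization, f = w·x/√n, stays O(1) as width grows, while the μP-style normalization f = w·x/n shrinks like 1/√n, which is what forces the weights to move (learn features) during training.

```python
import numpy as np

# Hedged illustration of output scaling at initialization:
#   NTK (lazy) normalization:  f = (w @ x) / sqrt(n)  -> O(1) as n grows
#   muP (rich) normalization:  f = (w @ x) / n        -> O(1/sqrt(n))
rng = np.random.default_rng(0)
for n in (256, 4096, 65536):
    x = rng.standard_normal(n)   # hidden features, O(1) entries
    w = rng.standard_normal(n)   # random readout weights, O(1) entries
    ntk_out = abs(w @ x) / np.sqrt(n)  # stays order one
    mup_out = abs(w @ x) / n           # vanishes with width
    print(f"n={n:6d}  NTK |f|={ntk_out:.3f}  muP |f|={mup_out:.5f}")
```

The vanishing μP output is the point: since the forward pass contributes nothing at init, gradient updates must move the weights by an O(1) amount, keeping feature learning alive at infinite width.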
Deep Learning Architectures
For the most part, neural network architectures are described well in theory papers. This one is devoted exclusively to describing transformers, and is useful once you get past the pseudocode notation.
Statistical Mechanics of Learning
Language modeling
Signal Propagation
Signal prop in feedforward MLPs + architectural variants
RNNs
Schmidhuber, Sepp Hochreiter's Fundamental Deep Learning Problem (1991)
Pascanu, Mikolov, Bengio, On the difficulty of training RNNs
Molgedey et al., Suppressing Chaos in Neural Networks by Noise (1992)
Transformers
Dinan et al., Effective Theory of Transformers at Initialization (2023)
Cowsik et al., Geometric Dynamics of Signal Propagation Predict Trainability of Transformers (2024)
Part 2: LLMs
Scaling Laws
Empirical
**Kaplan et al., Scaling Laws for Neural Language Models (2020)**
Henighan et al., Scaling Laws for Autoregressive Generative Modeling (2020)
**Hoffmann et al., Training Compute-Optimal Large Language Models (Chinchilla) (2022)**
OpenAI, GPT-4 technical report
Hernandez et al. (Anthropic), Scaling Laws and Interpretability of Learning from Repeated Data (2022)
Theory
Bordelon et al. A Dynamical Model of Neural Scaling Laws (2024)
**Maloney et al., A Solvable Model of Neural Scaling Laws (2022)**
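As background for the papers above: the empirical scaling-law fits take a simple parametric form. A minimal sketch of the Chinchilla-style loss surface, using the fitted constants reported by Hoffmann et al. (E ≈ 1.69, A ≈ 406.4, B ≈ 410.7, α ≈ 0.34, β ≈ 0.28; the model sizes below are hypothetical examples):

```python
# Chinchilla-style parametric loss: L(N, D) = E + A/N^alpha + B/D^beta,
# where N = parameter count and D = training tokens.
# Constants are the fit reported by Hoffmann et al. (2022).
E, A, B, alpha, beta = 1.69, 406.4, 410.7, 0.34, 0.28

def loss(N, D):
    """Predicted pretraining loss for N parameters trained on D tokens."""
    return E + A / N**alpha + B / D**beta

# Scaling up model and data together lowers the predicted loss,
# approaching the irreducible term E from above.
print(loss(1e9, 2e10))   # ~1B params, ~20B tokens
print(loss(7e10, 1.4e12))  # ~70B params, ~1.4T tokens (Chinchilla-scale)
```

The fact that compute-optimality requires growing N and D together (roughly in proportion) is the central claim of the Chinchilla paper, and the solvable models in the theory papers aim to reproduce exactly these power-law exponents.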
In-Context Learning
**Brown et al. (OpenAI), Language Models are Few-Shot Learners (2020)** (GPT-3 paper)
Lu et al., Asymptotic theory of in-context learning by linear attention (2024)
Emergence of Capabilities
Michaud et al., The quantization model of neural scaling (2024)
Nam et al., An exactly solvable model for emergence and scaling laws (2024)
Hidden Capabilities
Grokking
**Power et al., Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets (2022)**
**Varma et al., Explaining Grokking Through Circuit Efficiency (2023)**
Reinforcement Learning from Human Feedback (RLHF)
Casper, Davies et al., Open Problems and Fundamental Limitations of RLHF
OpenAI, Training language models to follow instructions with human feedback
Mechanistic Interpretability