Week 12: Theory of RL
Core Readings
Chapters 5 and 9 of Alekh Agarwal, Nan Jiang, Sham M. Kakade, and Wen Sun, Reinforcement Learning: Theory and Algorithms. Manuscript in preparation.
Stephen Tu and Benjamin Recht, The gap between model-based and model-free methods on the linear quadratic regulator: An asymptotic viewpoint. COLT 2019.
Kefan Dong, Yuping Luo, and Tengyu Ma, On the expressivity of neural networks for deep reinforcement learning. ICML 2020.
Extended Readings