CS294-190 -- Fa21

Week 12: Theory of RL

Core Readings

Chapters 5 and 9 of Alekh Agarwal, Nan Jiang, Sham M. Kakade, and Wen Sun, Reinforcement Learning: Theory and Algorithms. Manuscript in preparation.

Stephen Tu and Benjamin Recht, The gap between model-based and model-free methods on the linear quadratic regulator: An asymptotic viewpoint. COLT 2019.
Kefan Dong, Yuping Luo, and Tengyu Ma, On the expressivity of neural networks for deep reinforcement learning. ICML 2020.

Extended Readings

Page updated

Google Sites

Report abuse