Nan Jiang
Assistant Professor
Computer Science
University of Illinois Urbana-Champaign
Topic
Notes
1 - overview & MDP planning
Slides 1
Note 1
2 - tabular learning
Note 2-1
Note 2-2
3 - state abstraction
Slides 3
Note 3
4 - fitted Q-iteration
Slides 4
Note 4
5 - importance sampling and policy gradient
Note 5-1
Note 5-2
6 - exploration
Slides 6
Note 6-1
Note 6-2
Note 6-3
7 - predictive state representation
Slides 7