Advanced Topics in Learning and Decision Making

Instructors:

Pieter Abbeel & Stuart Russell

Lectures:

Tue/Thu 2-3:30pm, via zoom

In-person or Virtual: This class will be offered virtually through zoom. Note that live attendance is expected as many of the lectures will be discussion oriented.

About:

This course is a research-oriented course which will cover advanced topics in learning and decision making. Unlike many other courses, this course is not about teaching you some fully-proven-out methods. Instead we will focus on areas where current methods are still falling short and further research is needed. This is also reflected in the course logistics, with great emphasis on live discussion, students presenting papers, homework that does not have a clear-cut unique solution but instead has you explore new ideas.

Office hours:

Pieter: Fridays 3-4pm

Stuart: Tuesdays 3:30-5pm

(see pinned piazza post for zoom links)

Communication:

primarily via our class piazza

Syllabus and Class Schedule
(subject to revision; weekly readings to be added):

Week 0
Introduction

8/26: Introduction [slides]

Week 1
Reward-Free Pre-Training and Exploration

Quiz 1: due 8/30
Reading List


8/31: Overview of Reward-Free Pre-Training and Exploration [slides][scribe]

9/2: Student Presentations [scribe]

Week 2
Self-Play and Curriculum

Quiz 2: due 9/6
Reading List

9/7: Self-play and the Alpha(Go)(Zero) series [slides][scribe]

9/9: Student Presentations [scribe]

Week 3 Hierarchical RL

Quiz 3: due 9/13
Reading List

9/14: An Overview on Hierarchical RL: Options, Hierarchical Abstractions of Machines, Feudal Learning [slides] [scribe]

9/16: Student Presentations [scribe]

Week 4
Meta-level Decision Making

Quiz 4: due 09/20
Reading List

9/21: Basic concepts of meta-level control and bounded optimality [slides][scribe]

9/23: Applications of meta-reasoning and bounded optimality [scribe]


Week 5

Human-in-the-Loop RL

Quiz 5: due 9/27
Reading List

9/28: Preference-based Learning [slides] [scribe]

9/30: Student Presentations [scribe]

[HW1 due] Fri 10/1 5pm [topics from Weeks 1-3]

Week 6

Inverse RL

Quiz 6: due 10/4
Reading List

10/5: An Overview of Inverse RL [slides] [scribe]

10/7: Deep Learning and Inverse RL [scribe]

Week 7

Assistance Games

Quiz 7: due 10/11
Reading List

10/12: Cooperative IRL [slides] [scribe]

10/14: Elaborations and alternatives [scribe]


[Project Proposal Due] Fri 10/15 5pm

Week 8

Multi-Task and Meta RL

Quiz 8: due 10/18
Reading List

10/19: Meta Reinforcement Learning and Few-Shot Imitation [slides] [scribe]

10/21: Multi-Task Reinforcement Learning [scribe]

Week 9

Multi-Agent RL

Quiz 9: due 10/25
Reading List

10/26: Distributed multi-body and multi-agent RL [slides][scribe]

10/28: Applications of multi-agent RL [scribe]


[HW2 due] Fri 10/29 at 5pm [topics from Weeks 4-7]

Week 10

Language and RL

Quiz 10 due 11/01
Reading List

11/2: Guest Lecturer: Karthik Narasimhan (Princeton) [slides][scribe]

11/4: Guest Lecturer: Jacob Andreas (MIT) [slides] [scribe]


Week 11

Model-based vs model-free

Quiz 11 due 11/08
Reading List

11/9: Model-based vs Model-free RL [slides] [scribe]

11/11: no lecture (university holiday)

Week 12

Theory of RL

Quiz 12 due 11/15
Reading List

11/16: Guest Lecturer: Sham Kakade (Harvard) [slides][scribe]

11/18: Model-based vs model-free RL (theory) [scribe]

Week 13

Applications

No Quiz due
Reading List

11/23: Guest Lecturer: Oriol Vinyals (Google Deepmind) [scribe]

11/25: no lecture (university holiday)

Week 14

Applications and Wrap-up

No Quiz due
Reading List

11/30: Guest Lecturer: Micah Carroll (Berkeley) and Dylan Hadfield-Menell (MIT) [slides][scribe]

12/2: Prospects for RL and AI [slides][scribe]