03.21 Lec 13

Post date: Mar 20, 2017 10:57:3 AM

Lecture: Introduction to Reinforcement Learning

Reading: Slides by David Silver, Sutton and Barto Chapter 1

Demos: Helicopter, Atari, Pancakes, More

Lab: QA session for the projects.

Project: Baseline model due March 22. Implement a baseline model that generates random outputs. Your baseline model should be able to minibatch and read your train/dev/test data, calculate the average train/dev/test loss and task specific evaluation metric (e.g. accuracy) correctly. Update your Google Doc, Sheet and GitHub repo to complete the assignment.

Page updated

Report abuse