Post date: Mar 20, 2017 10:57:3 AM
Lecture: Introduction to Reinforcement Learning
Reading: Slides by David Silver, Sutton and Barto Chapter 1
Demos: Helicopter, Atari, Pancakes, More
Lab: Q&A session for the projects.
Project: Baseline model due March 22. Implement a baseline model that generates random outputs. The baseline should read your train/dev/test data in minibatches and correctly calculate the average train/dev/test loss and a task-specific evaluation metric (e.g. accuracy). To complete the assignment, update your Google Doc, Sheet, and GitHub repo.
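A minimal sketch of what such a baseline might look like, assuming a classification task where each example is an (input, label) pair and the loss is cross-entropy. The function names (minibatches, random_baseline, evaluate), the data layout, and the choice of loss are illustrative assumptions, not requirements of the assignment; adapt them to your own task and data format.

```python
import math
import random


def minibatches(data, batch_size):
    """Yield consecutive slices of `data` with at most `batch_size` items."""
    for start in range(0, len(data), batch_size):
        yield data[start:start + batch_size]


def random_baseline(batch, num_classes, rng):
    """Return one random probability distribution over classes per example.

    The inputs are ignored on purpose: this is the "random outputs" baseline.
    """
    outputs = []
    for _ in batch:
        # Small offset keeps every probability strictly positive for the log.
        scores = [rng.random() + 1e-9 for _ in range(num_classes)]
        total = sum(scores)
        outputs.append([s / total for s in scores])
    return outputs


def evaluate(data, num_classes, batch_size=32, seed=0):
    """Return (average cross-entropy loss, accuracy) over all examples.

    Assumes `data` is a list of (input, label) pairs with integer labels
    in range(num_classes). This layout is hypothetical.
    """
    if not data:
        raise ValueError("cannot evaluate an empty split")
    rng = random.Random(seed)
    total_loss, correct, count = 0.0, 0, 0
    for batch in minibatches(data, batch_size):
        probs = random_baseline(batch, num_classes, rng)
        for (_, label), p in zip(batch, probs):
            total_loss += -math.log(p[label])
            predicted = max(range(num_classes), key=p.__getitem__)
            correct += int(predicted == label)
            count += 1
    # Divide by the number of examples, not the number of batches.
    return total_loss / count, correct / count
```

Loss and accuracy are accumulated per example and divided by the total example count, so a smaller final minibatch does not skew the average. Calling evaluate once per split (train, dev, test) gives the numbers to record; for a uniform-looking random baseline on a balanced task, accuracy should land near 1 / num_classes, which is a useful sanity check on the evaluation code.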