Post date: Apr 19, 2017 1:54:12 PM
Lecture: Policy Gradient Methods
Reading: Slides by David Silver, Sutton and Barto Chapter 13 (skipping TD(λ))
Lab: No lab this week, there will be a QA section for project related questions, please sign up following the email directions.
Project: (due Apr 28) Project report and presentation due. Presentations to take place in random order starting May 2nd.
Project related assignments will only be graded when they are 100% complete. Late completion will be graded based on the following formula: (1/2)+(1/2)^(d/3+1) where d is the number of days late.