ai-class.com 2011 archive

10. Reinforcement Learning

Unit 10 01 Introduction.mp4

Discuss this question on aiqus. When posting use the tag 'unit10-1'

Unit 10 2 Successes.mp4

Discuss this question on aiqus. When posting use the tag 'unit10-2'

(DO NOT WATCH-SAME Video as above but INCOMPLETE-Watch preceding video instead. This video is linked just for archival purpose) Unit 10 02 Successes.mp4

Unit 10 03 Forms of Learning.mp4

Discuss this question on aiqus. When posting use the tag 'unit10-3'

Unit 10 04 Forms of Learning Question.mp4

Discuss this question on aiqus. When posting use the tag 'unit10-4'

Unit 10 05 Forms of Learning Answer.mp4

Discuss this question on aiqus. When posting use the tag 'unit10-4'

Unit 10 06 MDP Review.mp4

Discuss this question on aiqus. When posting use the tag 'unit10-5'

Unit 10 07 Solving a MDP.mp4

Note: argmax should be max in the formula for utility.

Discuss this question on aiqus. When posting use the tag 'unit10-6'

Unit 10 08 Agents of Reinforcement Learning.mp4

Discuss this question on aiqus. When posting use the tag 'unit10-7'

Unit 10 09 Passive vs Active.mp4

Discuss this question on aiqus. When posting use the tag 'unit10-8'

Unit 10 10 Passive Temporal Difference Learning.mp4

Discuss this question on aiqus. When posting use the tag 'unit10-9'

Unit 10 11 Passive Agent Results.mp4

Discuss this question on aiqus. When posting use the tag 'unit10-10'

Unit 10 12 Weaknesses Question.mp4

Discuss this question on aiqus. When posting use the tag 'unit10-11'

Unit 10 13 Weaknesses Answers.mp4

Discuss this question on aiqus. When posting use the tag 'unit10-11'

Unit 10 14 Active Reinforcement Learning.mp4

Discuss this question on aiqus. When posting use the tag 'unit10-12'

Unit 10 15 Greedy Agent Results.mp4

Discuss this question on aiqus. When posting use the tag 'unit10-13'

Unit 10 16 Balancing Policy.mp4

Discuss this question on aiqus. When posting use the tag 'unit10-14'

Unit 10 17 Errors in Utility Questions.mp4

Discuss this question on aiqus. When posting use the tag 'unit10-15'

Unit 10 18 Errors in Utility Answers.mp4

Discuss this question on aiqus. When posting use the tag 'unit10-15'

Unit 10 19 Exploration Agents.mp4

Discuss this question on aiqus. When posting use the tag 'unit10-16'

Unit 10 20 Exploration Agent Results.mp4

Discuss this question on aiqus. When posting use the tag 'unit10-17'

Unit 10 21 Q Learning 1.mp4

Discuss this question on aiqus. When posting use the tag 'unit10-18'

Unit 10 22 Q Learning 2.mp4

Important: Earlier clarification to use the formula on Wikipedia was not correct. Please use the formula as displayed in this video and for the homework. The s in R(s) is for the current state and not R(s') as in other formulations of Q-learning.

Discuss this question on aiqus. When posting use the tag 'unit10-19'

Unit 10 23 Pacman 1.mp4

Discuss this question on aiqus. When posting use the tag 'unit10-20'

Unit 10 24 Pacman 2.mp4

Discuss this question on aiqus. When posting use the tag 'unit10-21'

Unit 10 25 Conclusion.mp4

Discuss this question on aiqus. When posting use the tag 'unit10-22

Page updated

Google Sites

Report abuse