Discuss this question on aiqus. When posting use the tag 'unit10-1'
Discuss this question on aiqus. When posting use the tag 'unit10-2'
Unit 10 03 Forms of Learning.mp4
Discuss this question on aiqus. When posting use the tag 'unit10-3'
Unit 10 04 Forms of Learning Question.mp4
Discuss this question on aiqus. When posting use the tag 'unit10-4'
Unit 10 05 Forms of Learning Answer.mp4
Discuss this question on aiqus. When posting use the tag 'unit10-4'
Discuss this question on aiqus. When posting use the tag 'unit10-5'
Note: argmax should be max in the formula for utility.
Discuss this question on aiqus. When posting use the tag 'unit10-6'
Unit 10 08 Agents of Reinforcement Learning.mp4
Discuss this question on aiqus. When posting use the tag 'unit10-7'
Unit 10 09 Passive vs Active.mp4
Discuss this question on aiqus. When posting use the tag 'unit10-8'
Unit 10 10 Passive Temporal Difference Learning.mp4
Discuss this question on aiqus. When posting use the tag 'unit10-9'
Unit 10 11 Passive Agent Results.mp4
Discuss this question on aiqus. When posting use the tag 'unit10-10'
Unit 10 12 Weaknesses Question.mp4
Discuss this question on aiqus. When posting use the tag 'unit10-11'
Unit 10 13 Weaknesses Answers.mp4
Discuss this question on aiqus. When posting use the tag 'unit10-11'
Unit 10 14 Active Reinforcement Learning.mp4
Discuss this question on aiqus. When posting use the tag 'unit10-12'
Unit 10 15 Greedy Agent Results.mp4
Discuss this question on aiqus. When posting use the tag 'unit10-13'
Unit 10 16 Balancing Policy.mp4
Discuss this question on aiqus. When posting use the tag 'unit10-14'
Unit 10 17 Errors in Utility Questions.mp4
Discuss this question on aiqus. When posting use the tag 'unit10-15'
Unit 10 18 Errors in Utility Answers.mp4
Discuss this question on aiqus. When posting use the tag 'unit10-15'
Unit 10 19 Exploration Agents.mp4
Discuss this question on aiqus. When posting use the tag 'unit10-16'
Unit 10 20 Exploration Agent Results.mp4
Discuss this question on aiqus. When posting use the tag 'unit10-17'
Discuss this question on aiqus. When posting use the tag 'unit10-18'
Important: Earlier clarification to use the formula on Wikipedia was not correct. Please use the formula as displayed in this video and for the homework. The s in R(s) is for the current state and not R(s') as in other formulations of Q-learning.
Discuss this question on aiqus. When posting use the tag 'unit10-19'
Discuss this question on aiqus. When posting use the tag 'unit10-20'
Discuss this question on aiqus. When posting use the tag 'unit10-21'
Discuss this question on aiqus. When posting use the tag 'unit10-22