Pseudo code for Value Iteration and Policy Iteration (two different versions from the supplementary course textbooks) are copied here:
Pseudo code for Value Iteration and Policy Iteration (two different versions from the supplementary course textbooks) are copied here:
Value Iteration Pseudo-code (from R+N):
Policy Iteration Pseudo-code (from R+N):
Value Iteration Pseudo-code (from S+B):
Policy Iteration Pseudo-code (from S+B):