למידת חיזוק
למידת ממוצע פרסים (״Discounted reinforcement learning is fundamentally incompatible with function approximation for control in continuing tasks. It is not an optimization problem in its usual formulation, so when using function approximation there is no optimal policy״, לקריאת המאמר לחץ כאן).