Surprise Minimization in Reinforcement Learning