Maximum diffusion reinforcement learning