Reinforcement learning compared to unsupervised learning