REINFORCEMENT LEARNING: PLAYING TO WIN