Breaking the Performance Ceiling in Reinforcement Learning requires Inference Strategies