Value-Based RL Scales Predictably