Adapting Visual Policies via Predicted Rewards