Scaling data-driven robotics with reward sketching and batch reinforcement learning