Simultaneously Learning Vision and Feature-based Control Policies for Real-world Ball-in-a-Cup