HyperPPO: A scalable method for finding small policies for robotic control

Shashank Hegde Zhehui Huang Gaurav S. Sukhatme

University of Southern California