I am currently researching using reinforcement learning (specifically Natural Policy Gradients) for learning walking policies for a Cassie robot. I have modified Julia open-source learning libraries and created infrastructure and wrapper code to interface with Cassie simulator libraries in order to create a working training environment.
To aid this project, I am also applying reinforcement learning to a much simpler and smaller problem on a SLIP model, a reduced-order model that applies to Cassie. Successful policies in a SLIP domain can help inform how the full robot should act.
I am also investigating how learned policies are transferred to hardware. Particularly, I am interested in finding out what is needed to cross the sim-to-real gap, i.e. what needs to model, how good of a model is needed, etc.