D'Kitty Coordinate Videos

A hierarchical policy is trained in simulation to coordinate two D'Kitties in moving a box to a desired target location and orientation.

The learned behavior transfers zero-shot to the real world; the target is marked by two white '+' signs on the floor.

The learned behavior also generalizes to settings in which one end of the box is heavier than the other.