Learning to Compose Hierarchical Object-Centric Controllers for Robotic Manipulation
Mohit Sharma*, Jacky Liang*, Jialiang (Alan) Zhao, Alex LaGrassa, Oliver Kroemer
* Equal Contribution[arXiv]
Mohit Sharma*, Jacky Liang*, Jialiang (Alan) Zhao, Alex LaGrassa, Oliver Kroemer
* Equal Contribution[arXiv]
Below we show learned policies for all tasks using all the different baselines and our proposed ExpandMDP based approaches.
End-Effector Action Space
One Controller
3-Priority
Proposed: 3-ExpandMDP-Single 3-ExpandMDP-Multi
End-Effector Action Space
One Controller
3-Priority
Proposed: 3-ExpandMDP-Single 3-ExpandMDP-Multi
End-Effector Action Space
One Controller
3-Priority
Proposed: 3-ExpandMDP-Single 3-ExpandMDP-Multi
End-Effector Action Space
One Controller
3-Priority
Proposed: 3-ExpandMDP-Single 3-ExpandMDP-Multi
End-Effector Action Space
One Controller
3-Priority
Combo 3-ExpandMDP-Single 3-ExpandMDP-Multi
End-Effector Action Space
One Controller
3-Priority
3-Combo 3-ExpandMDP-Single 3-ExpandMDP-Multi
The above figures plot the controllers selected by the 3-ExpandMDP-Multi method for the Block Fit task. As seen in the above figures the proposed approach uses object-axes controllers associated with different objects in parallel. More discussion for these results are in the Appendix of the paper.