DEMO³

Multi-Stage Manipulation with Demonstration-Augmented Reward, Policy, and World Model Learning

Adrià López Escoriza1,2, Nicklas Hansen1, Stone Tao1,3, Tongzhou Mu1, Hao Su1,3

1UC San Diego, 2ETH Zürich, 3Hillbot

ICML 2025