Videos of Go1 Real-World Trials
Task 1: Heavy Luggage
Walking (No Behavior Modulation)
Avg. Time: 45.3s
Avg Falls/Readjustments: 2.3
High-Level Classifier
Avg. Time: 42.7s
Avg Falls/Readjustments: 3
ROAM (ours)
Avg. Time: 25.7s
Avg Falls/Readjustments: 0.7
Task 2: Dynamic Load
Walking (No Behavior Modulation)
Avg. Time: 32s
Avg Falls/Readjustments: 1
High-Level Classifier
Avg. Time: 28.3s
Avg Falls/Readjustments: 1.3
ROAM (ours)
Avg. Time: 24.3s
Avg Falls/Readjustments: 0.3
Task 3: Roller Skates
Walking (No Behavior Modulation)
Avg. Time: NC
Avg Falls/Readjustments: NC
High-Level Classifier
Avg. Time: 62.3s
Avg Falls/Readjustments: 2.7
ROAM (ours)
Avg. Time: 27.3s
Avg Falls/Readjustments: 1
Sample Rollout in Simulation
In our simulated experiments, we find that ROAM is over 2x as efficient as prior methods that are designed for fast adaptation. Here we plot the behavior distribution for ROAM over the course of a single-life trial, where the agent is tasked with adapting to different stiffness on-the-go. Green bars indicate relevant behaviors to the current situations while red bars indicate irrelevant behaviors. We find that ROAM can quickly react to changing situations by choosing and adapting relevant behaviors on-the-fly.