Trajectory to identify the system parameters of the physical Barret WAM.
Trajectory to identify the system parameters of the string and ball dynamics.
The Videos are replayed with 0.5x speed. Green trajectories highlight a successful Ball in a Cup movement. Red trajectories display failure cases. Each video shows the optimal policy trained using different reinforcement learning seed but the identical learned model.
The Videos are replayed with 0.5x speed. Green trajectories highlight a successful Ball in a Cup movement. Red trajectories display failure cases. Each video shows the optimal policy trained using different reinforcement learning seed but the identical learned model.
The Videos are replayed with 0.5x speed. Green trajectories highlight a successful Ball in a Cup movement. Red trajectories display failure cases. Each video shows the optimal policy trained using different reinforcement learning seed but the identical learned model.