Arizona State University
Official Webpage: https://prabinrath.github.io/xmop/ | Code: https://github.com/prabinrath/xmop
Abstract
Classical manipulator motion planners work across different robot embodiments. However, they plan on a pre-specified, static environment representation and do not scale to unseen dynamic environments. Neural Motion Planners (NMPs) are an appealing alternative to conventional planners because they incorporate different environmental constraints to learn motion policies directly from raw sensor observations. Contemporary state-of-the-art NMPs can successfully plan across different environments; however, none of the existing NMPs generalizes across robot embodiments. In this paper, we propose Cross-Embodiment Motion Policy (XMoP), a neural policy for learning to plan over a distribution of manipulators. XMoP implicitly learns to satisfy kinematic constraints for a distribution of robots and zero-shot transfers its planning behavior to unseen robotic manipulators within this distribution. We achieve this generalization by formulating a whole-body control policy that is trained on planning demonstrations from over three million procedurally sampled robotic manipulators in different simulated environments. Despite being trained entirely on synthetic embodiments and environments, our policy exhibits strong sim-to-real generalization across manipulators with different kinematic variations and degrees of freedom, using the same set of frozen policy parameters. We evaluate XMoP on 7 commercially available manipulators and show successful cross-embodiment motion planning, achieving an average 70% success rate on baseline benchmarks. Furthermore, we show sim-to-real demonstrations on two unseen manipulators solving novel planning problems across eight unstructured real-world environments, even in the presence of dynamic obstacles.
Cross-Embodiment Motion Policy (XMoP) is a Behavior Cloning (BC) policy trained on synthetic planning demonstrations; it zero-shot transfers to unseen robotic manipulators and can plan in unstructured real-world environments.
All rollouts shown in the videos (both simulated and real) use XMoP with a fixed set of frozen policy parameters.
XMoP uses Model Predictive Control (MPC), which allows for locally reactive planning in the presence of dynamic obstacles.
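The sketch below illustrates this receding-horizon idea under assumed, generic interfaces: each re-plan is conditioned on a fresh observation and only a short prefix of the predicted plan is executed before planning again. The function names (policy_fn, get_observation, send_joint_command, goal_reached) are hypothetical placeholders, not the actual XMoP API.

```python
import numpy as np

def receding_horizon_control(policy_fn, get_observation, get_joint_state,
                             send_joint_command, goal_reached,
                             horizon=16, n_execute=4, max_replans=100):
    """MPC-style loop: re-plan from the latest observation, execute only the
    first few predicted waypoints, then re-plan. Conditioning every plan on a
    fresh observation is what makes the behavior locally reactive."""
    for _ in range(max_replans):
        obs = get_observation()              # e.g. a depth-camera point cloud
        q = get_joint_state()                # current joint configuration
        plan = policy_fn(obs, q, horizon)    # (horizon, dof) joint waypoints
        for q_next in plan[:n_execute]:      # commit only a short prefix
            send_joint_command(q_next)
        if goal_reached():
            return True
    return False

# Dry run with dummy callables (2-DoF "robot", no obstacles):
if __name__ == "__main__":
    state = {"q": np.zeros(2)}
    goal = np.array([0.5, -0.3])
    dummy_policy = lambda obs, q, h: q + np.linspace(0, 1, h)[:, None] * (goal - q)
    reached = receding_horizon_control(
        policy_fn=dummy_policy,
        get_observation=lambda: None,
        get_joint_state=lambda: state["q"],
        send_joint_command=lambda q_cmd: state.update(q=q_cmd),
        goal_reached=lambda: np.linalg.norm(state["q"] - goal) < 1e-3,
    )
    print("goal reached:", reached)
```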
XMoP generates smooth, near-optimal trajectories for reaching SE(3) targets in obstacle-free environments, enabling zero-shot control of 7 commercial robot embodiments.
XMoP is trained on a distribution of synthetic embodiments, allowing the policy to zero-shot generalize to unseen commercial manipulators. We used the 3.27 million planning problems from the MpiNets dataset and generated demonstration data for each problem with a uniquely sampled embodiment!
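The sketch below illustrates this data-generation recipe under simple assumptions: every planning problem is paired with a freshly sampled synthetic manipulator, and the expert planner's solution is stored as a behavior-cloning demonstration. sample_random_manipulator, the sampled parameter ranges, and expert_planner are hypothetical stand-ins, not the released pipeline.

```python
import numpy as np

def sample_random_manipulator(rng, min_dof=6, max_dof=7):
    """Procedurally sample a kinematic structure: DoF count, link lengths,
    and joint limits drawn from simple ranges (illustrative values only)."""
    dof = int(rng.integers(min_dof, max_dof + 1))
    return {
        "dof": dof,
        "link_lengths": rng.uniform(0.15, 0.45, size=dof).tolist(),
        "joint_limits": np.stack([-np.pi * np.ones(dof),
                                   np.pi * np.ones(dof)], axis=1).tolist(),
    }

def generate_demonstrations(problems, expert_planner, seed=0):
    """For each (scene, start, goal) problem, sample a unique embodiment and
    store the expert trajectory as a behavior-cloning demonstration."""
    rng = np.random.default_rng(seed)
    dataset = []
    for scene, start_pose, goal_pose in problems:
        robot = sample_random_manipulator(rng)
        trajectory = expert_planner(robot, scene, start_pose, goal_pose)
        if trajectory is not None:           # keep only solved problems
            dataset.append({"robot": robot, "scene": scene,
                            "trajectory": trajectory})
    return dataset
```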
A few samples from XMoP training data are shown in the video below.
Real-World Experiments on 3 Domains
Unstructured Obstacle Domain: We test XMoP's ability to plan in unstructured, cluttered environments, which are difficult for conventional planning algorithms that rely on primitive-based obstacle representations.
Wall Hopping Domain: We test XMoP's ability to plan in a structured environment with large obstacles that significantly occupy the space in front of the robot.
Bin-to-Bin Domain: We test XMoP’s ability to plan for a common industrial bin-to-bin motion task, where the manipulator needs to move its end-effector from inside one bin to another.
Failure Modes of XMoP
The robot hits the walls of the bin while approaching the goal.
A partially observable obstacle leads to a collision.
Failure when the goal is too close to an obstacle.