Adversarial Object Rearrangement in Constrained Environments

with Heterogeneous Graph Neural Networks


Abstract

Adversarial object rearrangement in the real world (e.g., previously unseen or oversized items in kitchens and stores) could benefit from understanding task scenes, which inherently entail heterogeneous components such as current objects, goal objects, and environmental constraints. The semantic relationships among them (e.g., the "meat can" and "shelf" are related by "off") are distinct from each other and crucial for multi-skilled robots to perform efficiently in everyday scenarios. We propose a hierarchical robotic manipulation system that learns the underlying relationships and maximizes the collaborative power of its diverse skills (e.g., pick-place,  push) for rearranging adversarial objects in constrained environments. The high-level coordinator employs a heterogeneous graph neural network (HetGNN), which reasons about the current objects, goal objects, and environmental constraints; the low-level 3D Convolutional Neural Network-based actors execute the action primitives. Our approach is trained entirely in simulation, and achieved an average success rate of 87.88% and a planning cost of 12.82 in real-world experiments, surpassing all baseline methods.


In this adversarial object rearrangement task, the color-coded heterogeneous task components (e.g., current objects, goal objects, and environmental constraints) are linked by different semantic relationships that are crucial to efficiently guiding a multi-skilled robot. By understanding that the ``bowl'' and the ``shelf'' are related by ``on'', a robot will swiftly push it to the nearby goal and clear space for the ``meat can'', which requires pick-place to move from ``ground'' to ``shelf''.

An example of real-world experiments in the Shelf scenario. The hierarchical system coordinates pick-place and push while selecting the most feasible target at each time step by considering all different types of scene components and relations among them.