Domain Randomization and Generative Models for Robotic Grasping

Josh Tobin, Wojciech Zaremba, and Pieter Abbeel

OpenAI and UC Berkeley

Abstract: Deep learning-based robotic grasping has made significant progress the past several years thanks to algorithmic improvements and increased data availability. However, state-of-the-art models are often trained on as few as hundreds or thousands of unique object instances, and as a result generalization can be a challenge.

In this work, we explore a novel data generation pipeline for training a deep neural network to perform grasp planning that applies the idea of domain randomization to object synthesis. We generate millions of unique, unrealistic procedurally generated objects, and train a deep neural network to perform grasp planning on these objects.

Since the distribution of successful grasps for a given object can be highly multimodal, we propose an autoregressive grasp planning model that maps sensor inputs of a scene to a probability distribution that corresponds to the model's estimate of the normalized probability of success of a grasp conditioned on the observations. Our model allows us to sample grasps efficiently at test time (or avoid sampling entirely).

We evaluate our model architecture and data generation pipeline in simulation and in the real world. We find that we can achieve >90% success rate on previously unseen realistic objects at test time in simulation despite having only been trained on random objects, and ~80% success rate on objects in the real world despite having only been trained in simulation.

arXiv: https://arxiv.org/abs/1710.06425