Isaac Gym: High Performance GPU-Based Physics
Simulation For Robot Learning🤖
Viktor Makoviychuk, Lukasz Wawrzyniak, Yunrong Guo, Michelle Lu, Kier Storey, Miles Macklin, David Hoeller, Nikita Rudin, Arthur Allshire, Ankur Handa, Gavriel State
Isaac Gym: High Performance GPU-Based Physics
Simulation For Robot Learning🤖
Viktor Makoviychuk, Lukasz Wawrzyniak, Yunrong Guo, Michelle Lu, Kier Storey, Miles Macklin, David Hoeller, Nikita Rudin, Arthur Allshire, Ankur Handa, Gavriel State
End-to-end GPU accelerated simulator for robotics and AI research
Abstract
Isaac Gym offers a high performance learning platform to train policies for wide variety of robotics tasks directly on GPU. Both physics simulation and neural network  policy  training  reside  on  GPU  and  communicate  by  directly  passing data from physics buffers to PyTorch tensors without ever going through any CPU bottlenecks.  This leads to blazing fast training times for complex robotics tasks on a single GPU with 1-2 orders of magnitude improvements compared to conventional RL training that uses a CPU based simulator and GPU for neural networks. Isaac gym can be downloaded at https://developer.nvidia.com/isaac-gym
This work is predated by our previous effort on accelerating physics simulation on GPU: https://sites.google.com/view/accelerated-gpu-simulation/home
Benchmarking Simulation Performance
Ant
Rewards and effective FPS with respect to number of parallel environments for the Ant experiment. Best training time is achieved with 8192 environments and a horizon lengths of 16.
Humanoid
Rewards and effective FPS with respect to number of parallel environments for the Humanoid experiment. Best training time is achieved with 4096 environments and a horizon lengths of 32.
Rewards and effective FPS with respect to number of parallel environments for the Humanoid experiment. Best training time is achieved with both 4096 and 8192 environments and horizon lengths of 64 and 32 respectively.
Shadow Hand
Rewards and effective FPS with respect to number of parallel environments for the Shadow Hand experiment. Best training time is achieved with both 8192 and 16384 environments and horizon lengths of 16 and 8 respectively.
Benchmarking Environment Performance
Ant
Humanoid
Shadow Hand Feed Forward Policy
Shadow Hand LSTM Policy
Allegro Hand Standard
ANYmal Robot
AMP: Adversarial Motion Priors