Learning Robust Rewards with Adversarial Inverse Reinforcement Learning