Deep Interactive Motion Prediction and Planning
Playing Games with Motion Prediction Models
Jose Vazquez, Alexander Liniger, Wilko Schwarting, Daniela Rus, Luc van Gool
For an extended version of the L4DC-2022 article, please visit: https://arxiv.org/pdf/2204.02392.pdf
Abstract
In most classical Autonomous Vehicle (AV) stacks, the prediction and planning layers are separated, limiting the planner to react to predictions that are not informed by the planned trajectory of the AV. This work presents a module that tightly couples these layers via a game-theoretic Model Predictive Controller (MPC) that uses a novel interactive multi-agent neural network policy as part of its predictive model. In our setting, the MPC planner considers all the surrounding agents by informing the multi-agent policy with the planned state sequence. Fundamental to the success of our method is the design of a novel multi-agent policy network that can steer a vehicle given the state of the surrounding agents and the map information. The policy network is trained implicitly with ground-truth observation data using backpropagation through time and a differentiable dynamics model to roll out the trajectory forward in time. Finally, we show that our multi-agent policy network learns to drive while interacting with the environment, and, when combined with the game-theoretic MPC planner, can successfully generate interactive behaviors.
Interactive Multi-Agent Prediction (IMAP) Policy (Good Performance Examples)
Interactive Multi-Agent Prediction (IMAP) Policy (Bad Performance Examples)
Interactive Planning for Lane Changes
Example of interactive prediction and planning during a lane change scenario, note how the affected agent is the only one slowing down, the rest of the agents (in gray) are unaffected by the planned trajectory (orange).
Nominal Prediction
Iterative Leader-Follower MPC
Iterative Best-Response MPC
Adversarial ILF-MPC