Learning Structured Representations of Spatial and Interactive Dynamics for Trajectory Prediction in Crowded Scenes