MT-VAE: Learning to Generate Multimodal Motion Dynamics