Track 2B: Text-to-Video Generation