Direct Post-Training Preference Alignment for Multi-Agent Motion Generation Model Using Implicit Feedback from Pre-training Demonstrations


Ran (Thomas) Tian, Kratarth Goel

ICLR 2025, Spotlight Paper