Self-Corrected Multimodal Large Language Model for End-to-End Robot Manipulation

Authors: Jiaming Liu 1, Chenxuan Li 1🍩, Guanqun Wang 1🍩, Lily Lee 1, Kaichen Zhou 1, 

Sixiang Chen 1, Chuyan Xiong, Jiaxin Ge 1, Renrui Zhang, Shanghang Zhang 1🍭

🍩:Equal technical contribution; 🍭:Corresponding author


Affiliation: 1) National Key Laboratory for Multimedia Information Processing, 

School of Computer Science, Peking University;  

Main  contributions:

Close loop correction:

Failure case example

Successful correction example