Real-World Offline Reinforcement Learning from Vision Language Model Feedback
Sreyas Venkataraman∗, Yufei Wang∗, Ziyu Wang, Zackory Erickson†, David Held†
Submission to ICRA 2025
*Equal Contribution; †Equal Advising
Sreyas Venkataraman∗, Yufei Wang∗, Ziyu Wang, Zackory Erickson†, David Held†
Submission to ICRA 2025
*Equal Contribution; †Equal Advising
Offline RL-VLM-F (ours)
DP3 baseline
Offline RL-VLM-F (ours)
DP3 baseline
Offline RL-VLM-F (ours)
DP3 baseline
Offline RL-VLM-F (ours)
DP3 baseline
Offline RL-VLM-F (ours)
DP3 baseline
Offline RL-VLM-F (ours)
DP3 baseline