PrefMMT: Modeling Human Preferences in Preference-based Reinforcement Learning with Multimodal Transformers


Dezhong Zhao*, Ruiqi Wang*, Dayoon Suh, Taehyeon Kim,

Ziqin Yuan, Byung-Cheol Min, and Guohua Chen

*:equal contribution

SMART Lab, Purdue University

[Video] [Paper] [Code]