Listwise Reward Estimation for


Offline Preference-based Reinforcement Learning

ICML 2024