Towards Improving Reward Design in RL: A Reward Alignment Metric for RL Practitioners


Calarina Muslimani¹⁶, Kerrick Johnstonbaugh², Suyog Chandramouli³, Serena Booth⁴, W. Bradley Knox⁵, Matthew E. Taylor¹⁶


1 University of Alberta

2 RLCore

3 Princeton University

4 Brown University 

5 University of Texas at Austin

6 Alberta Machine Intelligence Institute (Amii)

Paper Code