A Guidance for Reward Shaping and Reward Design in Value-Based DRL


Hao Sun, Lei Han, Rui Yang, Xiaoteng Ma, Jian Guo, Bolei Zhou

hs789@cam.ac.uk