Can Differentiable Decision Trees Enable Interpretable Reward Learning from Human Feedback?

Akansha Kalra Daniel S. Brown        

 Reinforcement Learning Conference (RLC) 2024

 Motivation