Safe Imitation Learning via Fast Bayesian Reward Inference from Preferences

Daniel S. Brown, Russell Coleman, Ravi Srinivasan, Scott Niekum

University of Texas at Austin

In Proceedings of the Thirty-seventh International Conference on Machine Learning (ICML) 2020.