Oracle Inequalities for Model Selection in Offline Reinforcement Learning

Jonathan N. Lee, George Tucker, Ofir Nachum,

Bo Dai, Emma Brunskill

[paper, code]