Paper Lists

Learning from Demonstration (Behavior Cloning)

Multi Armed Bandits

Continuous Black-Box Optimization


Stochastic and Adversarial MAB


Contextual MAB


Dueling MAB


Combinatorial MAB


Reinforcement Learning Theory

Deep Reinforcement Learning

Model-based (Deep) Reinforcement Learning

Distributional Reinforcement Learning

Offline Reinforcement Learning

Meta Reinforcement Learning

Hierarchical Reinforcement Learning


Causal Reinforcement Learning


(Classic) Reinforcement Learning

Constrained Markov Decision Processes


Safe Exploration

Zimmer, Christoph, Mona Meister, and Duy Nguyen-Tuong. "Safe active learning for time-series modeling with gaussian processes." Proceedings of the 32nd International Conference on Neural Information Processing Systems. 2018.

Koller, Torsten, Felix Berkenkamp, Matteo Turchetta, and Andreas Krause. "Learning-based model predictive control for safe exploration." In 2018 IEEE conference on decision and control (CDC), pp. 6059-6066. IEEE, 2018.

Dalal, Gal, Krishnamurthy Dvijotham, Matej Vecerik, Todd Hester, Cosmin Paduraru, and Yuval Tassa. "Safe exploration in continuous action spaces." arXiv preprint arXiv:1801.08757 (2018).

Berkenkamp, Felix, Angela P. Schoellig, and Andreas Krause. "Safe controller optimization for quadrotors with Gaussian processes." 2016 IEEE International Conference on Robotics and Automation (ICRA). IEEE, 2016.

Sui, Yanan, Alkis Gotovos, Joel Burdick, and Andreas Krause. "Safe exploration for optimization with Gaussian processes." In International Conference on Machine Learning, pp. 997-1005. PMLR, 2015.

Berkenkamp, Felix, et al. "Safe model-based reinforcement learning with stability guarantees." Proceedings of the 31st International Conference on Neural Information Processing Systems. 2017.