References

(will be more updated soon)

Courses & Textbooks

Online Learning

Bandits

Coactive Learning

Behavioral Cloning

More Imitation Learning

Inverse reinforcement learning

Basic Reinforcement Learning

Sparse Feedback in RL

Learning + Control

Safe Reinforcement Learning

Constrained Policy Search in Reinforcement Learning

Multi-Task & Transfer in RL & IL

Off-policy learning

Monte Carlo Tree Search

  • A Survey of Monte Carlo Tree Search Methods by Cameron Browne, Edward Powley, Daniel Whitehouse, Simon Lucas, Peter I. Cowling, Philipp Rohlfshagen, Stephen Tavener, Diego Perez, Spyridon Samothrakis and Simon Colton. IEEE Transactions on Computational Intelligence and AI in Games, 4(1), 2012.
  • Applying Monte Carlo Tree Search to Go) Mastering the game of Go with deep neural networks and tree search, by David Silver, Aja Huang, Chris J. Maddison, Arthur Guez, Laurent Sifre, George van den Driessche, Julian Schrittwieser, Ioannis Antonoglou, Veda Panneershelvam, Marc Lanctot, Sander Dieleman, Dominik Grewe, John Nham, Nal Kalchbrenner, Ilya Sutskever, Timothy Lillicrap, Madeleine Leach, Koray Kavukcuoglu, Thore Graepel, and Demis Hassabis. Nature, 529, 484–489, doi:10.1038/nature16961, 2016.

Other Forward Search in RL

Theory

Partial-observable RL

Adversarial & Multi-Agent