10/26: Distributed multi-body and multi-agent RL
Stuart Russell and Andrew Zimdars, Q-Decomposition for Reinforcement Learning Agents. ICML 2003.
Bhaskara Marthi, Stuart Russell, David Latham, and Carlos Guestrin, Concurrent hierarchical reinforcement learning. IJCAI 2005.
Michael Littman, Markov games as a framework for multi-agent reinforcement learning. ICML 1994.
Michael Littman, Value-function reinforcement learning in Markov games. Journal of Cognitive Systems Research 1, 2001.
Junling Hu and Michael Wellman, Nash Q-Learning for General-Sum Stochastic Games. JMLR 4, 1039-1069, 2003.
Yoav Shoham, Rob Powers, and Trond Grenager, If multi-agent learning is the answer, what is the question? AIJ 171(7), 365-377, 2007.
10/28: Applications of multi-agent RL
Presenter 1:
Max Jaderberg et al., Human-level performance in 3D multiplayer games with population-based reinforcement learning. Science 364 (6443), 859-865, 2019.
Presenter 2:
Jakob Foerster, Francis Song, Edward Hughes, Neil Burch, Iain Dunning, Shimon Whiteson, Matthew Botvinick, Michael Bowling , Bayesian Action Decoder for Deep Multi-Agent Reinforcement Learning. ICML 2019.
Adam Lerer, Hengyuan Hu, Jakob Foerster, and Noam Brown, Improving Policies via Search in Cooperative Partially Observable Games. AAAI 2020.