FURTHER

Influencing Long-Term Behavior in

Multiagent Reinforcement Learning

MuJoCo RoboSumo (Al-Shevidat et al., 2018)

MAgent Battle (Zheng et al., 2018)

FURTHER (Red) vs LILI (Green)

FURTHER-MF (Red) vs LILI-MF (Blue)

Summary:

FURTHER is more effective than LILI by employing the limiting view via the average reward formulation.
FURTHER can scale to an environment with complex interactions (e.g., RoboSumo) and a large number of agents (e.g., Battle).

Page updated

Google Sites

Report abuse