Fine-tuning Reinforcement Learning Models is Secretly a Forgetting Mitigation Problem 

Maciej Wołczyk*¹   Bartłomiej Cupiał*¹²   Mateusz Ostaszewski³   Michał Bortkiewicz³   Michał Zając⁴
Razvan Pascanu⁵   Łukasz Kuciński¹²⁶   Piotr Miłoś¹²⁶⁷

*Equal contribution   ¹IDEAS NCBR   ²University of Warsaw   ³Warsaw University of Technology   ⁴Jagiellonian University 
⁵Google DeepMind   ⁶Institute of Mathematics, Polish Academy of Sciences   ⁷deepsense.ai

Contact: {maciej.wolczyk, bartlomiej.cupial}@gmail.com

[Paper] [Poster] [Code]