Non-Parametric Stochastic Policy Gradient with Strategic Retreat for Non-Stationary Environment


Apan Dastider and Mingjie Lin

Autonomous Computing Lab, University of Central Florida

arXiv paper link: https://arxiv.org/abs/2203.14905