Offline Reinforcement Learning from Datasets with Structured Non-Stationarity

Johannes Ackermann , Takayuki Osa , Masashi Sugiyama

The University of Tokyo,  RIKEN AIP