Data

We provide the full experiment data. For each experiment a compressed container contains the the folders with the different experimental runs corresponding to different seeds and hyperparameters. Each of these contains the following files:

  • params.json : experiment configuration file specifying the hyperparameters and seeds of the run
  • progress.csv : tabular file containing the measurements and quantitative results that are used to generate the plots in the paper
  • log.txt: log file of the meta-training
  • params.pkl: serialized dictionary containing the trained policy, baseline, and the environment (pickled with joblib)

Benchmark Study of Meta-Policy Search Methods

https://www.dropbox.com/sh/lh6cbhu9yhiex08/AAB1hJ6jpjVYD6pJma4uTEgDa?dl=0

Gradient Variance of Curvature Estimators (DICE vs. LVC)

https://www.dropbox.com/sh/5dlnd7smeulif4e/AABbo_hpmI7lpur62d0HlF9-a?dl=0

Comparison of Initial Sampling Distributions

https://www.dropbox.com/sh/qs9iorsxx3qejrk/AABy_BKbBZc26KQUepifvJ_ma?dl=0

Gradient Update Directions of Meta-RL Formulations