Data
We provide the full experiment data. For each experiment a compressed container contains the the folders with the different experimental runs corresponding to different seeds and hyperparameters. Each of these contains the following files:
- params.json : experiment configuration file specifying the hyperparameters and seeds of the run
- progress.csv : tabular file containing the measurements and quantitative results that are used to generate the plots in the paper
- log.txt: log file of the meta-training
- params.pkl: serialized dictionary containing the trained policy, baseline, and the environment (pickled with joblib)
Benchmark Study of Meta-Policy Search Methods
Benchmark Study of Meta-Policy Search Methods
https://www.dropbox.com/sh/lh6cbhu9yhiex08/AAB1hJ6jpjVYD6pJma4uTEgDa?dl=0
Gradient Variance of Curvature Estimators (DICE vs. LVC)
Gradient Variance of Curvature Estimators (DICE vs. LVC)
https://www.dropbox.com/sh/5dlnd7smeulif4e/AABbo_hpmI7lpur62d0HlF9-a?dl=0
Comparison of Initial Sampling Distributions
Comparison of Initial Sampling Distributions
https://www.dropbox.com/sh/qs9iorsxx3qejrk/AABy_BKbBZc26KQUepifvJ_ma?dl=0
Gradient Update Directions of Meta-RL Formulations
Gradient Update Directions of Meta-RL Formulations