Including human disturbance for randomizing the distribution over initial states.
The graph on the top right displays the number of the currently active sub-policy. On the bottom left, we display various parameters describing the system, including the currently active task (with a randomly chosen sequence displayed in each video). The videos elaborate on task decomposition into individual components as well as reuse of components across tasks.
All visualizations purely focus on the performance of the hierarchical models presented in the corresponding submission.
The complete paper including appendix (missing from the RSS 2020 proceedings) can be found under https://arxiv.org/abs/1906.11228