Ant
HalfCheetah
Hopper
Walker
Humanoid
FetchReach
FetchSlide
FetchPush
FetchPickAndPlace
For each task we see considerable decrease in network size required to achieve at least 90% performance.
Ant
HalfCheetah
Hopper
Walker
Humanoid
FetchReach
FetchSlide
FetchPush
FetchPickAndPlace
For each task reward seems to be correlated with the capacity metric for each MLP.