Frederik Träuble*, Andrea Dittadi*, Manuel Wüthrich, Felix Widmaier, Peter Gehler, Ole Winther, Francesco Locatello, Olivier Bachem, Bernhard Schölkopf, Stefan Bauer
[Paper]
Building sample-efficient agents that generalize out-of-distribution (OOD) in real-world settings remains a fundamental unsolved problem on the path towards achieving higher-level cognition. One particularly promising approach is to begin with low-dimensional, pretrained representations of our world, which should facilitate efficient downstream learning and generalization. By training 240 representations and over 10,000 reinforcement learning (RL) policies on a simulated robotic setup, we evaluate to what extent different properties of pretrained VAE-based representations affect the OOD generalization of downstream agents. We observe that many agents are surprisingly robust to realistic distribution shifts, including the challenging sim-to-real case. In addition, we find that the generalization performance of a simple downstream proxy task reliably predicts the generalization performance of our RL agents under a wide range of OOD settings. Such proxy tasks can thus be used to select pretrained representations that will lead to agents that generalize.
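To make the setup concrete, below is a minimal PyTorch sketch of the general pipeline described above: a pretrained encoder is frozen, a small head is trained on its latents for a simple proxy task, and the proxy's error on OOD observations is used as a cheap score for selecting representations. This is not the authors' code; the encoder architecture, latent size, proxy target, and all names are illustrative assumptions.

```python
# Minimal sketch (illustrative, not the paper's implementation):
# freeze a pretrained encoder, train a proxy head on its latents,
# and measure the proxy error under a distribution shift.
import torch
import torch.nn as nn

latent_dim = 10  # assumed size of the pretrained VAE latent space


class Encoder(nn.Module):
    """Stand-in for a pretrained VAE encoder (weights would be loaded, then frozen)."""
    def __init__(self):
        super().__init__()
        self.net = nn.Sequential(
            nn.Flatten(),
            nn.Linear(3 * 64 * 64, 256),
            nn.ReLU(),
            nn.Linear(256, latent_dim),
        )

    def forward(self, x):
        return self.net(x)


encoder = Encoder()
encoder.requires_grad_(False)  # the representation stays fixed during downstream training
encoder.eval()

# Proxy task: regress a simple ground-truth factor (e.g. cube position) from latents.
proxy_head = nn.Linear(latent_dim, 3)
opt = torch.optim.Adam(proxy_head.parameters(), lr=1e-3)


def proxy_loss(images, targets):
    with torch.no_grad():
        z = encoder(images)  # frozen representation
    return nn.functional.mse_loss(proxy_head(z), targets)


# Toy training step on in-distribution data (random tensors as placeholders).
images, targets = torch.rand(32, 3, 64, 64), torch.rand(32, 3)
loss = proxy_loss(images, targets)
opt.zero_grad()
loss.backward()
opt.step()

# Evaluating the trained proxy head on OOD observations (e.g. unseen cube colors)
# yields a score for ranking representations before training any RL agent.
ood_images, ood_targets = torch.rand(32, 3, 64, 64), torch.rand(32, 3)
with torch.no_grad():
    ood_error = proxy_loss(ood_images, ood_targets).item()
print(f"proxy OOD error: {ood_error:.4f}")
```

In the same spirit, an RL policy would consume the frozen latents as its observation; the paper's finding is that the proxy-task OOD error reliably predicts how well such downstream agents generalize.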
Figure: example cube colors in the in-distribution (train), OOD1, and OOD2 settings.
@inproceedings{dittadi2021role,
title={The Role of Pretrained Representations for the {OOD} Generalization of {RL} Agents},
author={Dittadi, Andrea and Tr{\"a}uble, Frederik and W{\"u}thrich, Manuel and Widmaier, Felix and Gehler, Peter and Winther, Ole and Locatello, Francesco and Bachem, Olivier and Sch{\"o}lkopf, Bernhard and Bauer, Stefan},
booktitle={International Conference on Learning Representations},
year={2022}
}