Dataset
In order to train a generalizable policy that can leverage diverse datasets, we pick three publicly available navigation datasets that vary in their collection platform, visual sensors, and dynamics. This allows us to train policies that can learn shared representations across these widely varying datasets, and generalize to new environments (both indoors and outdoors) and new robots.
GO Stanford dataset (camera: sperical camera, platform: Turtlebot2, location: buildings in Stanford campus)
RECON dataset (camera: wide FoV camera, platform: Jackal, location: grassy fields)
KITTI odometry dataset (camera: narrow FoV camera, platform: vehicle, location: european city)
Appendix
We provide additional explanations for the masked normalization weight in the geometric-aware objective, our view synthesis, and our navigation system in real environments. The details are also explained in the appendix of the arXiv version.
Masked normalization weight
Process of our view synthesis
Navigation system