V-PTR
V-PTR
Robotic Offline RL from Internet Videos
Robotic Offline RL from Internet Videos
via Temporal-Difference Learning
via Temporal-Difference Learning
Pre-trains on internet video data with value-based RL to get functional representations to understand possible future outcomes
Pre-trains on diverse robot data to understand the actions that lead to future outcomes
Fine-tunes on task-specific robot demos to understand how to complete specific tasks
Results + Rollouts
V-PTR (ours)
PTR
Masked Visual Pre-training
VIP
R3M