Patch-based Object-centric Transformers for Efficient Video Prediction

Wilson Yan, Ryo Okumura, Stephen James, Pieter Abbeel