CtrlFormer: Learning Transferable State Representation for
Visual Control via Transformer

Explicitly model the attention mechanism between the new task and the old task and input images thus enabling fast transfer of knowledge learned from the old task to the new one.