M0 = Imitation with Past Dropout
M1 = M0 + Traj Perturbation
M2 = M1 + Environment Losses
M4 = M2 + Imitation Dropout
M0 = Imitation with Past Dropout
M1 = M0 + Traj Perturbation
M2 = M1 + Environment Losses
M4 = M2 + Imitation Dropout
M0 = Imitation with Past Dropout
M1 = M0 + Traj Perturbation
M2 = M1 + Environment Losses
M4 = M2 + Imitation Dropout