SeMOPO: Learning High-quality Model and Policy from Low-quality Offline Visual Datasets
Shenghua Wan Ziyuan Chen Le Gan Shuai Feng De-Chuan Zhan
The task-relevant observation reconstructions
(from top to bottom: raw observation, reconstruction, difference)
SeMOPO (Ours)
Separated Models + RS
Separated Models + RS + DRP