Learning non-Markovian Decision-Making from State-only Sequences
Aoyang Qin*,1,2, Feng Gao3, Qing Li2, Song-Chun Zhu1,2,4, Sirui Xie*,5
1Department of Automation, Tsinghua University
2Beijing Institute for General Artificial Intelligence (BIGAI)
3Department of Statistics, UCLA
4School of Artificial Intelligence, Peking University
5Department of Computer Science, UCLA
* Equal contribution