Learning non-Markovian Decision-Making from State-only Sequences



Aoyang Qin*,1,2, Feng Gao3, Qing Li2, Song-Chun Zhu1,2,4, Sirui Xie*,5 

1Department of Automation, Tsinghua University

2Beijing Institute for General Artificial Intelligence (BIGAI)

3Department of Statistics, UCLA

4School of Artificial Intelligence, Peking University

5Department of Computer Science, UCLA

* Equal contribution

[paper] [code]