Maximum Entropy Heterogeneous-Agent Mirror Learning