Decentralized network control as multi-agent reinforcement learning [paper]
Perimeter control via reinforcement learning [paper]
Optimal control trajectory planning [paper]
Diverse planning [paper]
Air-hockey-playing robot [paper]
Auto-generation of MDPs [paper]
Planning by backprop [paper]
Constraint generation for policy evaluation [paper]
Reach out for more details