Lagrangian Perturbation Diffusion Steering: Latent Reinforcement Learning for Generative Policies