Reward Fine-Tuning Two-Step Diffusion Models via Learning Differentiable Latent-Space Surrogate Reward


Zhiwei Jia, Yuesong Nan, Huixi Zhao, Gengdai Liu

Zoom Communications

CVPR 2025
