DPOK: Reinforcement Learning for Fine-tuning Text-to-Image Diffusion Models
Ying Fan*, Olivia Watkins, Yuqing Du, Hao Liu, Moonkyung Ryu,
Craig Boutilier, Pieter Abbeel, Mohammad Ghavamzadeh, Kangwook Lee, Kimin Lee*
*Equal technical contribution
Google Research, University of Wisconsin-Madison, UC Berkeley