DPOK: Reinforcement Learning for Fine-tuning Text-to-Image Diffusion Models

Ying Fan*, Olivia Watkins, Yuqing Du, Hao Liu, Moonkyung Ryu

Craig Boutilier, Pieter Abbeel, Mohammad Ghavamzadeh, Kangwook Lee, Kimin Lee*

*Equal technical contribution

Google Research, University of Wisconsin-Madison, UC Berkeley

[Paper] [Code]