Paper Review: Diffusion Model Alignment Using Direct Preference Optimization
27 November 2023
Adapting DPO from language models to image generation — training Stable Diffusion XL on 851K human preferences to significantly improve visual appeal and prompt alignment.