Skip to main content
Andrey Lukyanenko
  • Projects
  • Blog
  • Tags
  • Career
  • Activities
  • About
  • Contact

Tag: dpo

Tag: dpo

2 posts

Nov 27, 2023
Paper Review: Diffusion Model Alignment Using Direct Preference Optimization
Adapting DPO from language models to image generation — training Stable Diffusion XL on 851K human preferences to sig...
paperreview deeplearning cv stablediffusion
Oct 30, 2023
Paper Review: Zephyr: Direct Distillation of LM Alignment
My review of the paper Zephyr Direct Distillation of LM Alignment
paperreview deeplearning nlp llm

← All tags

Andrey Lukyanenko

Machine Learning Engineer at Meta in London. Kaggle Competition Master, Notebook Grandmaster, Google Developer Expert. Polyglot. Writing about applied ML, paper reviews, systems, and learning.

Buy me a tea

Quick Links

  • Blog
  • Projects
  • Career
  • Contact
  • RSS Feed

Connect

© 2026 Andrey Lukyanenko

Type at least 2 characters to search...