Skip to content
/ h-edit Public

[CVPR 2025] h-Edit: Effective and Flexible Diffusion-Based Editing via Doob’s h-Transform

License

Notifications You must be signed in to change notification settings

nktoan/h-edit

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

62 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

h-Edit: Effective and Flexible Diffusion-Based Editing via Doob’s h-Transform (CVPR'25)

Summary

h-Edit is a finetuning-free diffusion-based editing method that frames editing as a reverse-time bridge modeling problem. It leverages Doob’s h-Transform for bridge construction and Langevin Monte Carlo sampling for generating edited samples.

🔥 Key Features

Theoretical Guarantee - Provides both explicit and implicit forms with unique features. Math doesn't lie! 📏
🚀 Training-Free, Simple, General - Smarter edits, zero headaches! 🧠
🏆 Strong Performance, SOTA on PieBench - Tackles tough cases like a champ! 💪
🛠️ Flexible - Supports conditional scores, external reward models; the first to handle both simultaneously! 🎛️
🎯 Compatible - Works with deterministic/random inversion, P2P, MasaCtrl and Plug-n-Play or even without attention control! 🔄
🔌 Plug-and-Play - Just add a pretrained diffusion model, whether for images, text, audio, or graphs, and you're all set! ✨

🏗️ Overview of Implicit h-Edit

Overview of h-edit

🔬 Experiments

We showcase h-Edit's capabilities in three settings: text-guided editing (conditional scores), face swapping (external reward models), and combined text-guided & style editing (both). Each experiment is linked below.

📢 📢 📢 If h-Edit helps your work, we’d love your feedback! Please cite our paper and giving us a ⭐ - it means a lot! 🚀

Important

If this repository is useful for your work, please cite it:

@article{nguyen2025hedit,
 title={h-Edit: Effective and Flexible Diffusion-Based Editing via Doob's h-Transform},
 author={Nguyen, Toan and Do, Kien and Kieu, Duc and Nguyen, Thin},
 journal={arXiv preprint arXiv:2503.02187},
 year={2025}
}

🏆 Notable Results

📊 SOTA on PieBench

📝 Text-Guided Editing Visualizations

👥 Face Swapping Visualizations

🎨 Combined Text-Guided & Style Editing Visualizations

📌 To-Do List

  • Release code for Face Swapping
  • Release code for Combined Text-Guided & Style Editing
  • Develop Webpage
  • Deploy on Hugging Face
  • Build App Demo

Releases

No releases published

Packages

No packages published