
DEADiff: An Efficient Stylization Diffusion Model with Disentangled Representations (CVPR 2024)


Tianhao Qi*, Shancheng Fang, Yanze Wu✝, Hongtao Xie✉, Jiawei Liu,
Lang Chen, Qian He, Yongdong Zhang


(*Work done during an internship at ByteDance, ✝Project Lead, ✉Corresponding author)

From University of Science and Technology of China and ByteDance.

🔆 Introduction

TL;DR: We propose DEADiff, a generic method facilitating the synthesis of novel images that embody the style of a given reference image and adhere to text prompts.

⭐⭐ Stylized Text-to-Image Generation.

Stylized text-to-image results. Resolution: 512 x 512. (Compressed)

📝 Changelog

  • [2024.4.3]: 🔥🔥 Release the inference code and pretrained checkpoint.
  • [2024.3.5]: 🔥🔥 Release the project page.

⏳ TODO

  • Release the inference code. (done; see Changelog)
  • Release the training data.

⚙️ Setup

conda create -n deadiff python=3.9.2
conda activate deadiff
conda install pytorch==2.0.0 torchvision==0.15.0 torchaudio==2.0.0 pytorch-cuda=11.8 -c pytorch -c nvidia
pip install git+https://github.com/salesforce/LAVIS.git@20230801-blip-diffusion-edit
pip install -r requirements.txt
pip install -e .
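
After installation, a quick sanity check (a minimal sketch, not part of this repository; the filename check_env.py is only illustrative) can confirm that the PyTorch 2.0.0 build and the CUDA 11.8 runtime requested above are visible from the new environment:

# check_env.py -- minimal environment sanity check (illustrative, not part of this repo)
import torch
import torchvision

print("torch:", torch.__version__)               # expected: 2.0.0
print("torchvision:", torchvision.__version__)   # expected: 0.15.0
print("CUDA available:", torch.cuda.is_available())
if torch.cuda.is_available():
    print("device:", torch.cuda.get_device_name(0))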

💫 Inference

  1. Download the pretrained model from Hugging Face and put it under ./pretrained/.
  2. Run the following command in a terminal.
python3 scripts/app.py

The Gradio app lets you transfer the style of a reference image onto images generated from your text prompts; try it out to explore the available options.
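
If you prefer to fetch the checkpoint from step 1 programmatically, the sketch below uses huggingface_hub. It is only an illustration: the repository id and checkpoint filename are placeholders and must be replaced with the actual values from the DEADiff model page on Hugging Face.

# download_ckpt.py -- illustrative sketch only; the repo id and filename below
# are placeholders, not official identifiers. Replace them with the values
# shown on the DEADiff model page on Hugging Face.
import os
from huggingface_hub import hf_hub_download

os.makedirs("pretrained", exist_ok=True)
ckpt_path = hf_hub_download(
    repo_id="<deadiff-model-repo>",    # placeholder repo id
    filename="<deadiff-checkpoint>",   # placeholder checkpoint filename
    local_dir="pretrained",            # matches the ./pretrained/ layout above
)
print("checkpoint saved to:", ckpt_path)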

Prompt: "A curly-haired boy"

Prompt: "A robot"

Prompt: "A motorcycle"

📢 Disclaimer

We developed this repository for RESEARCH purposes, so it may only be used for personal, research, or other non-commercial purposes.


✈️ Citation

@article{qi2024deadiff,
  title={DEADiff: An Efficient Stylization Diffusion Model with Disentangled Representations},
  author={Qi, Tianhao and Fang, Shancheng and Wu, Yanze and Xie, Hongtao and Liu, Jiawei and Chen, Lang and He, Qian and Zhang, Yongdong},
  journal={arXiv preprint arXiv:2403.06951},
  year={2024}
}

📭 Contact

If you have any comments or questions, feel free to contact qth@mail.ustc.edu.cn.
