SPDiffusion: Semantic Protection Diffusion Models for Multi-concept Text-to-image Generation
Yang Zhang, Rui Zhang, Xuecheng Nie, Haochen Li, Jikun Chen, Yifan Hao, Xin Zhang, Luoqi Liu, Ling Li
This paper proposes a unified approach to address the challenges of improper attribute binding and concept entanglement. We introduce a novel method, SPDiffusion, which detects concept regions from both cross- and self-attention maps, while safeguarding these regions from interference by irrelevant tokens.
For technical details, please refer to our paper.
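As a rough illustration of the region-detection idea described above (a hypothetical toy sketch, not the actual implementation, which operates on attention maps inside the diffusion U-Net): given a concept token's attention map, locations whose attention weight exceeds a threshold can be marked as that concept's region.

```python
def detect_region(attn_map, threshold):
    # Hypothetical sketch: attn_map is a 2D grid of attention weights in [0, 1]
    # for one concept token; keep locations above the threshold as its region.
    return [[v >= threshold for v in row] for row in attn_map]

# A high threshold keeps only the locations most strongly tied to the concept.
region = detect_region([[0.95, 0.2], [0.4, 0.92]], 0.9)
# region == [[True, False], [False, True]]
```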
torch>=2.0.1
diffusers>=0.29.0
stanza==1.8.2
nltk==3.8.1
from SPD_Pipeline import SPDiffusionPipeline
import torch
pipe = SPDiffusionPipeline.from_pretrained("SG161222/RealVisXL_V4.0").to("cuda")
generator = torch.Generator(device="cuda").manual_seed(2048)
image = pipe(
    "A red book and a yellow vase",
    run_sdxl=True,
    generator=generator,
    cross_threshold=0.9,
    self_threshold=0.1,
).images[0]
image.save("result.png")

The parameters for the SPDiffusion pipeline are as follows:
- `prompt`: text prompt for generation
- `cross_threshold`: threshold value for the cross-attention map
- `self_threshold`: threshold value for the self-attention maps
- `t_step`: layout-keeping and SP-Extraction steps
- `filter_loc`: layers used for SP-Extraction
- `run_sdxl`: also generate the original SDXL image
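To give intuition for the "protection" part of the method (a hypothetical toy sketch under the assumption that protection amounts to suppressing attention from irrelevant tokens inside a detected concept region; the real method works inside the attention layers of the diffusion model):

```python
def protect(scores, region_mask, irrelevant_tokens):
    # Hypothetical sketch: scores[pixel][token] are attention weights.
    # Inside a protected region (region_mask[pixel] is True), zero out the
    # weights of tokens belonging to other concepts so they cannot interfere.
    out = [row[:] for row in scores]
    for p, inside in enumerate(region_mask):
        if inside:
            for t in irrelevant_tokens:
                out[p][t] = 0.0
    return out

# Pixel 0 lies in the protected region, so token 1's influence is removed there;
# pixel 1 is outside the region and keeps its original scores.
masked = protect([[0.5, 0.5], [0.3, 0.7]], [True, False], [1])
# masked == [[0.5, 0.0], [0.3, 0.7]]
```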
This project builds upon valuable work and resources from the following repositories:
We extend our sincere thanks to the creators of these projects for their contributions to the field and for making their code publicly available. 🙌

