Skip to content

babahui/Progressive-Text-to-Image

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 
 
 

Repository files navigation

Python >=3.5 PyTorch >=1.0

Progressive Text-to-Image Diffusion with Soft Latent Direction

The official repository for Progressive Text-to-Image Diffusion with Soft Latent Direction.

News

  • 2023.09 Code will be released coming soon.

Progressive Text-to-Image

framework

Existing text-to-image synthesis approaches struggle with textual prompts involving multiple entities and specified relational directions. We propose to decompose the protracted prompt into a set of short commands, including synthesis, editing and erasing operations, using a Large Language Model (LLM) and progressively generate the image. Our strategy enhances both controllability and fidelity and allows for interactive modifications from user interference at each generation step.

Pipeline

framework

Overview of our unified framework emphasizes progressive synthesis, editing, and erasing. In each progressive step, A random latent (z_t) is directed through the cross-attention map in inverse diffusion. Specifically, we design a soft stimulus loss that evaluates the positional difference between entity attention and the target mask region, leading to a gradient for updating the latent (z_{t-1}^{}) as a latent response. Subsequentially, another forward diffusion pass is applied to denoise (z^{t}), yielding deriving (z^{*}{t-1}). In the latent fusion phase, we transform the previous (i)-th image into a latent code (z^{bg}{t-1}) using DDIM inversion. The blending of (z^{*}{t-1}) with (z^{bg}{t-1}) incorporates a dynamic evolving mask, which starts with a layout box and gradually shifts to cross-attention. Finally, (z^{*}{t-1}) undergoes multiple diffusion reverse steps and results in the ((i+1))-th image.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published