Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Scribble-guided editing #8

Closed
wileewang opened this issue Jun 14, 2022 · 2 comments
Closed

Scribble-guided editing #8

wileewang opened this issue Jun 14, 2022 · 2 comments

Comments

@wileewang
Copy link

Hi! I wonder if a loss such as MSE or LPIPS is used between the user-provided scribbles and the scribbled regions of $\widehat{x}_0$ , in addition to the CLIP loss. I am curious how the shapes and colors stay consistent when only text with no specific description, e.g., "blanket" in Fig 9, is given.

@omriav
Copy link
Owner

omriav commented Jun 14, 2022

Hi,

Thank you for your interest in our work.
No - there is no need for MSE/LPIPS loss, the only signal for the scribbles comes from the partial nosing of the image (i.e. to noise the image to a certain noise level).
The shapes and the colors stay somewhat consistent because of the why the diffusion model operates - the initial stages generate a rough sketch of the image and the finer details are added later, so we can noise the image up to the point that preserves the colors/shapes. For more details please see Figure 32 in the paper.

@wileewang
Copy link
Author

I see. Thanks for your reminding.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants