
[Stable Diffusion Inpainting] Allow standard text-to-img checkpoints to be useable for SD inpainting #3533

Merged
merged 13 commits into main on May 26, 2023

Conversation

patrickvonplaten
Contributor

@patrickvonplaten patrickvonplaten commented May 23, 2023

This PR allows using StableDiffusionControlNetInpaintPipeline and StableDiffusionInpaintPipeline with both inpainting models and normal text-to-image models. This serves two purposes:

1.)

We can completely remove **StableDiffusionInpaintLegacyPipeline**. People still want to use one and the same set of model weights for text2img, img2img, and inpainting, so we need to support the following scenario:

```python
from diffusers import (
    StableDiffusionPipeline,
    StableDiffusionImg2ImgPipeline,
    StableDiffusionInpaintPipeline,
)

text2img = StableDiffusionPipeline.from_pretrained("runwayml/stable-diffusion-v1-5")
img2img = StableDiffusionImg2ImgPipeline(**text2img.components)
inpaint = StableDiffusionInpaintPipeline(**text2img.components)
```
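To make the weight sharing explicit, here is a minimal pure-Python sketch of the `components` pattern: the property exposes a pipeline's loaded modules as a dict, so sibling pipelines can be constructed without loading the weights twice. `TinyPipeline` and its attributes are hypothetical stand-ins, not the real diffusers API.

```python
# Minimal sketch of the `components` pattern. `TinyPipeline` is a hypothetical
# stand-in, not the real diffusers DiffusionPipeline.
class TinyPipeline:
    def __init__(self, unet, vae, scheduler):
        self.unet, self.vae, self.scheduler = unet, vae, scheduler

    @property
    def components(self):
        # Expose the loaded modules as a dict keyed by constructor argument.
        return {"unet": self.unet, "vae": self.vae, "scheduler": self.scheduler}


text2img = TinyPipeline(unet=object(), vae=object(), scheduler=object())
inpaint = TinyPipeline(**text2img.components)

# Both pipelines now hold the very same module objects in memory.
print(inpaint.unet is text2img.unet)  # True
```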

It's also stated in our official docs, here:
https://huggingface.co/docs/diffusers/main/en/api/diffusion_pipeline#diffusers.DiffusionPipeline.components.example

With this PR, the standard inpainting pipeline is augmented to allow inpainting with sd-v1-5 (the text2img checkpoint, not the dedicated inpainting one), which I think removes some barriers. It was also very much a requested feature.

2.)

We need this PR to fully support the ControlNet inpainting model: https://huggingface.co/lllyasviel/control_v11p_sd15_inpaint

This checkpoint works very well with the new inpaint pipeline:

```python
# !pip install transformers accelerate
from diffusers import StableDiffusionControlNetInpaintPipeline, ControlNetModel, DDIMScheduler
from diffusers.utils import load_image
import numpy as np
import torch

init_image = load_image(
    "https://huggingface.co/datasets/diffusers/test-arrays/resolve/main/stable_diffusion_inpaint/boy.png"
)
init_image = init_image.resize((512, 512))

generator = torch.Generator(device="cpu").manual_seed(1)

mask_image = load_image(
    "https://huggingface.co/datasets/diffusers/test-arrays/resolve/main/stable_diffusion_inpaint/boy_mask.png"
)
mask_image = mask_image.resize((512, 512))


def make_inpaint_condition(image, image_mask):
    image = np.array(image.convert("RGB")).astype(np.float32) / 255.0
    image_mask = np.array(image_mask.convert("L")).astype(np.float32) / 255.0

    # Compare height *and* width, not just the first dimension.
    assert image.shape[:2] == image_mask.shape[:2], "image and image_mask must have the same image size"
    image[image_mask > 0.5] = -1.0  # set as masked pixel
    image = np.expand_dims(image, 0).transpose(0, 3, 1, 2)
    image = torch.from_numpy(image)
    return image


control_image = make_inpaint_condition(init_image, mask_image)

controlnet = ControlNetModel.from_pretrained(
    "lllyasviel/control_v11p_sd15_inpaint", torch_dtype=torch.float16
)
pipe = StableDiffusionControlNetInpaintPipeline.from_pretrained(
    "runwayml/stable-diffusion-v1-5", controlnet=controlnet, torch_dtype=torch.float16
)

# speed up the diffusion process with a faster scheduler and memory optimization
pipe.scheduler = DDIMScheduler.from_config(pipe.scheduler.config)
pipe.enable_model_cpu_offload()

# generate image
image = pipe(
    "a handsome man with ray-ban sunglasses",
    num_inference_steps=20,
    generator=generator,
    guidance_scale=9.0,
    eta=1.0,
    image=init_image,
    mask_image=mask_image,
    control_image=control_image,
).images[0]
```
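As a quick sanity check of the conditioning convention used by `make_inpaint_condition` above (masked pixels are set to -1.0 so the ControlNet can tell them apart from valid content in [0, 1]), the same logic can be replicated in pure Python on a flat pixel list. `mark_masked` is an illustrative helper, not part of diffusers.

```python
def mark_masked(pixels, mask, threshold=0.5):
    """Replicates `image[image_mask > 0.5] = -1.0` on flat lists of floats."""
    assert len(pixels) == len(mask), "image and mask must have the same size"
    return [-1.0 if m > threshold else p for p, m in zip(pixels, mask)]


image = [0.0, 0.25, 0.5, 1.0]  # normalized pixel values in [0, 1]
mask = [0.0, 1.0, 0.4, 0.9]    # mask values; > 0.5 means "inpaint here"
print(mark_masked(image, mask))  # [0.0, -1.0, 0.5, -1.0]
```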

[image: inpainting input]

becomes:

[image: inpainted output]

I also played around with SAM to generate masks and then used our SAM ControlNet model: https://colab.research.google.com/drive/1D6mBtne_m-3E9R-cl_ZCRY87Fq1OjkyI?usp=sharing

cc @sayakpaul, this could be useful for a diffusers tool, but I had very limited success here tbh: the model doesn't seem to work super well. It seems we either need a ControlNet checkpoint trained purely for inpainting or have to use the 9-channel inpainting models.
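For reference, the 9-channel figure comes from what a dedicated inpainting UNet concatenates at its input. A quick sketch of the channel arithmetic, assuming the standard SD latent layout (this is background knowledge, not read from this PR's code):

```python
# Why dedicated SD inpainting checkpoints have a 9-channel UNet input:
# the noisy latents, the downsampled mask, and the VAE-encoded masked
# image are concatenated along the channel dimension.
latent_channels = 4               # noisy latents
mask_channels = 1                 # binary mask, downsampled to latent size
masked_image_latent_channels = 4  # VAE encoding of the masked image

inpaint_unet_in_channels = (
    latent_channels + mask_channels + masked_image_latent_channels
)
print(inpaint_unet_in_channels)  # 9

# A plain text2img UNet only expects the 4 latent channels, which is why
# this PR has to special-case which inputs get concatenated.
text2img_unet_in_channels = latent_channels
print(text2img_unet_in_channels)  # 4
```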

@HuggingFaceDocBuilderDev

HuggingFaceDocBuilderDev commented May 23, 2023

The documentation is not available anymore as the PR was closed or merged.

@wangdong2023

nice, was looking for this functionality yesterday.

```diff
@@ -137,6 +136,13 @@ def __init__(
 ):
     super().__init__()

     deprecation_message = (
```
patrickvonplaten (Contributor, Author) commented on the diff:

Let's fully deprecate this class. The "normal" inpaint pipeline now has all the features that are needed.

@patrickvonplaten
Contributor Author

patrickvonplaten commented May 25, 2023

Will update README of: https://huggingface.co/lllyasviel/control_v11p_sd15_inpaint once this PR is merged

@williamberman (Contributor) left a comment:

lgtm!

@patrickvonplaten patrickvonplaten merged commit d114d80 into main May 26, 2023
9 checks passed
@patrickvonplaten patrickvonplaten deleted the add_default_sd_to_inpaint branch May 26, 2023 08:47
rupertmenneer pushed a commit to rupertmenneer/diffusers that referenced this pull request May 26, 2023
…e_sigma when pure noise

updated this commit w.r.t the latest merge here: huggingface#3533
patrickvonplaten added a commit that referenced this pull request May 30, 2023
* Throw error if strength adjusted num_inference_steps < 1

* Added new fast test to check ValueError raised when num_inference_steps < 1

when strength adjusts the num_inference_steps then the inpainting pipeline should fail

* fix #3487 initial latents are now only scaled by init_noise_sigma when pure noise

updated this commit w.r.t the latest merge here: #3533

* fix

---------

Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
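The strength/num_inference_steps failure mode the commit above guards against can be sketched in pure Python. `check_steps` is illustrative, not the exact diffusers code; the real pipelines compute this when slicing the scheduler's timesteps.

```python
def check_steps(num_inference_steps, strength):
    # img2img/inpaint only run the last `num_inference_steps * strength`
    # steps of the schedule; if that rounds down to zero, nothing denoises
    # and the pipeline should fail loudly instead of returning noise.
    init_timestep = min(int(num_inference_steps * strength), num_inference_steps)
    if init_timestep < 1:
        raise ValueError(
            f"num_inference_steps={num_inference_steps} with strength={strength} "
            "leaves no denoising steps to run; increase one of the two."
        )
    return init_timestep


print(check_steps(20, 0.5))  # 10 steps actually run
# check_steps(20, 0.04) raises ValueError: int(20 * 0.04) = 0 steps
```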
@rb-synth

Looking at the example, the control image is just the masked original. How can I use e.g., normalbae with inpainting?

@ksai2324

I tried using the pipeline with lllyasviel/sd-controlnet-canny, but it didn't work: it just produced random noise in the masked part. I'm still trying to understand why.
@rb-synth: did you manage to run it with normalbae?

yoonseokjin pushed a commit to yoonseokjin/diffusers that referenced this pull request Dec 25, 2023
…to be useable for SD inpainting (huggingface#3533)

* Add default to inpaint

* Make sure controlnet also works with normal sd for inpaint

* Add tests

* improve

* Correct encode images function

* Correct inpaint controlnet

* Improve text2img inpaint

* make style

* up

* up

* up

* up

* fix more