
[df-if II] add additional input checks to ensure the input is divisible by 8 #7844

Open · wants to merge 11 commits into base: main
Conversation

@bghira (Contributor) commented May 2, 2024

What does this PR do?

Fixes #7842

Adds logic to check_inputs for the IF SuperResolution pipeline so that the user receives a clear error when running the pipeline with input image sizes that are not divisible by 8.

This is easy to hit when using the super-resolution model to upscale evaluation images during training: if, e.g., a target resolution around 256px is aligned to 8px intervals and then divided by 4 to obtain the input image size, the stage II output resolution will be fine, but the input resolution can end up not divisible by 8.
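
For instance (the numbers here are hypothetical, just matching the scenario above):

# A target resolution aligned to 8px intervals, divided by 4 to get the
# stage II input size, can produce an input that is not divisible by 8.
target = 248              # aligned to 8px: 248 % 8 == 0
input_size = target // 4  # 62
print(input_size % 8)     # 6 -> not divisible by 8, stage II fails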

I suppose there are other ways to hit the problem, but it has always been a bit murky which input is causing it.

Before submitting

Who can review?

@yiyixuxu (Collaborator) commented May 3, 2024

ohh thanks for looking into this!
What we usually do with image inputs is resize them in the preprocessing step to a default height and width that are divisible by 8:

def get_default_height_width(

so I think instead of adding the checks, we should just resize the image. We can either add the resize step to the preprocess_image method of the IF pipeline, or refactor the method to use the VaeImageProcessor like we do in the rest of the pipelines:

self.image_processor = VaeImageProcessor(vae_scale_factor=self.vae_scale_factor)

what do you think?
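
(For illustration, a minimal sketch of the rounding-then-resizing step being described; round_down_to_multiple and resize_to_multiple are hypothetical helpers, not the actual diffusers implementation.)

import PIL.Image

def round_down_to_multiple(value: int, multiple: int = 8) -> int:
    # Round a dimension down to the nearest multiple (e.g. 62 -> 56).
    return value - value % multiple

def resize_to_multiple(image: PIL.Image.Image, multiple: int = 8) -> PIL.Image.Image:
    # Resize so both dimensions are divisible by `multiple`.
    width, height = image.size
    return image.resize(
        (round_down_to_multiple(width, multiple), round_down_to_multiple(height, multiple))
    )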

@bghira (Contributor, Author) commented May 3, 2024

i considered it, but because of the nature of this, i didn't feel comfortable just squishing images on the user's behalf. at the small input resolutions here it can noticeably distort the image, whereas with SD and SDXL at 512/768/1024px adjusting the size is far less destructive. @yiyixuxu how do you feel about that mindset applied to a 64px model, where we might end up adjusting by somewhere around ~5-7% of the image size?
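
(for scale, a quick back-of-the-envelope check; snapping a dimension to the nearest multiple of 8 shifts it by at most 4px)

# Relative worst-case distortion from snapping a dimension to a
# multiple of 8 (nearest-multiple rounding, at most 4px of shift):
for size in (64, 512, 768, 1024):
    print(f"{size}px: up to {4 / size:.1%} per dimension")
# 64px: up to 6.2% per dimension
# 512px: up to 0.8% per dimension
# 768px: up to 0.5% per dimension
# 1024px: up to 0.4% per dimension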

@yiyixuxu (Collaborator) left a comment


thanks!

@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

@bghira (Contributor, Author) commented May 7, 2024

@yiyixuxu i noticed the quality checks failed because of an "unnecessary list comprehension". but when i look at it, it seems like the most reasonable way to do it? is there a better way? i would love to learn 😁

@yiyixuxu (Collaborator) commented May 8, 2024

can you run make style again?

@bghira (Contributor, Author) commented May 11, 2024

@yiyixuxu done

@@ -543,12 +543,27 @@ def check_inputs(

if isinstance(image, list):
    image_batch_size = len(image)
    # Check that each image is the same size:
Collaborator:
I think it is better to do this in a separate code block: keep this section as it is to check image_batch_size,

and then

if isinstance(image, list):
    check_image_size = image[0]
else:
    check_image_size = image

if isinstance(check_image_size, PIL.Image.Image):
    image_size = check_image_size.size
elif isinstance(check_image_size, torch.Tensor):
    image_size = check_image_size.shape[2:]
elif isinstance(check_image_size, np.ndarray):
    image_size = check_image_size.shape[:2]

if image_size ....:
    raise ValueError(...)

The current code does not work for lists of arrays or tensors.
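
(For reference, a runnable version of this sketch with the elided condition filled in from the PR description's divisible-by-8 requirement; the function name and error message wording are illustrative, not the actual diff.)

import numpy as np
import PIL.Image
import torch

def check_image_size_divisible_by_8(image) -> None:
    # Inspect the first image if a list was passed (all assumed equal-sized).
    check_image_size = image[0] if isinstance(image, list) else image

    if isinstance(check_image_size, PIL.Image.Image):
        image_size = check_image_size.size       # (width, height)
    elif isinstance(check_image_size, torch.Tensor):
        image_size = check_image_size.shape[2:]  # (batch, channels, height, width)
    elif isinstance(check_image_size, np.ndarray):
        image_size = check_image_size.shape[:2]  # (height, width, channels)
    else:
        raise TypeError(f"unsupported image type: {type(check_image_size)}")

    if any(dim % 8 != 0 for dim in image_size):
        raise ValueError(f"image sizes must be divisible by 8, but got {tuple(image_size)}")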

image = floats_tensor((1, 3, 31, 31), rng=random.Random(0)).to(torch_device)
generator = torch.Generator(device="cpu").manual_seed(0)
with self.assertRaises(ValueError):
    self.pipeline(
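
(The call above is truncated in this view; a hypothetical completion might look like the following, with keyword arguments assumed from the IF super-resolution pipeline's usual call signature rather than taken from the actual diff.)

with self.assertRaises(ValueError):
    self.pipeline(
        prompt="a photo",
        image=image,  # 31x31 input: not divisible by 8
        generator=generator,
        num_inference_steps=2,
        output_type="np",
    )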
Collaborator:
can we make sure this test works?

Contributor (Author):

i can't run the test suite locally; i was waiting for it to run in the workflow here

Collaborator:

oh, so these are the tests that failed: https://github.com/huggingface/diffusers/actions/runs/9044175339/job/24855064736#step:7:15620
I can trigger them again now, but I think the results would be the same.

Linked issue (#7842): deepfloyd stage 2 crashes with tensor size mismatch when input image size is not divisible by 8