[SDXL Flax] fix SDXL flax init by patrickvonplaten · Pull Request #5187 · huggingface/diffusers

patrickvonplaten · 2023-09-26T13:54:33Z

Make sure Flax init is correct for different model sizes

patrickvonplaten · 2023-09-26T13:56:38Z

+            is_refiner = 5 * self.config.addition_time_embed_dim + self.config.cross_attention_dim == self.config.projection_class_embeddings_input_dim
+            num_micro_conditions = 5 if is_refiner else 6
+
+            text_embeds_dim = self.config.projection_class_embeddings_input_dim - (num_micro_conditions * self.config.addition_time_embed_dim)


2816 - 6 * 256 = 1280 for base and 2560 - 5 * 256 = 1280 for refiner

pcuenca

Awesome! Thanks a lot! 🙌

patrickvonplaten · 2023-09-26T14:54:57Z

        )

+        # scale the initial noise by the standard deviation required by the scheduler
+        latents = latents * scheduler_state.init_noise_sigma


We need to set the init_noise_sigma atfter scaling

Oh yes, great catch!

patrickvonplaten · 2023-09-26T14:59:21Z

@pcuenca I'm now getting almost identical results on CPU with PyTorch vs. Flax for dummy inputs:

Flax

from diffusers import FlaxStableDiffusionXLPipeline
import numpy as np
import jax.numpy as jnp
import jax

path = "hf-internal-testing/tiny-stable-diffusion-xl-pipe"

pipe, params = FlaxStableDiffusionXLPipeline.from_pretrained(path)

prompt = "An astronaut riding a green horse on Mars"
negative_prompt = "ugly"
steps = 3

batch_size, height, width, ch = 1, 32, 32, 4
num_elems = batch_size * height * width * ch
rng = jax.random.PRNGKey(0)
latents = (jnp.arange(num_elems) / num_elems)[:, None, None, None].reshape(batch_size, ch, width, height)

print("latents", np.abs(np.asarray(latents)).sum())

prompt_embeds = pipe.prepare_inputs(prompt)
neg_prompt_ids = pipe.prepare_inputs(negative_prompt)

image = pipe(prompt_embeds, params, rng, neg_prompt_ids=neg_prompt_ids, latents=latents, num_inference_steps=3, output_type="np").images[0]

print(np.abs(np.asarray(image)).sum())

PT

import torch
import numpy as np
from diffusers import StableDiffusionXLPipeline

path = "hf-internal-testing/tiny-stable-diffusion-xl-pipe"

pipe = StableDiffusionXLPipeline.from_pretrained(path)
pipe.unet.set_default_attn_processor()

prompt = "An astronaut riding a green horse on Mars"
neg_prompt = "ugly"
steps = 3

batch_size, height, width, ch = 1, 32, 32, 4
num_elems = batch_size * height * width * ch
latents = (torch.arange(num_elems) / num_elems)[:, None, None, None].reshape(batch_size, ch, width, height)
print("latents", latents.abs().sum())

image = pipe(prompt, negative_prompt=neg_prompt, latents=latents, num_inference_steps=3, output_type="np", guidance_scale=7.5).images[0]

print(np.abs(image).sum())

Getting:
PT: 6237.967
Flax: 6237.9585

* fix SDXL flax init * finish * Fix

fix SDXL flax init

234600c

patrickvonplaten commented Sep 26, 2023

View reviewed changes

patrickvonplaten requested a review from pcuenca September 26, 2023 13:56

finish

2fedbbf

pcuenca approved these changes Sep 26, 2023

View reviewed changes

Fix

08a9dad

patrickvonplaten commented Sep 26, 2023

View reviewed changes

patrickvonplaten merged commit c82f7ba into main Sep 26, 2023

patrickvonplaten deleted the fix_sdxl_flax_init branch September 26, 2023 17:55

yoonseokjin pushed a commit to yoonseokjin/diffusers that referenced this pull request Dec 25, 2023

[SDXL Flax] fix SDXL flax init (huggingface#5187)

c0eca81

* fix SDXL flax init * finish * Fix

AmericanPresidentJimmyCarter pushed a commit to AmericanPresidentJimmyCarter/diffusers that referenced this pull request Apr 26, 2024

[SDXL Flax] fix SDXL flax init (huggingface#5187)

ba36c75

* fix SDXL flax init * finish * Fix

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[SDXL Flax] fix SDXL flax init#5187

[SDXL Flax] fix SDXL flax init#5187
patrickvonplaten merged 3 commits into
mainfrom
fix_sdxl_flax_init

patrickvonplaten commented Sep 26, 2023 •

edited

Loading

Uh oh!

patrickvonplaten Sep 26, 2023

Uh oh!

pcuenca left a comment

Uh oh!

patrickvonplaten Sep 26, 2023

Uh oh!

pcuenca Sep 26, 2023

Uh oh!

patrickvonplaten commented Sep 26, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

patrickvonplaten commented Sep 26, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

patrickvonplaten Sep 26, 2023

Choose a reason for hiding this comment

Uh oh!

pcuenca left a comment

Choose a reason for hiding this comment

Uh oh!

patrickvonplaten Sep 26, 2023

Choose a reason for hiding this comment

Uh oh!

pcuenca Sep 26, 2023

Choose a reason for hiding this comment

Uh oh!

patrickvonplaten commented Sep 26, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

patrickvonplaten commented Sep 26, 2023 •

edited

Loading