[refactor embeddings] pixart-alpha #6212
Conversation
The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.
    aspect_ratio, batch_size=batch_size, embedder=self.aspect_ratio_embedder
)
conditioning = timesteps_emb + torch.cat([resolution, aspect_ratio], dim=1)
resolution_emb = self.additional_condition_proj(resolution.flatten()).to(hidden_dtype)
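For context, a minimal self-contained sketch of the conditioning pattern in the quoted diff: embed the resolution and aspect-ratio values, concatenate them, and add the result to the timestep embedding. This is illustrative only; the real diffusers module uses sinusoidal projections and MLP embedders, and the layer sizes below are made up.

import torch
import torch.nn as nn


class SizeConditioningSketch(nn.Module):
    """Toy version of the resolution/aspect-ratio conditioning path."""

    def __init__(self, embedding_dim: int = 8):
        super().__init__()
        # Half of the embedding comes from (height, width), half from the aspect ratio.
        self.resolution_embedder = nn.Linear(2, embedding_dim // 2)
        self.aspect_ratio_embedder = nn.Linear(1, embedding_dim // 2)

    def forward(self, timesteps_emb, resolution, aspect_ratio):
        resolution = self.resolution_embedder(resolution)
        aspect_ratio = self.aspect_ratio_embedder(aspect_ratio)
        # Same width as timesteps_emb, so the two can simply be summed.
        return timesteps_emb + torch.cat([resolution, aspect_ratio], dim=1)


batch_size, embedding_dim = 2, 8
cond = SizeConditioningSketch(embedding_dim)
out = cond(
    torch.randn(batch_size, embedding_dim),           # timestep embedding
    torch.tensor([[512.0, 512.0], [768.0, 512.0]]),   # (height, width) per sample
    torch.tensor([[1.0], [1.5]]),                      # aspect ratio per sample
)
print(out.shape)  # torch.Size([2, 8])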
very nice refactor!
LGTM if the SLOW tests pass. Could you please run the slow tests with these changes as well?
if do_classifier_free_guidance:
    resolution = torch.cat([resolution, resolution], dim=0)
    aspect_ratio = torch.cat([aspect_ratio, aspect_ratio], dim=0)
Seems like a new addition?
not really - it gets duplicated later inside the embedding:
diffusers/src/diffusers/models/embeddings.py, line 758 in 6976cab:
    if size.shape[0] != batch_size:
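A minimal sketch of the duplication being referred to (an illustrative helper, not the actual diffusers code): if the size-condition tensor has fewer rows than the, possibly CFG-doubled, batch, it is repeated to match.

import torch


def broadcast_size_to_batch(size: torch.Tensor, batch_size: int) -> torch.Tensor:
    # Repeat a per-image size condition so it lines up with the current batch,
    # e.g. after the batch has been doubled for classifier-free guidance.
    if size.shape[0] != batch_size:
        if batch_size % size.shape[0] != 0:
            raise ValueError("size rows must evenly divide batch_size")
        size = size.repeat(batch_size // size.shape[0], 1)
    return size


resolution = torch.tensor([[512.0, 512.0]])           # one (height, width) row
print(broadcast_size_to_batch(resolution, 4).shape)   # torch.Size([4, 2])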
Very nice <3
@sayakpaul
fast tests fail because there are some randomly initialized weights in some components. I think we need to put torch.manual_seed(0) before creating each component, e.g.
vae = AutoencoderKL()
Should we open a new PR that only updates the tests, and I rebase this one after that? I'm not comfortable updating the tests directly from this PR since I also updated the code.
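A minimal sketch of that suggestion, assuming a small test helper that builds the dummy components (the helper name is made up; only the AutoencoderKL() call comes from the comment above):

import torch
from diffusers import AutoencoderKL


def get_dummy_vae():
    # Seed right before constructing the component so its randomly
    # initialized weights are deterministic across test runs.
    torch.manual_seed(0)
    return AutoencoderKL()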
But that wasn't the case before. Wonder what changed.
@@ -235,7 +235,7 @@ def __init__(
         self.caption_projection = None
         if caption_channels is not None:
-            self.caption_projection = CaptionProjection(in_features=caption_channels, hidden_size=inner_dim)
+            self.caption_projection = PixArtAlphaTextProjection(in_features=caption_channels, hidden_size=inner_dim)
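For readers unfamiliar with the module, a rough sketch of what a caption/text projection of this kind typically looks like: a small MLP mapping text-encoder features to the transformer's hidden size. Only the constructor signature (in_features, hidden_size) comes from the diff above; the internals here are assumptions.

import torch.nn as nn


class TextProjectionSketch(nn.Module):
    def __init__(self, in_features: int, hidden_size: int):
        super().__init__()
        # Project caption features (e.g. T5 hidden states) to the transformer width.
        self.linear_1 = nn.Linear(in_features, hidden_size)
        self.act_1 = nn.GELU(approximate="tanh")
        self.linear_2 = nn.Linear(hidden_size, hidden_size)

    def forward(self, caption):
        return self.linear_2(self.act_1(self.linear_1(caption)))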
Might actually be worth breaking Transformer2D up into a dedicated one for PixArt.
For a future PR, yeah? I am happy to work on it once this is merged.
Definitely for a future PR.
But I think we should refactor the transformers and UNets after we clean up all the lower-level classes, and make such decisions for all models/pipelines at once so it will be consistent.
Yes 100 percent!
pixart-alpha Co-authored-by: yiyixuxu <yixu310@gmail,com>
Part of my embedding refactor, separated by model/pipeline so it is easier to work with.
This PR focuses on the embeddings that are only used in PixArt-Alpha, i.e. CombinedTimestepSizeEmbeddings and CaptionProjection: they are renamed to PixArtAlphaCombinedTimestepSizeEmbeddings and PixArtAlphaTextProjection so it is clear that these embeddings are only used in PixArt-Alpha.
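Hypothetical illustration (not part of this PR): if backward compatibility were a concern, the old names could be kept as simple aliases of the renamed classes, e.g.

from diffusers.models.embeddings import (
    PixArtAlphaCombinedTimestepSizeEmbeddings,
    PixArtAlphaTextProjection,
)

# Old names kept importable as aliases so downstream code keeps working.
CombinedTimestepSizeEmbeddings = PixArtAlphaCombinedTimestepSizeEmbeddings
CaptionProjection = PixArtAlphaTextProjection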