vq diffusion classifier free sampling #1294

williamberman · 2022-11-15T07:23:42Z

Adds classifier-free sampling to VQ Diffusion. This results in significantly better image quality (see the comparison captions below).

The pipeline's guidance_scale now defaults to 5.0.
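For readers unfamiliar with the technique, the following is a minimal sketch of how a guidance scale is commonly applied in classifier-free sampling over discrete classes. It assumes the guidance is applied to per-class log-probabilities and then renormalized; the function name `apply_classifier_free_guidance` and the toy distributions are hypothetical and are not taken from this PR.

```python
import math


def apply_classifier_free_guidance(log_p_cond, log_p_uncond, guidance_scale=5.0):
    """Combine conditional and unconditional log-probabilities for one latent position."""
    # Extrapolate from the unconditional prediction towards the conditional one.
    guided = [u + guidance_scale * (c - u) for c, u in zip(log_p_cond, log_p_uncond)]
    # Renormalize with a numerically stable log-sum-exp so the result is a distribution again.
    peak = max(guided)
    log_z = peak + math.log(sum(math.exp(g - peak) for g in guided))
    return [g - log_z for g in guided]


# Toy distributions over three codebook classes (illustrative values only).
log_p_cond = [math.log(p) for p in (0.7, 0.2, 0.1)]
log_p_uncond = [math.log(p) for p in (0.4, 0.3, 0.3)]

guided = apply_classifier_free_guidance(log_p_cond, log_p_uncond)
probs = [math.exp(g) for g in guided]
unguided = apply_classifier_free_guidance(log_p_cond, log_p_uncond, guidance_scale=1.0)
```

With a scale of 1.0 the sketch returns the conditional distribution unchanged; larger scales push more probability mass towards the classes the text-conditioned prediction prefers.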

Additionally, the model trained on the ITHQ dataset uses a learned parameter for the classifier-free embeddings. The conversion script is modified to add this parameter to the ported model, so the weights will have to be re-uploaded.
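As a rough illustration of the distinction above, the sketch below chooses the unconditional embeddings for a batch. It assumes that a checkpoint without a learned parameter falls back to encoding an empty prompt, which is a common convention but is not stated in this description. The names `unconditional_embeddings` and `fake_encoder` are hypothetical, and plain lists stand in for tensors.

```python
from typing import Callable, List, Optional

Embedding = List[List[float]]  # one prompt: (sequence length, hidden size)


def unconditional_embeddings(
    batch_size: int,
    learned: Optional[Embedding],
    encode_prompt: Callable[[str], Embedding],
) -> List[Embedding]:
    if learned is not None:
        # Learned case: copy the same stored embedding for every prompt in the batch.
        return [[row[:] for row in learned] for _ in range(batch_size)]
    # Assumed fallback: encode an empty prompt once per batch entry.
    return [encode_prompt("") for _ in range(batch_size)]


seen_prompts: List[str] = []


def fake_encoder(prompt: str) -> Embedding:
    # Hypothetical stand-in for a text encoder; records the prompt it was given.
    seen_prompts.append(prompt)
    return [[1.0] * 3 for _ in range(2)]


learned = [[0.0] * 3 for _ in range(2)]  # placeholder values for a learned parameter
from_learned = unconditional_embeddings(4, learned, fake_encoder)
from_empty = unconditional_embeddings(4, None, fake_encoder)
```

In the learned case the text encoder is never called for the unconditional branch, which is why the parameter has to ship with the ported weights.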

Prompts: "teddy bear playing in the pool" and "horse"

Diffusers VQ diffusion with classifier free sampling

Diffusers VQ diffusion without classifier free sampling

Original VQ diffusion implementation with classifier free sampling

Original VQ diffusion implementation without classifier free sampling

HuggingFaceDocBuilderDev · 2022-11-15T07:26:36Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint.

williamberman · 2022-11-16T02:59:07Z

tests/pipelines/vq_diffusion/test_vq_diffusion.py

+            "https://huggingface.co/datasets/williamberman/misc/resolve/main"
+            "/vq_diffusion/teddy_bear_pool_classifier_free_sampling.png"


This should be moved to the Hugging Face testing dataset. FWIW, you might also have to regenerate the image, because I get a different image on VQDiffusionPipelineIntegrationTests#test_vq_diffusion.

Very cool :-) I'll move it!

williamberman · 2022-11-16T03:02:58Z

src/diffusers/models/embeddings.py

+class LearnedClassifierFreeSamplingEmbeddings(ModelMixin, ConfigMixin):
+    """
+    Utility class for storing learned text embeddings for classifier free sampling
+    """
+
+    @register_to_config
+    def __init__(self, learnable: bool, hidden_size: Optional[int] = None, length: Optional[int] = None):
+        super().__init__()
+
+        self.learnable = learnable
+
+        if self.learnable:
+            assert hidden_size is not None, "learnable=True requires `hidden_size` to be set"
+            assert length is not None, "learnable=True requires `length` to be set"
+
+            embeddings = torch.zeros(length, hidden_size)
+        else:
+            embeddings = None
+
+        self.embeddings = nn.Parameter(embeddings)


I'm not sure this is the preferred way to add the learned embeddings to the pipeline. An alternative might be to add the additional vector to the scheduler instead.

It's very model-specific, so I'm moving it into the pipeline directly :-)
I think that's a bit cleaner! The model works much better now, though. Thanks!

patrickvonplaten · 2022-11-16T15:44:58Z

src/diffusers/pipelines/vq_diffusion/pipeline_vq_diffusion.py

@@ -64,6 +65,7 @@ def __init__(
        tokenizer: CLIPTokenizer,
        transformer: Transformer2DModel,
        scheduler: VQDiffusionScheduler,
+        learned_classifier_free_sampling_embeddings: LearnedClassifierFreeSamplingEmbeddings,


That's definitely the right way to do it - it's quite specific to vq-diffusion IMO though, so will move it here :-)

patrickvonplaten · 2022-11-16T18:02:08Z

Very nice job @williamberman !

* vq diffusion classifier free sampling
* correct
* uP

Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

williamberman marked this pull request as draft November 15, 2022 07:28

williamberman force-pushed the will/vq-diffusion-classifier-free-guidance branch from d0d5beb to 4ee1e06 Compare November 15, 2022 21:01

williamberman force-pushed the will/vq-diffusion-classifier-free-guidance branch from 4ee1e06 to cac658d Compare November 15, 2022 22:19

williamberman force-pushed the will/vq-diffusion-classifier-free-guidance branch from cac658d to f5fbbbe Compare November 15, 2022 22:27

williamberman force-pushed the will/vq-diffusion-classifier-free-guidance branch from f5fbbbe to 8746de8 Compare November 15, 2022 22:52

williamberman force-pushed the will/vq-diffusion-classifier-free-guidance branch from 8746de8 to fe8db41 Compare November 16, 2022 00:04

williamberman force-pushed the will/vq-diffusion-classifier-free-guidance branch from fe8db41 to bfc4459 Compare November 16, 2022 00:13

williamberman force-pushed the will/vq-diffusion-classifier-free-guidance branch from bfc4459 to 40dc3ff Compare November 16, 2022 00:28

williamberman force-pushed the will/vq-diffusion-classifier-free-guidance branch from 40dc3ff to 10e2ea3 Compare November 16, 2022 01:17

vq diffusion classifier free sampling

08984ab

williamberman force-pushed the will/vq-diffusion-classifier-free-guidance branch from 10e2ea3 to 08984ab Compare November 16, 2022 01:29

williamberman marked this pull request as ready for review November 16, 2022 02:59

williamberman changed the title ~~[wip] vq diffusion classifier free sampling~~ vq diffusion classifier free sampling Nov 16, 2022

patrickvonplaten reviewed Nov 16, 2022

correct

d1577c2

uP

dac2ece

patrickvonplaten merged commit f1fcfde into huggingface:main Nov 16, 2022

skirsten mentioned this pull request Nov 16, 2022

[Flax] Fix loading scheduler from subfolder #1319

Merged

yoonseokjin pushed a commit to yoonseokjin/diffusers that referenced this pull request Dec 25, 2023

vq diffusion classifier free sampling (huggingface#1294)

f396d7a

* vq diffusion classifier free sampling
* correct
* uP

Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
