
Frozen parameters in GaussianFourierProjection #166

Closed
vvvm23 opened this issue Aug 11, 2022 · 5 comments

Comments

@vvvm23
Contributor

vvvm23 commented Aug 11, 2022

Hi, I'm just a beginner with diffusion models and have been using your implementations as a reference. I have a question about the GaussianFourierProjection class.

Why is requires_grad set to False on the weight parameter? Doesn't this mean the noise-level embeddings won't be updated during training?

Thanks!
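For context, a minimal sketch of the kind of module in question (an approximation, assuming PyTorch, not the exact library code); the frozen weight is the part this issue asks about:

```python
import math
import torch
import torch.nn as nn

class GaussianFourierProjection(nn.Module):
    """Sketch of a Gaussian Fourier feature embedding for noise levels."""

    def __init__(self, embedding_size: int = 256, scale: float = 1.0):
        super().__init__()
        # Randomly initialised once, then never updated by the optimizer.
        self.weight = nn.Parameter(torch.randn(embedding_size) * scale, requires_grad=False)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (batch,) noise levels / timesteps -> (batch, 2 * embedding_size)
        x_proj = x[:, None] * self.weight[None, :] * 2 * math.pi
        return torch.cat([torch.sin(x_proj), torch.cos(x_proj)], dim=-1)
```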

@patil-suraj
Contributor

cc @patrickvonplaten

@patrickvonplaten
Contributor

Hey @vvvm23,

It's set to False because we don't want to train those parameters. I followed the implementation of the original model here: https://github.com/yang-song/score_sde_pytorch/blob/1618ddea340f3e4a2ed7852a0694a809775cf8d0/models/layerspp.py#L37

Does this make sense?

@vvvm23
Contributor Author

vvvm23 commented Aug 26, 2022

Hi @patrickvonplaten

I somewhat misphrased my original question. I'm aware that setting requires_grad to False prevents that particular parameter from accumulating gradients, essentially stopping it from being trained.

But why would we not want to train the noise-level embeddings? Or is this just a simple, fixed (albeit randomly initialised) projection from a per-batch noise value into a different space, to which some learned transformation is applied later?

Thanks!
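For reference, a hypothetical sketch of the pattern described above: a fixed random projection followed by a learned MLP. The names TimeEmbedding and hidden_dim are illustrative, not the library's API.

```python
import math
import torch
import torch.nn as nn

class TimeEmbedding(nn.Module):
    """Hypothetical time-embedding head: frozen Fourier features + learned MLP."""

    def __init__(self, embedding_size: int = 256, hidden_dim: int = 1024, scale: float = 1.0):
        super().__init__()
        # Fixed (randomly initialised, never trained) projection weights.
        self.weight = nn.Parameter(torch.randn(embedding_size) * scale, requires_grad=False)
        # Learned transformation applied on top of the fixed features.
        self.mlp = nn.Sequential(
            nn.Linear(2 * embedding_size, hidden_dim),
            nn.SiLU(),
            nn.Linear(hidden_dim, hidden_dim),
        )

    def forward(self, noise_level: torch.Tensor) -> torch.Tensor:
        # noise_level: (batch,) -> (batch, hidden_dim)
        x_proj = noise_level[:, None] * self.weight[None, :] * 2 * math.pi
        features = torch.cat([torch.sin(x_proj), torch.cos(x_proj)], dim=-1)
        return self.mlp(features)
```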

@patrickvonplaten
Contributor

Hey @vvvm23,

Sinusoidal position features like GaussianFourierProjection don't need training because every position already gets a distinctly different vector that the model can use as a "cue" to know which time position has been passed to it.

If one wants to train position embedding vectors (or time embedding vectors here), one can just randomly initialize such a vector and let the model learn it. If, however, we use sinusoidal embeddings, there is no need to learn them.
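A minimal sketch contrasting the two options (a learnable embedding table versus frozen random Fourier features); the variable names are made up for the example:

```python
import torch
import torch.nn as nn

embedding_size, num_timesteps = 256, 1000

# Option 1: a trainable table of time-embedding vectors, updated by the optimizer.
learned_emb = nn.Embedding(num_timesteps, embedding_size)

# Option 2: frozen random Fourier weights. Each noise level still maps to a
# distinct, stable vector without any training.
fixed_weight = nn.Parameter(torch.randn(embedding_size), requires_grad=False)

t = torch.tensor([10.0, 500.0])  # example noise levels / timesteps
fixed_features = torch.cat(
    [torch.sin(t[:, None] * fixed_weight), torch.cos(t[:, None] * fixed_weight)],
    dim=-1,
)
print(learned_emb(t.long()).shape)  # torch.Size([2, 256]), learned during training
print(fixed_features.shape)         # torch.Size([2, 512]), no training needed
```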

@vvvm23
Contributor Author

vvvm23 commented Aug 31, 2022

Okay, thank you @patrickvonplaten! That explanation makes a lot of sense~

vvvm23 closed this as completed on Aug 31, 2022