
Regarding L2 norm clamping in Diffusion Prior #68

Closed
xiankgx opened this issue May 6, 2022 · 8 comments

@xiankgx

xiankgx commented May 6, 2022

Why do we clamp only during sampling and not during training? Shouldn't they match? Please enlighten me.

https://github.com/lucidrains/DALLE2-pytorch/blob/main/dalle2_pytorch/dalle2_pytorch.py#L843-L844

https://github.com/lucidrains/DALLE2-pytorch/blob/main/dalle2_pytorch/dalle2_pytorch.py#L859-L860

https://github.com/lucidrains/DALLE2-pytorch/blob/main/dalle2_pytorch/dalle2_pytorch.py#L885-L900
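To make the question concrete, here is a minimal sketch of the pattern (the names `image_embed_scale`, `clamp_pred_x0`, and `training_loss` are illustrative, not the repo's exact code): the predicted embedding is projected back onto a fixed-norm sphere only inside the sampling loop, while the training loss is computed on the raw prediction.

```python
import torch.nn.functional as F

def l2norm(t):
    # unit-normalize along the embedding dimension
    return F.normalize(t, dim=-1)

image_embed_scale = 512 ** 0.5  # hypothetical scale, e.g. sqrt(embedding dim)

def clamp_pred_x0(pred_x0):
    # sampling-time clamping: project the x0 prediction onto the
    # sphere of radius image_embed_scale before the next denoising step
    return l2norm(pred_x0) * image_embed_scale

def training_loss(pred, target):
    # training-time objective: plain regression, no clamping applied
    return F.mse_loss(pred, target)
```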

@xiankgx
Author

xiankgx commented May 6, 2022

Also, here we multiply by a scale without first applying l2norm.

https://github.com/lucidrains/DALLE2-pytorch/blob/main/dalle2_pytorch/dalle2_pytorch.py#L986

which is OK if we use XClip, because we are doing l2norm here:

https://github.com/lucidrains/DALLE2-pytorch/blob/main/dalle2_pytorch/dalle2_pytorch.py#L180

But we are not doing l2norm when using OpenAI CLIP.

https://github.com/lucidrains/DALLE2-pytorch/blob/main/dalle2_pytorch/dalle2_pytorch.py#L274-L275
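To illustrate the mismatch (a sketch under the assumptions above; the random tensors stand in for the actual encoder outputs):

```python
import torch
import torch.nn.functional as F

def l2norm(t):
    return F.normalize(t, dim=-1)

image_embed_scale = 512 ** 0.5  # illustrative, as in the sketch above

# XClip path: the embedding is unit-normalized before the prior scales it,
# so the scaled embedding always has norm == image_embed_scale
xclip_embed = l2norm(torch.randn(1, 512))
scaled = xclip_embed * image_embed_scale

# OpenAI CLIP adapter path (as quoted): the raw encoder output is scaled,
# so the resulting norm depends on the input
openai_embed = torch.randn(1, 512)  # stands in for clip.encode_image(image)
scaled_raw = openai_embed * image_embed_scale
```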

@lucidrains
Owner

@xiankgx good idea! i've added it here 14e63a3, although i think the whole l2norm clamping thing is not proven out yet
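(a sketch of the idea only, not the contents of the commit itself: the same clamp applied to the x0 prediction inside the training objective, so training and sampling match)

```python
import torch.nn.functional as F

def clamped_training_loss(pred_x0, target, image_embed_scale=512 ** 0.5):
    # hypothetical: clamp the prediction during training the same way
    # it is clamped during sampling
    pred_x0 = F.normalize(pred_x0, dim=-1) * image_embed_scale
    return F.mse_loss(pred_x0, target)
```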

@lucidrains
Owner

Also, here we multiply by a scale without first applying l2norm.

https://github.com/lucidrains/DALLE2-pytorch/blob/main/dalle2_pytorch/dalle2_pytorch.py#L986

which is OK if we use XClip, because we are doing l2norm here:

https://github.com/lucidrains/DALLE2-pytorch/blob/main/dalle2_pytorch/dalle2_pytorch.py#L180

But we are not doing l2norm when using OpenAI CLIP.

https://github.com/lucidrains/DALLE2-pytorch/blob/main/dalle2_pytorch/dalle2_pytorch.py#L213

https://github.com/lucidrains/DALLE2-pytorch/blob/main/dalle2_pytorch/dalle2_pytorch.py#L213 ohh, this isn't OpenAIClip, it is actually from CoCa https://arxiv.org/abs/2205.01917, which debuted yesterday. i think it is a better version of clip

however, it is unclear from the CoCa paper whether they apply l2norm before the cosine-similarity contrastive loss

in the paper, it seems they use layernorms on both the image and text cls tokens, but i'm not sure if the l2norm is present
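for reference, the setup in question looks roughly like this (a sketch; whether CoCa inserts the explicit l2norm after its layernorms is exactly the open question):

```python
import torch.nn.functional as F

def contrastive_logits(image_cls, text_cls, temperature=0.07):
    # layernorm on the cls tokens does not yield unit vectors, so an
    # explicit l2norm is still needed for a true cosine similarity
    image_embed = F.normalize(image_cls, dim=-1)
    text_embed = F.normalize(text_cls, dim=-1)
    return image_embed @ text_embed.t() / temperature
```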

@xiankgx
Author

xiankgx commented May 6, 2022

Sorry, wrong line quote.

@xiankgx
Author

xiankgx commented May 6, 2022

Lol, don't take my word for it, I'm a newbie in diffusion models.

@lucidrains
Owner

newbie

@xiankgx same, i think we all are, except for a few researchers around the world and maybe @crowsonkb lol

you are right! https://github.com/openai/CLIP/blob/main/clip/model.py#L364 they normalized it outside of the encoding functions, let me fix it now 🙏
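for anyone following along: in the linked OpenAI code the normalization happens in CLIP.forward, not in encode_image / encode_text, so an adapter that calls the encoders directly has to replicate it. roughly (a sketch, not the actual fix):

```python
import torch.nn.functional as F

# OpenAI's CLIP.forward normalizes after calling the encoders, roughly:
#   image_features = image_features / image_features.norm(dim=1, keepdim=True)
# an adapter calling encode_image directly must add that step itself:
def adapter_embed_image(clip_model, image):
    image_embed = clip_model.encode_image(image)
    return F.normalize(image_embed, dim=-1)
```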

@xiankgx
Author

xiankgx commented May 6, 2022

Maybe we can ask crowsonkb for advice.

