EMA update on CosineCodebook #26

Open · roomo7time opened this issue Sep 27, 2022 · 7 comments

roomo7time commented Sep 27, 2022

The original ViT-VQGAN paper does not seem to use an EMA update for codebook learning, since its codebook vectors are unit-normalized.

In particular, to my understanding, an EMA update does not quite make sense when the encoder outputs and codebook vectors are unit-normalized.

What's your take on this? Should we NOT use EMA update with CosineCodebook?
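To make the concern concrete, here is a minimal sketch (my own illustration, not this repo's actual implementation) of a standard VQ-VAE-style EMA codebook update applied to unit-normalized encoder outputs and codes. Because the EMA accumulators average unit vectors, the running mean is generally not unit-norm, so each updated code has to be re-normalized back onto the sphere, and the magnitude information the EMA statistics normally track is thrown away:

```python
import torch
import torch.nn.functional as F

codebook_size, dim, decay = 512, 64, 0.99

codes = F.normalize(torch.randn(codebook_size, dim), dim=-1)  # unit-norm codes
ema_cluster_size = torch.zeros(codebook_size)
ema_embed_sum = codes.clone()

@torch.no_grad()
def ema_update(z_e):
    """z_e: (batch, dim) raw encoder outputs."""
    global codes
    z = F.normalize(z_e, dim=-1)  # a cosine codebook also normalizes its inputs
    assign = F.one_hot((z @ codes.t()).argmax(dim=-1), codebook_size).float()
    ema_cluster_size.mul_(decay).add_(assign.sum(0), alpha=1 - decay)
    ema_embed_sum.mul_(decay).add_(assign.t() @ z, alpha=1 - decay)
    # the running mean of unit vectors is generally NOT unit-norm,
    # so the code must be re-normalized, discarding the tracked magnitudes
    mean = ema_embed_sum / ema_cluster_size.clamp(min=1e-5).unsqueeze(-1)
    codes = F.normalize(mean, dim=-1)

ema_update(torch.randn(8, dim))  # toy usage
```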


pengzhangzhi commented Oct 29, 2022

Would you like to explain why EMA does not work for a unit-normalized codebook?

Saltychtao commented

I found that when using EMA with the cosine codebook, the l2-norm of the input to the VQ module grows gradually, from 22 to 20000, leading to a growing training loss. Has anyone else met this problem?

Saltychtao commented

> I found that when using EMA with the cosine codebook, the l2-norm of the input to the VQ module grows gradually, from 22 to 20000, leading to a growing training loss. Has anyone else met this problem?

In case anyone else hits this problem: I added a LayerNorm layer after the vq_in projection, and the growing-norm problem is largely solved.
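For anyone who wants to try this, here is a rough sketch of what I mean, assuming you do the projection into the codebook dimension yourself instead of relying on the quantizer's internal project_in (the dimensions are made up, and constructor arguments may differ across versions of vector-quantize-pytorch):

```python
import torch
from torch import nn
from vector_quantize_pytorch import VectorQuantize

enc_dim, code_dim = 512, 32  # hypothetical encoder / codebook dimensions

# do the "vq_in" projection manually and bound its output with LayerNorm
project_in = nn.Sequential(
    nn.Linear(enc_dim, code_dim),
    nn.LayerNorm(code_dim),
)

vq = VectorQuantize(
    dim = code_dim,          # features are already in the codebook dimension
    codebook_size = 1024,
    use_cosine_sim = True,   # cosine codebook
    decay = 0.99,            # EMA codebook update
)

x = torch.randn(1, 196, enc_dim)  # e.g. a batch of ViT patch features
quantized, indices, commit_loss = vq(project_in(x))
```

With dim equal to the codebook dimension, the quantizer's own input projection should be effectively a no-op, so the LayerNorm sits directly in front of the codebook lookup and keeps the pre-quantization norm bounded.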


jzhang38 commented Mar 9, 2023

@Saltychtao I also encountered a similar issue. Does vq_in refer to VectorQuantize.project_in?

Saltychtao commented

> @Saltychtao I also encountered a similar issue. Does vq_in refer to VectorQuantize.project_in?

Yes.


santisy commented May 13, 2024

> I found that when using EMA with the cosine codebook, the l2-norm of the input to the VQ module grows gradually, from 22 to 20000, leading to a growing training loss. Has anyone else met this problem?
>
> In case anyone else hits this problem: I added a LayerNorm layer after the vq_in projection, and the growing-norm problem is largely solved.

@Saltychtao Hi, just to make sure: the current version of the implementation here seems to put one normalization (l2norm) after project_in. I also encounter the training-loss explosion issue with the current version.

lucidrains added a commit that referenced this issue May 13, 2024

lucidrains (Owner) commented May 13, 2024

@santisy want to try turning this on (following @Saltychtao's solution)?

Let me know if it helps.
