Question about Table 12 in the paper #13

yutaro-s · 2022-07-23T04:24:09Z

Hi. Thanks for the great work!
I'm trying to reproduce the results of Table 12 (impact of expander dimensionality) in your paper.
Could you tell me which hyperparameters you used in these experiments?

Adrien987k · 2022-07-26T08:24:37Z

Hi,

All the hyper-parameters are described in Section 4.2 of the paper, except for the base learning rate for 100 epochs of pretraining, which is 0.3 instead of 0.2 (described in Appendix C.4, "VICReg Setup").
The code runs on 8 GPUs with an effective batch size of 2048.
The hyper-parameters are the same for every embedding size in Table 12.
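
For reference, the numbers above can be turned into per-GPU settings with a little arithmetic. This is only a rough sketch: it assumes the learning rate is scaled linearly with the batch size as `lr = base_lr * batch_size / 256`, and the function and constant names are hypothetical, not taken from the repository. Please check Section 4.2 and the training script for the exact rule.

```python
# Sketch only: derives per-GPU batch size and the scaled learning rate
# from the values quoted in this thread. The linear scaling rule
# (reference batch size of 256) is an assumption, and all names here
# are hypothetical.

def per_gpu_batch_size(effective_batch_size: int, num_gpus: int) -> int:
    """Split the effective batch size evenly across GPUs."""
    if effective_batch_size % num_gpus != 0:
        raise ValueError("effective batch size must be divisible by num_gpus")
    return effective_batch_size // num_gpus


def scaled_lr(base_lr: float, effective_batch_size: int,
              reference_batch_size: int = 256) -> float:
    """Linear scaling rule (assumed): lr grows with the batch size."""
    return base_lr * effective_batch_size / reference_batch_size


BASE_LR_100_EPOCHS = 0.3      # from this thread (Appendix C.4)
EFFECTIVE_BATCH_SIZE = 2048   # from this thread
NUM_GPUS = 8                  # from this thread

if __name__ == "__main__":
    print(per_gpu_batch_size(EFFECTIVE_BATCH_SIZE, NUM_GPUS))
    print(scaled_lr(BASE_LR_100_EPOCHS, EFFECTIVE_BATCH_SIZE))
```

Under these assumptions each GPU would process 256 samples per step. If you train on fewer GPUs, keeping the effective batch size at 2048 (with a larger per-GPU batch) should keep the scaled learning rate unchanged.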

Hope this helps.

yutaro-s · 2022-07-26T13:09:16Z

Thanks for your quick reply. I'll try it.

yutaro-s closed this as completed Jul 26, 2022
