Has anyone encountered zero gradients? #45

huangjch526 · 2024-03-04T12:27:00Z

When I train Latte on UCF101, the gradients of the linear layer are all zero, which seems strange. Has anyone else seen zero gradients?

maxin-cn · 2024-03-04T22:35:33Z

Hi, in Latte's model initialization we adopt the widely used zero initialization, which may result in relatively small gradients for the corresponding layers.
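To illustrate the general effect, here is a minimal sketch in plain Python of a toy two-layer linear network whose output layer starts at zero. It is not Latte's code, and the names `W1`, `w2`, and `forward_backward` are hypothetical. It assumes the zero-initialized layer is the only path from the earlier layer to the loss. Under that assumption, the earlier layer's gradient is exactly zero on the first step and becomes non-zero once the zero-initialized weights have been updated. Whether this explains the gradients reported above is not established in this thread.

```python
# Toy network: y = w2 . (W1 x), loss = 0.5 * (y - target)^2.
# Hypothetical illustration only; none of these names come from Latte.

def forward_backward(W1, w2, x, target):
    """Return (loss, grad_W1, grad_w2) computed by hand."""
    # Hidden activation h = W1 x
    h = [sum(W1[i][j] * x[j] for j in range(len(x))) for i in range(len(W1))]
    # Scalar output y = w2 . h
    y = sum(w2[i] * h[i] for i in range(len(h)))
    err = y - target
    loss = 0.5 * err * err
    # dL/dw2[i] = err * h[i]
    grad_w2 = [err * h[i] for i in range(len(h))]
    # dL/dW1[i][j] = err * w2[i] * x[j]; the signal must pass back through w2
    grad_W1 = [[err * w2[i] * x[j] for j in range(len(x))]
               for i in range(len(W1))]
    return loss, grad_W1, grad_w2

W1 = [[0.5, -0.3], [0.2, 0.8]]  # ordinary non-zero initialization
w2 = [0.0, 0.0]                 # zero-initialized output layer
x, target, lr = [1.0, 2.0], 1.0, 0.1

# Step 0: w2 is all zeros, so nothing flows back to W1.
_, grad_W1_step0, grad_w2_step0 = forward_backward(W1, w2, x, target)

# Apply one plain SGD update to w2, then recompute the gradients.
w2 = [w2[i] - lr * grad_w2_step0[i] for i in range(len(w2))]
_, grad_W1_step1, _ = forward_backward(W1, w2, x, target)

print("step 0 grad_W1:", grad_W1_step0)
print("step 0 grad_w2:", grad_w2_step0)
print("step 1 grad_W1:", grad_W1_step1)
```

If the gradients of a layer stay at zero after many optimizer steps, rather than only at the start of training, zero initialization alone would not account for it in this toy setting.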

maxin-cn added the "question" label (Further information is requested) on Mar 10, 2024

huangjch526 closed this as completed Mar 26, 2024
