Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

anyone meet zero grad?零梯度? #45

Closed
huangjch526 opened this issue Mar 4, 2024 · 1 comment
Closed

anyone meet zero grad?零梯度? #45

huangjch526 opened this issue Mar 4, 2024 · 1 comment
Labels
question Further information is requested

Comments

@huangjch526
Copy link

when I train latte on UCF101, the grad of linear layer are all zero. I think it is strange, 零梯度?

@maxin-cn
Copy link
Collaborator

maxin-cn commented Mar 4, 2024

when I train latte on UCF101, the grad of linear layer are all zero. I think it is strange, 零梯度?

Hi, in Latte model initialization, we adopt the widely used 0 initialization, which may result in a relatively small gradient of the corresponding layer.

@maxin-cn maxin-cn added the question Further information is requested label Mar 10, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
question Further information is requested
Projects
None yet
Development

No branches or pull requests

2 participants