-
Notifications
You must be signed in to change notification settings - Fork 59
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
The tensor output by self.vertice_mapping in the TransformerEncoder of stage1 is all nan #10
Comments
I have not encountered this before. Are you using the default config for training? It may be solved by scaling down the learning rate I guess? |
I has the same problem.
|
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Generally, when training to the second epoch, the output results are all nan. At this time, I check the bias and weight of the linear layer, and the results are all nan.
The text was updated successfully, but these errors were encountered: