-
Notifications
You must be signed in to change notification settings - Fork 28
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Hi, can the code run probably? #1
Comments
I can run it, but it doesn't work,Did you solve it? |
No, it does not work for me either. |
I also found this problem. I'm try to solve it . |
can't converge at all |
Hi, it seems there is a bug in the L2 Attention module. The weights of the projection matrices for query and key in L2 Attention should be tied (same weights). Otherwise, the Lipschitzness cannot be guaranteed. This is missed in the code. Hope it could help. :-) |
Excuse me, big shots, did the code run successfully? How to run it? |
No description provided.
The text was updated successfully, but these errors were encountered: