Welcome to gateloop-transformer Discussions! #1
Replies: 4 comments 21 replies
-
thoughts about paper? |
Beta Was this translation helpful? Give feedback.
-
In the updated readme you mention:
Is this at all because of how you had to implement it in pytorch? I.e., would you expect better results in jax with its native associate scan? The paper is also suspiciously missing any mention of training compute comparisons, so maybe it's a fundamental issue. |
Beta Was this translation helpful? Give feedback.
-
Awesome work! |
Beta Was this translation helpful? Give feedback.
-
The official code has been released : https://github.com/tobiaskatsch/GateLoop |
Beta Was this translation helpful? Give feedback.
-
👋 Welcome!
We’re using Discussions as a place to connect with other members of our community. We hope that you:
build together 💪.
To get started, comment below with an introduction of yourself and tell us about what you do with this community.
Beta Was this translation helpful? Give feedback.
All reactions