Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add Model Support for xLSTM #27011

Open
2 of 3 tasks
stefan-it opened this issue Oct 23, 2023 · 7 comments
Open
2 of 3 tasks

Add Model Support for xLSTM #27011

stefan-it opened this issue Oct 23, 2023 · 7 comments

Comments

@stefan-it
Copy link
Collaborator

stefan-it commented Oct 23, 2023

Model description

Inspired by recent rumors about xLSTM - a hidden successor to LSTM - by Sepp Hochreiter, this issue tracks the open source implementation about adding xLSTM to Transformers library.

Open source status

  • The model implementation is available here
  • The model weights are available

Provide useful links for the implementation

  • Paper is available here

At the moment no implementation does exist.

Only rumors that xLSTM surpasses GPT-2 on various (small) downstream datasets.

Good overview is the xLSTM Resources repository from @AI-Guru.

@Pythoniasm
Copy link

Pythoniasm commented Oct 23, 2023

Sounds like a money grab. If it is something useful, he should have chosen the academic path or at least filing patent.

This way of boldly claiming success via non-serious media channels is highly unprofessional. It smells like publicity is more relevant than results which further supports motivations like funding/personal gains/politics.

@DavidFarago
Copy link

If I understood it correctly, a patent is on its way, and at least a paper about xLSTM will be published in less than 6 month.

@KnutJaegersberg
Copy link

I have some doubts if this is planned as an open source model.

@albertz
Copy link

albertz commented May 8, 2024

Paper is published now: https://arxiv.org/abs/2405.04517

@Ghost---Shadow
Copy link

Need code and checkpoint or it didn't happen.

@stefan-it
Copy link
Collaborator Author

Official implementation is out now:

https://github.com/NX-AI/xlstm

@danthe1st
Copy link

Note that the official source code is AGPL-licensed.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

7 participants