Add Model Support for xLSTM #27011
Comments
Sounds like a money grab. If it is something useful, he should have taken the academic path or at least filed a patent. Boldly claiming success via non-serious media channels is highly unprofessional. It suggests publicity matters more than results, which further supports motivations like funding, personal gain, or politics.
If I understood it correctly, a patent is on its way, and a paper about xLSTM will be published within six months.
I have some doubts about whether this is planned as an open-source model.
Paper is published now: https://arxiv.org/abs/2405.04517
Need code and checkpoint or it didn't happen.
Official implementation is out now: |
Note that the official source code is AGPL-licensed. |
Model description
Inspired by recent rumors about xLSTM, a successor to LSTM by Sepp Hochreiter, this issue tracks the open-source effort to add xLSTM to the Transformers library.
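For context on what such an implementation would involve: the xLSTM paper (arXiv:2405.04517) describes an sLSTM variant that replaces sigmoid input/forget gates with exponential gates plus a normalizer state and a log-domain stabilizer. The sketch below is a minimal single-cell NumPy illustration of that recurrence; the weight names, shapes, and single-head simplification are illustrative assumptions, not the official implementation.

```python
import numpy as np

def slstm_step(x, h, c, n, m, W, R, b):
    """One sLSTM-style step. W: (4, d_h, d_x), R: (4, d_h, d_h), b: (4, d_h).

    Illustrative sketch only; gate ordering and parameterization are assumptions.
    """
    # Pre-activations: cell input (z), input gate (i), forget gate (f), output gate (o).
    z_t = np.tanh(W[0] @ x + R[0] @ h + b[0])
    i_tilde = W[1] @ x + R[1] @ h + b[1]
    f_tilde = W[2] @ x + R[2] @ h + b[2]
    o_t = 1.0 / (1.0 + np.exp(-(W[3] @ x + R[3] @ h + b[3])))

    # Exponential gates with a log-domain stabilizer state m to avoid overflow.
    m_new = np.maximum(f_tilde + m, i_tilde)
    i_t = np.exp(i_tilde - m_new)
    f_t = np.exp(f_tilde + m - m_new)

    c_new = f_t * c + i_t * z_t          # cell state
    n_new = f_t * n + i_t                # normalizer state (always > 0 after step 1)
    h_new = o_t * (c_new / n_new)        # normalized hidden state
    return h_new, c_new, n_new, m_new

# Run the recurrence over a short random sequence with random weights.
rng = np.random.default_rng(0)
d_x, d_h, T = 3, 4, 5
W = rng.normal(scale=0.1, size=(4, d_h, d_x))
R = rng.normal(scale=0.1, size=(4, d_h, d_h))
b = np.zeros((4, d_h))

h = np.zeros(d_h); c = np.zeros(d_h); n = np.zeros(d_h); m = np.zeros(d_h)
for t in range(T):
    h, c, n, m = slstm_step(rng.normal(size=d_x), h, c, n, m, W, R, b)
print(h.shape)  # (4,)
```

The stabilizer `m` keeps the exponentials bounded (the largest pre-activation maps to `exp(0) = 1`), which is what makes exponential gating numerically usable over long sequences.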
Open source status
Provide useful links for the implementation
At the moment, no implementation exists.
There are only rumors that xLSTM surpasses GPT-2 on various (small) downstream datasets.
A good overview is the xLSTM Resources repository from @AI-Guru.