Support DeepSpeed FastGen #1538

thiner · 2024-01-03T10:54:42Z

Is your feature request related to a problem? Please describe.

No.

Describe the solution you'd like

DeepSpeed FastGen is an inference framework developed by MicroSoft. They claim that it's two times faster than vllm. https://github.com/microsoft/DeepSpeed/tree/master/blogs/deepspeed-fastgen

Describe alternatives you've considered

No.

Additional context

I haven't tested FastGen, just attracted by their blog. I searched in this repo, seems no one mentioned this framework yet, so I'd like to bring it to the attention of community.

thiner · 2024-01-03T14:28:40Z

Glad to see you have added it to the roadmap.

mudler · 2024-01-03T14:35:36Z

Glad to see you have added it to the roadmap.

sounds a solid backend to have, thanks for the tip 👍 good to see that there is interest in this backend being added. Definetly a good addition for LocalAI

thiner added the enhancement New feature or request label Jan 3, 2024

thiner assigned mudler Jan 3, 2024

thiner changed the title ~~Support~~ Support DeepSpeed FastGen Jan 3, 2024

mudler added the roadmap label Jan 3, 2024

mudler mentioned this issue Jan 3, 2024

[EPIC] Model support dashboard (v2) #1126

Open

90 tasks

mudler added area/ai-model area/backends labels Jan 24, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Support DeepSpeed FastGen #1538

Support DeepSpeed FastGen #1538

thiner commented Jan 3, 2024 •

edited

Loading

thiner commented Jan 3, 2024

mudler commented Jan 3, 2024 •

edited

Loading

Support DeepSpeed FastGen #1538

Support DeepSpeed FastGen #1538

Comments

thiner commented Jan 3, 2024 • edited Loading

thiner commented Jan 3, 2024

mudler commented Jan 3, 2024 • edited Loading

thiner commented Jan 3, 2024 •

edited

Loading

mudler commented Jan 3, 2024 •

edited

Loading