Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Support DeepSpeed FastGen #1538

Open
thiner opened this issue Jan 3, 2024 · 2 comments
Open

Support DeepSpeed FastGen #1538

thiner opened this issue Jan 3, 2024 · 2 comments
Assignees

Comments

@thiner
Copy link
Contributor

thiner commented Jan 3, 2024

Is your feature request related to a problem? Please describe.

No.

Describe the solution you'd like

DeepSpeed FastGen is an inference framework developed by MicroSoft. They claim that it's two times faster than vllm. https://github.com/microsoft/DeepSpeed/tree/master/blogs/deepspeed-fastgen

Describe alternatives you've considered

No.

Additional context

I haven't tested FastGen, just attracted by their blog. I searched in this repo, seems no one mentioned this framework yet, so I'd like to bring it to the attention of community.

@thiner thiner added the enhancement New feature or request label Jan 3, 2024
@thiner thiner changed the title Support Support DeepSpeed FastGen Jan 3, 2024
@mudler mudler added the roadmap label Jan 3, 2024
@thiner
Copy link
Contributor Author

thiner commented Jan 3, 2024

Glad to see you have added it to the roadmap.

@mudler
Copy link
Owner

mudler commented Jan 3, 2024

Glad to see you have added it to the roadmap.

sounds a solid backend to have, thanks for the tip 👍 good to see that there is interest in this backend being added. Definetly a good addition for LocalAI

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants