Introduce speculative decoding with draft models to vLLM #3029

Closed
wants to merge 3 commits into from

Speculative decoding with draft model

6c87e92

Workflow runs completed with no jobs