-
-
Notifications
You must be signed in to change notification settings - Fork 6.3k
Issues: vllm-project/vllm
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
[RFC]: Automate Speculative Decoding
RFC
stale
Over 90 days of inactivity
#4565
opened May 2, 2024 by
LiuXiaoxuanPKU
[RFC]: Implement disaggregated prefilling via KV cache transfer
RFC
#5557
opened Jun 14, 2024 by
KuntaiDu
[RFC]: Pipeline-Parallelism for vLLM V1
RFC
v1
#11945
opened Jan 10, 2025 by
ruisearch42
1 task done
[RFC]: Disaggregated prefilling and KV cache transfer roadmap
RFC
#10818
opened Dec 2, 2024 by
KuntaiDu
4 of 28 tasks
[Feature]: DeepSeek-R1 tool choice && Function Call
feature request
New feature or request
#12297
opened Jan 22, 2025 by
warlockedward
1 task done
[New Model]: support minimax-01
new model
Requests to new models
#12073
opened Jan 15, 2025 by
liyawei87
1 task done
[Feature]: Integrate Writing in the Margins inference pattern ($5,000 Bounty)
feature request
New feature or request
unstale
Recieved activity after being labelled stale
#9807
opened Oct 29, 2024 by
melisa-writer
1 task done
Add support for ReFT
feature request
New feature or request
stale
Over 90 days of inactivity
#4413
opened Apr 27, 2024 by
RonanKMcGovern
[Feature]: Multi-Token Prediction (MTP)
feature request
New feature or request
#12181
opened Jan 18, 2025 by
casper-hansen
1 task done
[Feature]: Add qwen2.5-VL-7B-Instruct video support
feature request
New feature or request
#13050
opened Feb 10, 2025 by
kelvinzhao1
1 task done
[RFC]: Async KV Cache Transfer for Disaggregated Inference
RFC
#13020
opened Feb 10, 2025 by
VertexC
1 task done
[RFC]: [V1] TPU support and multiple architecture support
RFC
v1
#12480
opened Jan 27, 2025 by
alexm-redhat
1 task done
[Feature]: support for Cambricon MLU
feature request
New feature or request
unstale
Recieved activity after being labelled stale
#9649
opened Oct 24, 2024 by
a120092009
1 task done
[Feature]: Expose option to load new model weights from disk
feature request
New feature or request
#12774
opened Feb 5, 2025 by
edbeeching
1 task done
[Bug]: [v0.6.5] Streaming tool call responses with the hermes template is inconsistent with the non-stream version.
bug
Something isn't working
#11392
opened Dec 21, 2024 by
elementary-particle
1 task done
[Performance]: How to Improve Performance Under Concurrency
performance
Performance-related issues
#9722
opened Oct 26, 2024 by
ljwps
1 task done
Previous Next
ProTip!
Updated in the last three days: updated:>2025-03-14.