Skip to content

Issues: vllm-project/vllm

[Roadmap] vLLM Roadmap Q1 2025
#11862 opened Jan 8, 2025 by simon-mo
Open 9
[V1] Feedback Thread
#12568 opened Jan 30, 2025 by simon-mo
Open 79
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Assignee
Filter by who’s assigned
Sort

Issues list

DeciLMConfig object has no attribute ‘num_key_value_heads_per_layer’ For Nemotron bug Something isn't working
#15625 opened Mar 27, 2025 by manitadayon
1 task done
[Bug]: vllm 0.8.2 have severe quality problem bug Something isn't working
#15622 opened Mar 27, 2025 by aabbccddwasd
1 task done
[Bug]: Triton JIT Compile Regression from PR 15511 bug Something isn't working
#15619 opened Mar 27, 2025 by Qubitium
1 task done
[New Model]: Please support Babel series model ASAP new model Requests to new models
#15612 opened Mar 27, 2025 by ifyoulovexxz
1 task done
[Bug]: Failed to run deepseek v2 lite model with tp = 4 bug Something isn't working
#15607 opened Mar 27, 2025 by jiangjiadi
1 task done
[Usage]: Will dynamo be on vllm main branch? usage How to use vllm
#15606 opened Mar 27, 2025 by johnnynunez
1 task done
[Bug]: Failed to run deepseek v2 lite model with tp = 8 when enabling expert parallel bug Something isn't working
#15604 opened Mar 27, 2025 by jiangjiadi
1 task done
[Performance]: How to install and use vLLM to serve multiple large language models performance Performance-related issues
#15602 opened Mar 27, 2025 by moshilangzi
1 task done
[Bug]: Gemma3 GPU memory usage is always oom bug Something isn't working
#15599 opened Mar 27, 2025 by lyj157175
1 task done
[Bug]: Model Reasoning Warning bug Something isn't working
#15596 opened Mar 27, 2025 by Eduiskss
1 task done
[Bug]:ModuleNotFoundError: No module named 'vllm._C' bug Something isn't working
#15592 opened Mar 27, 2025 by lastlastsummer
[Bug]: DeepSeek R1 with V1+FLASHMLA on L40S bug Something isn't working
#15590 opened Mar 27, 2025 by longqu
1 task done
[Bug]: guided_json not working correctly with (quantized) mistral-small model bug Something isn't working
#15577 opened Mar 26, 2025 by VMinB12
1 task done
TP4 fails with 5090 in the mix
#15576 opened Mar 26, 2025 by pavanimajety
[Bug]: Vllm 0.8.2 + Ray 2.44 (Ray serve deployment) fallbacks to V0 Engine bug Something isn't working
#15569 opened Mar 26, 2025 by Qasimk555
1 task done
ProTip! no:milestone will show everything without a milestone.