Skip to content

Issues: vllm-project/vllm

[Roadmap] vLLM Roadmap Q2 2025
#15735 opened Mar 29, 2025 by simon-mo
Open 2
[V1] Feedback Thread
#12568 opened Jan 30, 2025 by simon-mo
Open 85
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Assignee
Filter by who’s assigned
Sort

Issues list

[Bug]: VLLM_USE_V1=0 is needed if prompt length equals max model length bug Something isn't working
#16445 opened Apr 11, 2025 by OyvindTafjord
1 task done
[Usage]: 大量请求排队的时候推理速度很慢是什么原因 usage How to use vllm
#16444 opened Apr 11, 2025 by pyaaaa
1 task done
[Bug]: Is V1 Enigne ready for DeepSeek-V1/R1 ? bug Something isn't working
#16442 opened Apr 11, 2025 by handsome-chips
1 task done
[Bug]: [RLHF] Weights update broken with V1 multiprocessing bug Something isn't working
#16434 opened Apr 10, 2025 by 22quinn
1 task done
[Bug]: Slow model loading from FSx storage in Kubernetes bug Something isn't working
#16433 opened Apr 10, 2025 by shivam-dubey-1
1 task done
[Bug]: Qwen2.5 assistant output on tool call is empty bug Something isn't working
#16430 opened Apr 10, 2025 by ItzAmirreza
1 task done
[Bug]: Cannot load Qwen2.5-VL bug Something isn't working
#16429 opened Apr 10, 2025 by furkanc
1 task done
[Installation]: Installing only with Flash-Attn2 installation Installation problems
#16427 opened Apr 10, 2025 by ziqipang
1 task done
[Bug]: Llama4 Scout fails on H200 bug Something isn't working
#16414 opened Apr 10, 2025 by jjk-g
1 task done
[Feature]: (FIX) triton should be moved to requirements/cuda.txt feature request New feature or request
#16413 opened Apr 10, 2025 by Shafi-Hussain
1 task done
[Bug]: corrupted double-linked list (not small) Aborted bug Something isn't working
#16412 opened Apr 10, 2025 by qiuhaining
1 task done
[Usage]: xpxd is useless? usage How to use vllm
#16409 opened Apr 10, 2025 by tensorflowt
1 task done
[New Model]: Multimodal Embedding Model GME.
#16406 opened Apr 10, 2025 by Adenialzz
1 task done
[Bug]: CPU version cant run python3 with non-root user bug Something isn't working
#16402 opened Apr 10, 2025 by yongfengdu
1 task done
[Usage]: How to use logit-processor in api server? usage How to use vllm
#16399 opened Apr 10, 2025 by Auraithm
1 task done
[Bug]: CUDA error: an illegal memory access was encountered bug Something isn't working
#16398 opened Apr 10, 2025 by jifa513
1 task done
[Feature]: co-exist of multiply kv connector feature request New feature or request
#16397 opened Apr 10, 2025 by maobaolong
1 task done
[Bug]: Qwen2.5 tool call failed bug Something isn't working
#16393 opened Apr 10, 2025 by kimlee1874
1 task done
[Bug]: LLama4 Not working on PP bug Something isn't working
#16385 opened Apr 10, 2025 by anujkhannac1
1 task done
ProTip! Type g i on any issue or pull request to go back to the issue listing page.