Skip to content

Issues: vllm-project/vllm

[Roadmap] vLLM Roadmap Q1 2025
#11862 opened Jan 8, 2025 by simon-mo
Open 8
[V1] Feedback Thread
#12568 opened Jan 30, 2025 by simon-mo
Open 72
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Assignee
Filter by who’s assigned
Sort

Issues list

[Feature]: looking into adding a generation algorithm feature request New feature or request
#15315 opened Mar 22, 2025 by tiger241
1 task done
[Usage]: Async engine batch request usage How to use vllm
#15314 opened Mar 21, 2025 by tiger241
1 task done
[Bug]: vLLM declares itself healthy before it can serve requests bug Something isn't working
#15313 opened Mar 21, 2025 by kiratp
1 task done
[Bug]: Crashing on unsupported Sampling params bug Something isn't working
#15312 opened Mar 21, 2025 by kiratp
1 task done
[Bug]: OPEA/Mistral-Small-3.1-24B-Instruct-2503-int4-AutoRound-awq-sym error bug Something isn't working
#15300 opened Mar 21, 2025 by moshilangzi
1 task done
[Bug]: Worker VllmWorkerProcess pid 000000 died, exit code: -15 bug Something isn't working
#15295 opened Mar 21, 2025 by a7mad911
1 task done
[Bug]: Critical Memory Leak in vLLM V1 Engine: 200+ GB RAM Usage from Image Inference bug Something isn't working
#15294 opened Mar 21, 2025 by oyerli
1 task done
[Bug]: Qwen2.5 VL online service can not input video and image simultaneously. bug Something isn't working
#15291 opened Mar 21, 2025 by Thyme-git
1 task done
[Feature]: Dynamic Memory Release for GPU after idle time feature request New feature or request
#15287 opened Mar 21, 2025 by kmamine
1 task done
[Usage]: why no ray command in my docker image usage How to use vllm
#15284 opened Mar 21, 2025 by yanzhichao
1 task done
[Bug]: int8 2:4 sparse time more than fp8 bug Something isn't working
#15275 opened Mar 21, 2025 by zhink
1 task done
[Bug]:streming is lost in arguments in tool_calls bug Something isn't working
#15274 opened Mar 21, 2025 by xiaodizi
1 task done
[Bug]: Inconsistent Output Based on Presence of chat_template Parameter bug Something isn't working
#15272 opened Mar 21, 2025 by SmartManoj
1 task done
vector search feature request New feature or request
#15268 opened Mar 21, 2025 by 20246688
1 task
[Feature]: Can support CPU inference with Ray cluster? feature request New feature or request
#15266 opened Mar 21, 2025 by MaoJianwei
1 task done
[Bug]: qwen2.5vl cannot use fp8 quantization bug Something isn't working
#15264 opened Mar 21, 2025 by lessmore991
1 task done
[Bug]: oracle for device checking raise exception unexpectly bug Something isn't working
#15263 opened Mar 21, 2025 by Selkh
1 task done
[Bug]: OOM with QwQ-32B bug Something isn't working
#15258 opened Mar 21, 2025 by vmajor
1 task done
[Bug]: --tensor-parallel-size Error bug Something isn't working
#15255 opened Mar 20, 2025 by IAMJOYBO
1 task done
[Performance]: V0 and V1 give the same throughput number performance Performance-related issues
#15253 opened Mar 20, 2025 by DanlinJia
1 task done
ProTip! Exclude everything labeled bug with -label:bug.