-
-
Notifications
You must be signed in to change notification settings - Fork 6.8k
Issues: vllm-project/vllm
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
[Bug]: VLLM_USE_V1=0 is needed if prompt length equals max model length
bug
Something isn't working
#16445
opened Apr 11, 2025 by
OyvindTafjord
1 task done
[Usage]: 大量请求排队的时候推理速度很慢是什么原因
usage
How to use vllm
#16444
opened Apr 11, 2025 by
pyaaaa
1 task done
[Bug]: Is V1 Enigne ready for DeepSeek-V1/R1 ?
bug
Something isn't working
#16442
opened Apr 11, 2025 by
handsome-chips
1 task done
[Bug]: [RLHF] Weights update broken with V1 multiprocessing
bug
Something isn't working
#16434
opened Apr 10, 2025 by
22quinn
1 task done
[Bug]: Slow model loading from FSx storage in Kubernetes
bug
Something isn't working
#16433
opened Apr 10, 2025 by
shivam-dubey-1
1 task done
[Bug]: Qwen2.5 assistant output on tool call is empty
bug
Something isn't working
#16430
opened Apr 10, 2025 by
ItzAmirreza
1 task done
[Bug]: Cannot load Qwen2.5-VL
bug
Something isn't working
#16429
opened Apr 10, 2025 by
furkanc
1 task done
[Installation]: Installing only with Flash-Attn2
installation
Installation problems
#16427
opened Apr 10, 2025 by
ziqipang
1 task done
[Bug]: Llama4 Scout fails on H200
bug
Something isn't working
#16414
opened Apr 10, 2025 by
jjk-g
1 task done
[Feature]: (FIX) New feature or request
triton
should be moved to requirements/cuda.txt
feature request
#16413
opened Apr 10, 2025 by
Shafi-Hussain
1 task done
[Bug]: corrupted double-linked list (not small) Aborted
bug
Something isn't working
#16412
opened Apr 10, 2025 by
qiuhaining
1 task done
[BENCHMARK] How to force output size in benchmark_serving.py?
usage
How to use vllm
#16411
opened Apr 10, 2025 by
dsantiago
[Usage]: xpxd is useless?
usage
How to use vllm
#16409
opened Apr 10, 2025 by
tensorflowt
1 task done
[Usage]: UndefinedVar: Usage of undefined variable '$LD_PRELOAD' (line 54)
usage
How to use vllm
#16404
opened Apr 10, 2025 by
azhuvath
1 task done
[Bug]: CPU version cant run python3 with non-root user
bug
Something isn't working
#16402
opened Apr 10, 2025 by
yongfengdu
1 task done
[Usage]: How to use logit-processor in api server?
usage
How to use vllm
#16399
opened Apr 10, 2025 by
Auraithm
1 task done
[Bug]: CUDA error: an illegal memory access was encountered
bug
Something isn't working
#16398
opened Apr 10, 2025 by
jifa513
1 task done
[Feature]: co-exist of multiply kv connector
feature request
New feature or request
#16397
opened Apr 10, 2025 by
maobaolong
1 task done
[Usage]: --kv-transfer-config is not supported by the V1 Engine. Falling back to V0.
usage
How to use vllm
#16395
opened Apr 10, 2025 by
wzh195879702
1 task done
[Bug]: AMD Instinct MI210 + vllm fail to run the official deepseek-r1 model: ValueError("type fp8e4b8 not supported in this architecture. The supported fp8 dtypes are ('fp8e5',)")
bug
Something isn't working
#16394
opened Apr 10, 2025 by
luciaganlulu
1 task done
[Bug]: Qwen2.5 tool call failed
bug
Something isn't working
#16393
opened Apr 10, 2025 by
kimlee1874
1 task done
[Bug]: I use VLLM to deploy DeepSeek, and when there are high concurrency requests, many of them are pending reqs. How can I set a timeout for these waiting requests?
bug
Something isn't working
#16391
opened Apr 10, 2025 by
nvliajia
1 task done
[Bug]: AMD Instinct MI210 + vllm fail to run deepseek-r1-awq model, any solutions please? Is there any other deepseek-r1-671b models that can run succesfully on AMD Instinct MI210 + vllm? Thanks!
bug
Something isn't working
#16386
opened Apr 10, 2025 by
luciaganlulu
1 task done
[Bug]: LLama4 Not working on PP
bug
Something isn't working
#16385
opened Apr 10, 2025 by
anujkhannac1
1 task done
Previous Next
ProTip!
Type g i on any issue or pull request to go back to the issue listing page.