vllm-project / vllm Public

Notifications
Fork 6.8k
Star 44.1k

Code
Issues 1.6k
Pull requests 551
Discussions
Actions
Projects 10
Security
Insights

Additional navigation options

Code
Issues
Pull requests
Discussions
Actions
Projects
Security
Insights

Issues: vllm-project/vllm

[Roadmap] vLLM Roadmap Q2 2025

#15735 opened Mar 29, 2025 by simon-mo

Open 2

[V1] Feedback Thread

#12568 opened Jan 30, 2025 by simon-mo

Open 85

Labels 45 Milestones 0

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

1,641 Open 6,203 Closed

Author

Filter by author

Label

Filter by label

Use alt + click/return to exclude labels

or ⇧ + click/return for logical OR

Projects

Filter by project

Milestones

Filter by milestone

Assignee

Filter by who’s assigned

Assigned to nobody

Sort

Sort by

Newest Oldest Most commented Least commented Recently updated Least recently updated Best match

Most reactions

Issues list

[Bug]: VLLM_USE_V1=0 is needed if prompt length equals max model length bug

Something isn't working

#16445 opened Apr 11, 2025 by OyvindTafjord

1 task done

[Usage]: 大量请求排队的时候推理速度很慢是什么原因 usage

How to use vllm

#16444 opened Apr 11, 2025 by pyaaaa

1 task done

[Bug]: Is V1 Enigne ready for DeepSeek-V1/R1 ? bug

Something isn't working

#16442 opened Apr 11, 2025 by handsome-chips

1 task done

[Bug]: [RLHF] Weights update broken with V1 multiprocessing bug

Something isn't working

#16434 opened Apr 10, 2025 by 22quinn

1 task done

[Bug]: Slow model loading from FSx storage in Kubernetes bug

Something isn't working

#16433 opened Apr 10, 2025 by shivam-dubey-1

1 task done

[Bug]: Qwen2.5 assistant output on tool call is empty bug

Something isn't working

#16430 opened Apr 10, 2025 by ItzAmirreza

1 task done

[Bug]: Cannot load Qwen2.5-VL bug

Something isn't working

#16429 opened Apr 10, 2025 by furkanc

1 task done

[Installation]: Installing only with Flash-Attn2 installation

Installation problems

#16427 opened Apr 10, 2025 by ziqipang

1 task done

[Bug]: Llama4 Scout fails on H200 bug

Something isn't working

#16414 opened Apr 10, 2025 by jjk-g

1 task done

[Feature]: (FIX) triton should be moved to requirements/cuda.txt feature request

New feature or request

#16413 opened Apr 10, 2025 by Shafi-Hussain

1 task done

[Bug]: corrupted double-linked list (not small) Aborted bug

Something isn't working

#16412 opened Apr 10, 2025 by qiuhaining

1 task done

[BENCHMARK] How to force output size in benchmark_serving.py? usage

How to use vllm

#16411 opened Apr 10, 2025 by dsantiago

[Usage]: xpxd is useless？ usage

How to use vllm

#16409 opened Apr 10, 2025 by tensorflowt

1 task done

[New Model]: Multimodal Embedding Model GME.

#16406 opened Apr 10, 2025 by Adenialzz

1 task done

[Usage]: UndefinedVar: Usage of undefined variable '$LD_PRELOAD' (line 54) usage

How to use vllm

#16404 opened Apr 10, 2025 by azhuvath

1 task done

[Bug]: CPU version cant run python3 with non-root user bug

Something isn't working

#16402 opened Apr 10, 2025 by yongfengdu

1 task done

[Usage]: How to use logit-processor in api server？ usage

How to use vllm

#16399 opened Apr 10, 2025 by Auraithm

1 task done

[Bug]: CUDA error: an illegal memory access was encountered bug

Something isn't working

#16398 opened Apr 10, 2025 by jifa513

1 task done

[Feature]: co-exist of multiply kv connector feature request

New feature or request

#16397 opened Apr 10, 2025 by maobaolong

1 task done

[Usage]: --kv-transfer-config is not supported by the V1 Engine. Falling back to V0. usage

How to use vllm

#16395 opened Apr 10, 2025 by wzh195879702

1 task done

[Bug]: AMD Instinct MI210 + vllm fail to run the official deepseek-r1 model: ValueError("type fp8e4b8 not supported in this architecture. The supported fp8 dtypes are ('fp8e5',)") bug

Something isn't working

#16394 opened Apr 10, 2025 by luciaganlulu

1 task done

[Bug]: Qwen2.5 tool call failed bug

Something isn't working

#16393 opened Apr 10, 2025 by kimlee1874

1 task done

[Bug]: I use VLLM to deploy DeepSeek, and when there are high concurrency requests, many of them are pending reqs. How can I set a timeout for these waiting requests? bug

Something isn't working

#16391 opened Apr 10, 2025 by nvliajia

1 task done

[Bug]: AMD Instinct MI210 + vllm fail to run deepseek-r1-awq model, any solutions please? Is there any other deepseek-r1-671b models that can run succesfully on AMD Instinct MI210 + vllm? Thanks! bug

Something isn't working

#16386 opened Apr 10, 2025 by luciaganlulu

1 task done

[Bug]: LLama4 Not working on PP bug

Something isn't working

#16385 opened Apr 10, 2025 by anujkhannac1

1 task done

Previous 1 2 3 4 5 … 65 66 Next

Previous Next

ProTip! Type g i on any issue or pull request to go back to the issue listing page.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly