Issues: vllm-project/vllm

[Roadmap] vLLM Roadmap Q1 2025
#11862 opened Jan 8, 2025 by simon-mo
Open 7
[V1] Feedback Thread
#12568 opened Jan 30, 2025 by simon-mo
Open 63
Issues list

[Feature]: Use guided_decoding to implement Function Calling (feature request)
#14890 opened Mar 16, 2025 by shell-nlp

[Usage]: ModuleNotFoundError: No module named 'triton' (usage)
#14888 opened Mar 16, 2025 by Hinsael

[Bug]: 0.7.4.dev, error occurred in the gptq_marlin_gemm function call (bug)
#14887 opened Mar 16, 2025 by su400

[Bug]: RuntimeError: CUDA error: invalid argument (bug)
#14885 opened Mar 16, 2025 by vladlen32230

[New Model]: Command A with tool support (new model)
#14866 opened Mar 15, 2025 by Hexoplon

[Bug]: TTFT performance regression in vLLM v0.7.0 compared to v0.6.1.post2 (bug)
#14845 opened Mar 14, 2025 by asleepykitty

[Feature]: Specify model only in config.yaml (feature request, good first issue)
#14819 opened Mar 14, 2025 by g0t4

[Usage]: max_model_len, max_num_seqs and mm_counts (usage)
#14816 opened Mar 14, 2025 by TheFloHub

[Bug]: Gemma-3-27b-it-GPTQ can't run on sm75, vllm-0.7.4.dev (bug)
#14814 opened Mar 14, 2025 by HelloCard

[Usage]: How to reduce the number of compile_worker processes (usage)
#14808 opened Mar 14, 2025 by FanYaning

[Bug]: CUDA_VISIBLE_DEVICES is not supported (bug)
#14807 opened Mar 14, 2025 by chenhongyu2048

[Usage]: How to make sure the timeout takes effect (usage)
#14792 opened Mar 14, 2025 by DayDayupupupup

[V1][Bug]: IMA with ngram spec decoding and flashinfer (bug)
#14765 opened Mar 13, 2025 by markmc