Skip to content

Update CI docker image and set vllm eager enforce_eager to False#614

Merged
chtruong814 merged 11 commits intomainfrom
chtruong/fix-vllm
Feb 23, 2026
Merged

Update CI docker image and set vllm eager enforce_eager to False#614
chtruong814 merged 11 commits intomainfrom
chtruong/fix-vllm

Conversation

@chtruong814
Copy link
Copy Markdown
Contributor

@chtruong814 chtruong814 commented Feb 21, 2026

Update CI docker image and set vllm eager enforce_eager to False

vllm checks if certain torch features are available based on the torch version. In particular, vllm 0.14.1 assumes 32 bit indexing is available for torch versions >= 2.10.0.dev. However, when installed in the 25.11 NGC Pytorch container, vllm believes that feature should work with the container's torch version. But this is not correct.

https://github.com/vllm-project/vllm/blob/v0.14.1/vllm/compilation/decorators.py#L524

So, this change updates the CI docker image with a patched version of vllm that does not treat the 25.11 NGC Pytorch version as >= 2.10.0.dev. We also previously set the VLLMExporter to default to enforce_eager=True. This change should enable enforce_eager=False.

Signed-off-by: Charlie Truong <chtruong@nvidia.com>
Signed-off-by: Charlie Truong <chtruong@nvidia.com>
Signed-off-by: Charlie Truong <chtruong@nvidia.com>
Signed-off-by: Charlie Truong <chtruong@nvidia.com>
Signed-off-by: Charlie Truong <chtruong@nvidia.com>
Signed-off-by: Charlie Truong <chtruong@nvidia.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

CI export r0.4.0 Cherry-pick PR to r0.4.0 release branch tests vLLM

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants