Update CI docker image and set vllm eager enforce_eager to False by chtruong814 · Pull Request #614 · NVIDIA-NeMo/Export-Deploy

chtruong814 · 2026-02-21T09:20:55Z

Update CI docker image and set vllm eager enforce_eager to False

vllm checks if certain torch features are available based on the torch version. In particular, vllm 0.14.1 assumes 32 bit indexing is available for torch versions >= 2.10.0.dev. However, when installed in the 25.11 NGC Pytorch container, vllm believes that feature should work with the container's torch version. But this is not correct.

https://github.com/vllm-project/vllm/blob/v0.14.1/vllm/compilation/decorators.py#L524

So, this change updates the CI docker image with a patched version of vllm that does not treat the 25.11 NGC Pytorch version as >= 2.10.0.dev. We also previously set the VLLMExporter to default to enforce_eager=True. This change should enable enforce_eager=False.

Signed-off-by: Charlie Truong <chtruong@nvidia.com>

Set assume_32_bit_indexing to False for vllm

b1f8a76

Signed-off-by: Charlie Truong <chtruong@nvidia.com>

chtruong814 requested review from athitten, meatybobby, oyilmaz-nvidia and pthombre as code owners February 21, 2026 09:20

github-actions bot added export tests vLLM labels Feb 21, 2026

copy-pr-bot bot temporarily deployed to test February 21, 2026 09:21 Inactive

copy-pr-bot bot temporarily deployed to nemo-ci February 21, 2026 09:28 Inactive

copy-pr-bot bot had a problem deploying to nemo-ci February 21, 2026 09:28 Failure

copy-pr-bot bot temporarily deployed to nemo-ci February 21, 2026 09:28 Inactive

copy-pr-bot bot had a problem deploying to nemo-ci February 21, 2026 09:28 Failure

Fix vllm unit test

9e465c2

Signed-off-by: Charlie Truong <chtruong@nvidia.com>

copy-pr-bot bot temporarily deployed to test February 21, 2026 12:35 Inactive

copy-pr-bot bot had a problem deploying to nemo-ci February 21, 2026 12:36 Failure

copy-pr-bot bot had a problem deploying to nemo-ci February 21, 2026 12:36 Error

copy-pr-bot bot had a problem deploying to nemo-ci February 21, 2026 12:36 Failure

chtruong814 added 2 commits February 21, 2026 13:12

Fix unit test

466da9e

Signed-off-by: Charlie Truong <chtruong@nvidia.com>

Fix lint

8a934a6

Signed-off-by: Charlie Truong <chtruong@nvidia.com>

copy-pr-bot bot temporarily deployed to test February 21, 2026 14:47 Inactive

copy-pr-bot bot had a problem deploying to nemo-ci February 21, 2026 14:54 Error

Override vllm torch version check

66c28af

Signed-off-by: Charlie Truong <chtruong@nvidia.com>

copy-pr-bot bot temporarily deployed to test February 21, 2026 15:23 Inactive

copy-pr-bot bot had a problem deploying to nemo-ci February 21, 2026 15:28 Error

copy-pr-bot bot temporarily deployed to nemo-ci February 21, 2026 21:42 Inactive

copy-pr-bot bot had a problem deploying to nemo-ci February 21, 2026 22:21 Failure

copy-pr-bot bot temporarily deployed to nemo-ci February 21, 2026 22:26 Inactive

copy-pr-bot bot temporarily deployed to test February 21, 2026 23:12 Inactive

copy-pr-bot bot had a problem deploying to nemo-ci February 21, 2026 23:23 Error

copy-pr-bot bot temporarily deployed to test February 21, 2026 23:30 Inactive

copy-pr-bot bot temporarily deployed to nemo-ci February 21, 2026 23:36 Inactive

copy-pr-bot bot had a problem deploying to nemo-ci February 22, 2026 00:14 Failure

copy-pr-bot bot temporarily deployed to nemo-ci February 22, 2026 00:19 Inactive

copy-pr-bot bot temporarily deployed to test February 22, 2026 01:46 Inactive

copy-pr-bot bot temporarily deployed to nemo-ci February 22, 2026 02:29 Inactive

Revert compilation conig changes

b07c33a

Signed-off-by: Charlie Truong <chtruong@nvidia.com>

ko3n1g approved these changes Feb 22, 2026

View reviewed changes

oyilmaz-nvidia approved these changes Feb 23, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Update CI docker image and set vllm eager enforce_eager to False#614

Update CI docker image and set vllm eager enforce_eager to False#614
chtruong814 merged 11 commits intomainfrom
chtruong/fix-vllm

chtruong814 commented Feb 21, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

chtruong814 commented Feb 21, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

chtruong814 commented Feb 21, 2026 •

edited

Loading