vLLM inference servers container image fixes #582
Conversation
/assign @janetkuo
# 1 billion parameter model (smallest gemma model)
- name: MODEL_ID
  value: google/gemma-3-1b-it
# Fixes - https://github.com/vllm-project/vllm/issues/18859
Is the information here meaningful to users? Is LD_LIBRARY_PATH only needed for certain versions of vLLM? If so, that kind of information is good to capture here.
Good point. I've added the versions of the image that need this fix. I've kept a reference to the issue in case users need to do more research. Please let me know what you think.
7cbcf02 to a0ed769
janetkuo left a comment
/lgtm
[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: janetkuo, seans3

The full list of commands accepted by this bot can be found here. The pull request process is described here.

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment.
v0.11.0). Set the LD_LIBRARY_PATH environment variable to make the server more robust; this environment variable can be missing from some container images.
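The change the review discusses can be sketched as an `env` entry in the vLLM container spec. This is a minimal sketch, not the PR's exact diff: the LD_LIBRARY_PATH value shown is a hypothetical example, and the correct value depends on the container image in use.

```yaml
# Sketch only, assuming a Kubernetes container spec for vLLM.
env:
  - name: MODEL_ID
    value: google/gemma-3-1b-it        # smallest (1B) gemma model
  # Workaround for https://github.com/vllm-project/vllm/issues/18859;
  # only needed for some image versions (see discussion above).
  - name: LD_LIBRARY_PATH
    value: /usr/local/nvidia/lib64     # hypothetical path, image-dependent
```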