Update dsr1-fp8-b200-sglang and -mtp SGLang image to v0.5.12-cu130 by Klaud-Cold · Pull Request #1416 · SemiAnalysisAI/InferenceX

Klaud-Cold · 2026-05-17T03:16:01Z

Updates SGLang image for dsr1-fp8-b200-sglang and dsr1-fp8-b200-sglang-mtp from v0.5.11-cu130 to v0.5.12-cu130.
Ref #1154

Generated with Claude Code

… to v0.5.12-cu130 Ref #1154 Co-authored-by: Klaud Cold <Klaud-Cold@users.noreply.github.com>

github-actions · 2026-05-17T03:16:09Z

Thanks for the contribution! For vLLM & SGLang, please ensure that your recipe is similar to the official vLLM recipes and/or the SGLang cookbook.

If it is not, please create a PR first before we can merge your single node PR into the master branch. Let's ensure that the documentation is first-class so that the entire ML community can benefit from your hard work! Thank you.

PR authors are responsible for ensuring that after merging, all GitHub Action jobs fully pass. Failures are often just flakes, and simply re-running the failed jobs will fix them. If failed jobs are re-run, PR authors are responsible for ensuring they pass. See GitHub's docs on re-running failed jobs: https://docs.github.com/en/actions/how-tos/manage-workflow-runs/re-run-workflows-and-jobs#re-running-failed-jobs-in-a-workflow

As a rule of thumb, PR authors should request a review and get a PR approval from the respective companies' CODEOWNERS before requesting a review from core maintainers.

If additional help is needed, PR authors can reach out to core maintainers over Slack.

github-actions · 2026-05-17T03:16:26Z

see unofficial run visualizer at https://inferencex.semianalysis.com/inference?unofficialRun=25980011920
see unofficial run visualizer at https://inferencex.semianalysis.com/evaluation?unofficialRun=25980011920

claude · 2026-05-17T03:20:08Z

+    - dsr1-fp8-b200-sglang-mtp
+  description:
+    - "Update SGLang image from v0.5.11-cu130 to v0.5.12-cu130"
+  pr-link: XXX


🟡 The new entry added to perf-changelog.yaml uses pr-link: XXX as an unfilled placeholder instead of the actual PR URL. Every other entry in this file uses a proper GitHub PR URL (e.g., https://github.com/SemiAnalysisAI/InferenceX/pull/1402 on line 2550); this should be https://github.com/SemiAnalysisAI/InferenceX/pull/1416 so the changelog entry remains traceable to its source PR.

Extended reasoning...

What the bug is

The diff appends a new entry to perf-changelog.yaml documenting the SGLang image bump for dsr1-fp8-b200-sglang and dsr1-fp8-b200-sglang-mtp. The last line of that new entry reads:

pr-link: XXX

XXX is a placeholder string, not a valid URL. It is the literal token typically left behind when a template-driven changelog entry is generated and the author/automation forgets to substitute in the real PR number.

Why this matters

Every other entry in perf-changelog.yaml follows a clear convention: pr-link is a full GitHub URL pointing at the PR that introduced the change. For example, the four entries immediately preceding this one are:

line 2532: pr-link: https://github.com/SemiAnalysisAI/InferenceX/pull/1407

line 2538: pr-link: https://github.com/SemiAnalysisAI/InferenceX/pull/1408

line 2544: pr-link: https://github.com/SemiAnalysisAI/InferenceX/pull/1409

line 2550: pr-link: https://github.com/SemiAnalysisAI/InferenceX/pull/1402

The new entry breaks this convention. The changelog is the authoritative record connecting an image/config change to the PR that motivated it; leaving XXX in place severs that link and makes future debugging harder when someone wants to understand why SGLang was bumped from v0.5.11-cu130 to v0.5.12-cu130.

Step-by-step proof

1. View perf-changelog.yaml lines 2548–2557: the diff appends a new list element under the top-level list.

2. The element has three keys: config-keys (a list with two recipes), description (a one-line summary), and pr-link.

3. The value of pr-link is the literal string XXX: not a URL, not a number, not a reference.

4. Compare to any other pr-link value in the file: they are all of the form https://github.com/SemiAnalysisAI/InferenceX/pull/<N>.

5. The current PR is Update dsr1-fp8-b200-sglang and -mtp SGLang image to v0.5.12-cu130 #1416, so the expected value is https://github.com/SemiAnalysisAI/InferenceX/pull/1416.
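The comparison in the proof above could be automated. The following is a minimal sketch, not part of this PR or the repository's tooling: the function name `find_bad_pr_links` is hypothetical, the URL pattern is assumed from the entries quoted in this review, and the sample entry is reconstructed from the diff and the description above. It scans the text line by line rather than parsing YAML, so it needs only the standard library.

```python
import re

# Assumed convention, taken from the pr-link values quoted in this review.
PR_LINK_RE = re.compile(r"^https://github\.com/SemiAnalysisAI/InferenceX/pull/\d+$")


def find_bad_pr_links(changelog_text):
    """Return (line_number, value) pairs for pr-link values that are not PR URLs."""
    bad = []
    for lineno, line in enumerate(changelog_text.splitlines(), start=1):
        stripped = line.strip()
        if not stripped.startswith("pr-link:"):
            continue
        # Take everything after the key and drop optional surrounding quotes.
        value = stripped[len("pr-link:"):].strip().strip("'\"")
        if not PR_LINK_RE.match(value):
            bad.append((lineno, value))
    return bad


# Entry reconstructed from the diff and the review text (illustrative only).
sample = """\
- config-keys:
    - dsr1-fp8-b200-sglang
    - dsr1-fp8-b200-sglang-mtp
  description:
    - "Update SGLang image from v0.5.11-cu130 to v0.5.12-cu130"
  pr-link: XXX
"""

print(find_bad_pr_links(sample))  # [(6, 'XXX')]
```

Run against the full changelog text, a check along these lines would report any entry whose pr-link is still a placeholder, along with its line number.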

How to fix

Replace line 2557 with:

pr-link: https://github.com/SemiAnalysisAI/InferenceX/pull/1416
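If the placeholder comes from a template, the substitution could also be scripted. This is a sketch under assumptions: `fill_pr_link` is a hypothetical helper, and the placeholder token `XXX` and the URL prefix are taken from this review, not from any documented repository tooling.

```python
def fill_pr_link(changelog_text, pr_number, placeholder="XXX"):
    """Replace placeholder pr-link values with the full PR URL, preserving indentation."""
    base = "https://github.com/SemiAnalysisAI/InferenceX/pull/"  # prefix as quoted in the review
    out = []
    for line in changelog_text.splitlines(keepends=True):
        body = line.rstrip("\r\n")
        newline = line[len(body):]  # keep the original line ending
        indent, sep, value = body.partition("pr-link:")
        # Only touch lines where pr-link is the key and the value is the placeholder.
        if sep and indent.strip() == "" and value.strip() == placeholder:
            line = f"{indent}pr-link: {base}{pr_number}{newline}"
        out.append(line)
    return "".join(out)


print(fill_pr_link("  pr-link: XXX\n", 1416), end="")
```

Lines whose pr-link already holds a URL are left untouched, so the helper is safe to run over the whole file.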

Severity rationale

This is a data-quality / metadata issue in a changelog file — it does not affect runtime behavior of any recipe or benchmark. However, it is also the only non-trivial content change in this PR's perf-changelog.yaml diff, so fixing it before merge is cheap and keeps the changelog convention intact. Flagging as a nit.

github-actions · 2026-05-17T07:21:48Z

see unofficial run visualizer at https://inferencex.semianalysis.com/inference?unofficialRun=25980012131
see unofficial run visualizer at https://inferencex.semianalysis.com/evaluation?unofficialRun=25980012131

github-actions · 2026-05-17T13:22:16Z

see unofficial run visualizer at https://inferencex.semianalysis.com/inference?unofficialRun=25984496981
see unofficial run visualizer at https://inferencex.semianalysis.com/evaluation?unofficialRun=25984496981

functionstackx · 2026-05-17T18:01:28Z

/reuse-sweep-run

github-actions · 2026-05-17T18:02:04Z

see unofficial run visualizer at https://inferencex.semianalysis.com/inference?unofficialRun=25998506468
see unofficial run visualizer at https://inferencex.semianalysis.com/evaluation?unofficialRun=25998506468

Update dsr1-fp8-b200-sglang and dsr1-fp8-b200-sglang-mtp SGLang image…

3894f0d

… to v0.5.12-cu130 Ref #1154 Co-authored-by: Klaud Cold <Klaud-Cold@users.noreply.github.com>

Klaud-Cold requested a review from a team May 17, 2026 03:16

Klaud-Cold added the full-sweep-enabled label May 17, 2026

Klaud-Cold requested review from jgangani and kedarpotdar-nv as code owners May 17, 2026 03:16

Klaud-Cold added the full-sweep-enabled label May 17, 2026

github-project-automation Bot added this to InferenceMAX Board May 17, 2026

Klaud-Cold mentioned this pull request May 17, 2026

[Auto] Docker Image Updates Available - 2026-04-25 #1154

Open

claude Bot reviewed May 17, 2026

Merge remote-tracking branch 'origin/main' into HEAD

af7bd35

# Conflicts: # perf-changelog.yaml

Merge branch 'main' into claude/issue-1154-dsr1-fp8-b200-sglang

d911a84

functionstackx merged commit 8b89206 into main May 17, 2026
3 of 5 checks passed

functionstackx deleted the claude/issue-1154-dsr1-fp8-b200-sglang branch May 17, 2026 18:01

github-project-automation Bot moved this to Done in InferenceMAX Board May 17, 2026

functionstackx merged 3 commits into main from claude/issue-1154-dsr1-fp8-b200-sglang
