-
Notifications
You must be signed in to change notification settings - Fork 175
Update dpskv4 GB300 non-MTP disagg SGLang image to nightly-20260520 #1528
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Changes from all commits
File filter
Filter by extension
Conversations
Jump to
Diff view
Diff view
There are no files selected for viewing
| Original file line number | Diff line number | Diff line change |
|---|---|---|
|
|
@@ -33,7 +33,7 @@ name: "disagg-gb300-10p1d-dep4-dep16-14-c8192" | |
|
|
||
| model: | ||
| path: "deepseek-v4-pro" | ||
| container: "lmsysorg/sglang:nightly-dev-cu13-20260519-dbac4647" | ||
| container: "lmsysorg/sglang:nightly-dev-cu13-20260520-425dffbd" | ||
| precision: "fp4" | ||
|
|
||
| dynamo: | ||
|
|
@@ -94,7 +94,6 @@ backend: | |
| SGLANG_LOG_FORWARD_ITERS: "1" | ||
| SGLANG_LOG_MS: "1" | ||
| SGLANG_REQUEST_STATE_WAIT_TIMEOUT: "60" | ||
| SGLANG_OPT_FP8_WO_A_GEMM: "0" | ||
|
|
||
| decode_environment: | ||
| PYTHONUNBUFFERED: "1" | ||
|
|
@@ -119,7 +118,6 @@ backend: | |
| SGLANG_LOG_FORWARD_ITERS: "1" | ||
| SGLANG_LOG_MS: "1" | ||
| SGLANG_REQUEST_STATE_WAIT_TIMEOUT: "60" | ||
| SGLANG_OPT_FP8_WO_A_GEMM: "0" | ||
| # is single-node only and corrupts results in 2-node decode setups. | ||
|
Comment on lines
118
to
121
Contributor
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. 🟡 The PR removes the Extended reasoning...What the bug isThe PR removes Step-by-step proof (using
|
||
|
|
||
| sglang_config: | ||
|
|
||
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
🟡 Missing
perf-changelog.yamlentry for this image bump. The immediately-preceding PR #1492 (20260518 → 20260519 bump of this samedsv4-fp4-gb300-dynamo-sglangconfig-key) added an explicit entry under that key, and other recent image-bump PRs (#1411, #1444, #1475) followed the same convention. Consider adding a parallel entry to keep the changelog consistent (also worth noting theSGLANG_OPT_FP8_WO_A_GEMM=0removal, which is a functional change worth recording).Extended reasoning...
What's missing
This PR bumps the SGLang image for the
dsv4-fp4-gb300-dynamo-sglangconfig-key (in.github/configs/nvidia-master.yaml:8762) fromnightly-dev-cu13-20260519-dbac4647tonightly-dev-cu13-20260520-425dffbdand, alongside that, removes theSGLANG_OPT_FP8_WO_A_GEMM=0workaround from sixdisagg-gb300-*.yamlrecipes (PR description: "fixed in 0520 nightly via sgl-project/sglang#25805"). It does not add an entry toperf-changelog.yaml.Why this is a convention break
The immediately-preceding PR for this same config-key — #1492 (commit
80c944e, 20260518 → 20260519) — added an explicit entry toperf-changelog.yamlat lines 3020–3024:The same pattern shows up across other recent image-bump PRs:
perf-changelog.yamlperf-changelog.yaml(+7 lines)The current PR (
fa55687) modifies 7 files (.github/configs/nvidia-master.yaml+ sixdisagg-gb300-*.yamlrecipes) but does not touchperf-changelog.yamlat all.Step-by-step proof
git show 80c944e --statfor PR Update dpskv4 GB300 non-MTP disagg SGLang image to nightly-20260519 #1492 showsperf-changelog.yaml | 14 ++++++++— i.e. the 20260518→20260519 bump added a changelog entry.perf-changelog.yamllines 3012–3024 still contain that entry underdsv4-fp4-gb300-dynamo-sglang.git show fa55687 --statfor the current PR lists 7 modified files:.github/configs/nvidia-master.yamlplus the sixdisagg-gb300-*.yamlrecipes.perf-changelog.yamlis not in the list.SGLANG_OPT_FP8_WO_A_GEMM=0from prefill+decode environments in 6 recipes), which is arguably even more worth recording.Impact
This is a documentation/observability concern, not a runtime bug — the recipes themselves will run fine. The missed entry only affects the historical perf-tracking trail for this config-key. Given that the previous bump (one day earlier, same author) did add the entry, this looks more like an oversight than an intentional skip.
Suggested fix
Add a
perf-changelog.yamlentry underdsv4-fp4-gb300-dynamo-sglangalong the lines of: