Skip to content

Add a comment on the use of float16 and set some EVs explicitly#69

Merged
functionstackx merged 4 commits intomainfrom
qcolombe-0927-rc1-cmtfix-mi355x-tweak
Sep 28, 2025
Merged

Add a comment on the use of float16 and set some EVs explicitly#69
functionstackx merged 4 commits intomainfrom
qcolombe-0927-rc1-cmtfix-mi355x-tweak

Conversation

@qcolombet
Copy link
Copy Markdown
Contributor

No description provided.

qcolombet and others added 4 commits September 28, 2025 22:00
The default for the environment variable `VLLM_ROCM_USE_AITER_MHA`
changed in the 09/27 RC1 docker.
Set the variable explicitly to prevent some performance differences.
@JArnoldAMD
Copy link
Copy Markdown
Collaborator

Fixed up a couple of envvar settings for Llama 70B FP4 in the 8192/1024 configs

@JArnoldAMD JArnoldAMD marked this pull request as ready for review September 28, 2025 21:35
@functionstackx functionstackx merged commit ea30e4f into main Sep 28, 2025
@functionstackx functionstackx deleted the qcolombe-0927-rc1-cmtfix-mi355x-tweak branch September 28, 2025 22:32
Oseltamivir added a commit that referenced this pull request Apr 24, 2026
NVIDIA srt-slurm PR #69 recipes set `slurm.partition: gb200` (or gb300),
which doesn't exist on our cluster — sbatch rejects the submission with
"invalid partition specified". srtctl's config.slurm.partition takes
precedence over the SLURM_PARTITION env var, so we rewrite both names
to $SLURM_PARTITION in all cloned recipe YAMLs immediately after
checkout.
Oseltamivir added a commit that referenced this pull request Apr 25, 2026
srtctl SrtConfig schema rejects backend.connector for the sglang
backend type. The field was carried over from the dynamo-vllm dsv4
recipes (where it is valid and set to null). PR #69/#75 sglang
recipes upstream do not declare it.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants