Mi355#455

Merged: msaroufim merged 20 commits into main from mi355 on Mar 4, 2026

Conversation

@msaroufim
Member

No description provided.

Mark Saroufim added 4 commits March 3, 2026 18:32
Add MI355X to GitHubGPU enum, GPU_TO_SM mapping, and github launcher runner routing with runner label mia1-p02-g29.

Add container image ghcr.io/gpu-mode/amd-runner:main with GPU device passthrough to amd_workflow.yml. Add numpy to AMD_REQUIREMENTS.

- Upgrade ROCm from 6.3.1 to 7.2
- Upgrade PyTorch to nightly rocm7.2
- Update aiter to latest commit (f3be04a) for recent FP4 kernel APIs
- Remove UCX, OpenMPI, and rocSHMEM builds (no longer needed)
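For readers unfamiliar with the codebase, the first commit above (a new enum member, a mapping entry, and runner routing) might look roughly like the sketch below. Only the names GitHubGPU, GPU_TO_SM, MI355X, and the runner label mia1-p02-g29 come from the commit message. The MI300 member, the RUNNER_LABELS table, the runner_for helper, the fallback label, and the None placeholder values are assumptions made for illustration and may not match the actual code in src/libkernelbot.

```python
from enum import Enum
from typing import Dict, Optional


class GitHubGPU(Enum):
    # MI300 is an assumed pre-existing member, shown only for contrast.
    MI300 = "MI300"
    # MI355X is the member this PR adds, per the commit message.
    MI355X = "MI355X"


# Placeholder values: the PR text does not show what GPU_TO_SM stores
# for AMD GPUs, so None is used here rather than a guessed value.
GPU_TO_SM: Dict[str, Optional[str]] = {
    "MI300": None,
    "MI355X": None,
}

# Hypothetical routing table from GPU type to self-hosted runner label.
RUNNER_LABELS: Dict[GitHubGPU, str] = {
    GitHubGPU.MI355X: "mia1-p02-g29",  # label taken from the commit message
}


def runner_for(gpu: GitHubGPU, default: str = "default-amd-runner") -> str:
    """Return the runner label for a GPU, or a hypothetical default label."""
    return RUNNER_LABELS.get(gpu, default)
```

Keeping the label in a lookup table rather than in branching logic means that adding another GPU later is a one-line change.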
@github-actions

github-actions Bot commented Mar 4, 2026

Coverage report

Columns: File, Statements, Missing, Coverage, Coverage (new stmts), Lines missing
  src/libkernelbot/consts.py
  src/libkernelbot/utils.py
  Project Total

This report was generated by python-coverage-comment-action

Mark Saroufim added 16 commits March 3, 2026 19:04
…GPU deps

- Upgrade ROCm from 6.3.1 to 7.1 (stable, matches host ROCm 7.0.1)
- Use stable torch 2.10.0+rocm7.1 instead of nightly
- Update aiter to latest commit (f3be04a) for recent FP4 kernel APIs
- Remove UCX, OpenMPI, and rocSHMEM builds

Fixes EACCES errors from root-owned files left by previous container runs.

- Replace python3.10 packages with python3 equivalents
- Use noble ROCm package instead of jammy
- Add --break-system-packages for pip on Noble
- Remove git-core PPA (not needed on Noble)
- Remove linux-headers install (not available during build)

Ensures the workflow timeout is at least 30 minutes to account for Docker image pulls and container initialization on new runners.
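The timeout floor described in the last commit can be sketched as a simple clamp. This is a minimal illustration, not the PR's actual change: the constant name, the function name, and the choice to round fractional minutes up are all assumptions. Only the 30-minute minimum comes from the commit message.

```python
import math

# The 30-minute floor is from the commit message; the name is hypothetical.
MIN_WORKFLOW_TIMEOUT_MINUTES = 30


def effective_timeout_minutes(requested_minutes: float) -> int:
    """Round the requested timeout up to whole minutes and apply the floor."""
    return max(MIN_WORKFLOW_TIMEOUT_MINUTES, math.ceil(requested_minutes))
```

With this clamp, a short requested timeout such as 5 minutes is raised to 30 so that an image pull on a fresh runner does not exhaust the budget, while longer requests pass through unchanged.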
@msaroufim msaroufim merged commit ee0919e into main Mar 4, 2026
3 of 5 checks passed