Skip to content

CI: Refactor ROCm CI to use GPU-sized runners and build-only jobs#528

Merged
leo-automation merged 9 commits intodevfrom
leo/refactor-ci-gha
Apr 29, 2026
Merged

CI: Refactor ROCm CI to use GPU-sized runners and build-only jobs#528
leo-automation merged 9 commits intodevfrom
leo/refactor-ci-gha

Conversation

@leo-automation
Copy link
Copy Markdown
Collaborator

@leo-automation leo-automation commented Apr 7, 2026

Description

Split ROCm CI into dedicated build, single-GPU, multi-GPU, and examples jobs so each workload uses appropriately sized runners.

Specifics

  • run wheel builds on build-only-te
  • run examples on 1-GPU runners
  • run sGPU tests on 4-GPU runners
  • keep mGPU tests on 8-GPU runners
  • build and install explicit TE wheels in downstream jobs
  • use a non-GPU runner for AITER prebuilt upload

Tests

@leo-automation leo-automation marked this pull request as draft April 7, 2026 14:16
@leo-automation leo-automation added the ci-level 3 CI test level 3 label Apr 7, 2026
@leo-automation leo-automation marked this pull request as ready for review April 8, 2026 14:58
Comment thread .github/workflows/aiter-prebuilt-upload.yml Outdated
Comment thread .github/workflows/rocm-ci.yml
Comment thread .github/workflows/rocm-ci.yml Outdated
Comment thread .github/workflows/rocm-ci.yml Outdated
Comment thread .github/workflows/rocm-ci.yml Outdated
Comment thread .github/workflows/rocm-ci.yml Outdated
Comment thread .github/workflows/rocm-ci.yml
Comment thread .github/workflows/rocm-ci.yml Outdated
Comment thread .github/workflows/rocm-ci.yml Outdated
Comment thread .github/workflows/rocm-ci.yml
@leo-automation leo-automation requested a review from ipanfilo April 16, 2026 15:11
Comment thread .github/workflows/rocm-ci.yml Outdated
Comment thread .github/workflows/rocm-ci.yml Outdated
Comment thread .github/workflows/rocm-ci.yml
Comment thread .github/workflows/rocm-ci.yml
Comment thread .github/workflows/rocm-ci.yml
Comment thread .github/workflows/rocm-ci.yml Outdated
Comment thread .github/workflows/rocm-ci.yml Outdated
Comment thread .github/workflows/rocm-ci.yml
Comment thread .github/workflows/rocm-ci.yml Outdated
Comment thread .github/workflows/rocm-ci.yml Outdated
Comment thread ci/_utils.sh Outdated
Comment thread ci/_utils.sh Outdated
Comment thread .github/workflows/rocm-ci.yml Outdated
Comment thread .github/workflows/rocm-ci.yml Outdated
Comment thread .github/workflows/rocm-ci.yml Outdated
Comment thread .github/workflows/rocm-ci.yml Outdated
Copy link
Copy Markdown
Collaborator

@ipanfilo ipanfilo left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@leo-automation leo-automation merged commit 6b96c46 into dev Apr 29, 2026
10 of 18 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

ci-level 3 CI test level 3

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants