Skip to content

[AMD] feat: support multiple cluster for mi355x DI CI workflow#433

Closed
billishyahao wants to merge 2 commits into
SemiAnalysisAI:mainfrom
billishyahao:billhe/up_di_stage2
Closed

[AMD] feat: support multiple cluster for mi355x DI CI workflow#433
billishyahao wants to merge 2 commits into
SemiAnalysisAI:mainfrom
billishyahao:billhe/up_di_stage2

Conversation

@billishyahao
Copy link
Copy Markdown
Collaborator

@billishyahao billishyahao commented Jan 15, 2026

This patch is to support multiple cluster for MI355X distributed inference CI workflow.
This patch co-work with the branch sa-260114 of recipe https://github.com/billishyahao/sglang_disagg.git
Co-author: @ichbinblau

@billishyahao
Copy link
Copy Markdown
Collaborator Author

/sweep test-config --config-keys dsr1-fp8-mi355x-sglang-disagg --runner-config .github/configs/runners.yaml --config-files .github/configs/amd-master.yaml

@github-actions
Copy link
Copy Markdown
Contributor

@billishyahao Kicking off a sweep.

Run: https://github.com/InferenceMAX/InferenceMAX/actions/runs/21024978364
Command: test-config --config-keys dsr1-fp8-mi355x-sglang-disagg --runner-config .github/configs/runners.yaml --config-files .github/configs/amd-master.yaml
Pinned ref: 654caea
Approval: not required (trusted collaborator).

@billishyahao
Copy link
Copy Markdown
Collaborator Author

/sweep test-config --config-keys dsr1-fp8-mi355x-sglang-disagg --runner-config .github/configs/runners.yaml --config-files .github/configs/amd-master.yaml

@github-actions
Copy link
Copy Markdown
Contributor

@billishyahao Kicking off a sweep.

Run: https://github.com/InferenceMAX/InferenceMAX/actions/runs/21025811633
Command: test-config --config-keys dsr1-fp8-mi355x-sglang-disagg --runner-config .github/configs/runners.yaml --config-files .github/configs/amd-master.yaml
Pinned ref: 7fed049
Approval: not required (trusted collaborator).

@SemiAnalysisAI SemiAnalysisAI deleted a comment from claude Bot Jan 15, 2026
@SemiAnalysisAI SemiAnalysisAI deleted a comment from claude Bot Jan 15, 2026
@billishyahao
Copy link
Copy Markdown
Collaborator Author

17/18 cases succeeded and 1 failed. Should be an occasional occurence @cquil11 .https://github.com/InferenceMAX/InferenceMAX/actions/runs/21025811633

@billishyahao
Copy link
Copy Markdown
Collaborator Author

Use new PR https://github.com/InferenceMAX/InferenceMAX/pull/445. Close this.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Projects

Development

Successfully merging this pull request may close these issues.

2 participants