[AMD] Update search-space for ATOM DSR1 configs #699

Merged
cquil11 merged 8 commits into SemiAnalysisAI:main from ChuanLi1101:patch-1 on Feb 25, 2026

@ChuanLi1101
Contributor

@ChuanLi1101 ChuanLi1101 commented Feb 13, 2026

Summary

  • Consolidate DSR1 FP4 MI355X ATOM & ATOM-MTP search space to TP=8 only (TP=4 commented out)
  • Extend concurrency range to 256 across all sequence lengths (1k1k, 1k8k, 8k1k)
  • Fix MTP 1k8k conc-start from 256 → 4 to enable full concurrency sweep
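To see why the `conc-start` fix matters, here is a minimal sketch of how a start/end pair might expand into sweep points. It assumes the sweep steps concurrency by doubling, which this thread does not confirm; the function name `concurrency_points` and the `conc_end` parameter are hypothetical and only mirror the `conc-start` key mentioned above.

```python
def concurrency_points(conc_start, conc_end, factor=2):
    """Expand a start/end pair into the list of concurrencies to benchmark.

    Assumption: the sweep multiplies concurrency by `factor` each step.
    The real sweep generator may step differently.
    """
    if conc_start < 1 or factor < 2:
        raise ValueError("conc_start must be >= 1 and factor must be >= 2")
    points = []
    conc = conc_start
    while conc <= conc_end:
        points.append(conc)
        conc *= factor
    return points


# With the start pinned at 256 and the range ending at 256, only one point runs.
print(concurrency_points(256, 256))  # [256]

# Starting from 4 covers the whole range up to 256.
print(concurrency_points(4, 256))    # [4, 8, 16, 32, 64, 128, 256]
```

Under that assumption, a start of 256 collapses the 1k8k MTP sweep to a single data point, which is consistent with the fix described in the summary.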


@ChuanLi1101
Contributor Author

The original search space was configured with limited settings and didn’t fully cover the Pareto frontier, especially for the newly added MTP cases. We’ve updated the configuration so we can now observe the full curve in the full sweep.
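As a rough illustration of what "covering the Pareto frontier" means here, the sketch below filters a set of benchmark results down to the non-dominated points. It assumes each result is an `(x, y)` pair where larger is better on both axes (for example, per-user interactivity and per-GPU throughput); the metric choice, the function name `pareto_frontier`, and the sample numbers are hypothetical and are not measurements from this PR.

```python
def pareto_frontier(points):
    """Return the points that no other point beats on both axes.

    Assumption: larger is better for both x and y.
    """
    frontier = []
    best_y = float("-inf")
    # Walk from the largest x downward; a point survives only if its y
    # exceeds the best y seen among points with larger-or-equal x.
    for x, y in sorted(points, key=lambda p: (-p[0], -p[1])):
        if y > best_y:
            frontier.append((x, y))
            best_y = y
    return frontier


# Made-up illustrative points, not benchmark data.
results = [(1, 10), (2, 8), (3, 3), (2, 5), (1, 9)]
print(pareto_frontier(results))  # [(3, 3), (2, 8), (1, 10)]
```

If the sweep never runs the configurations that would produce a frontier point, that point is simply missing from the curve, which is the gap the wider concurrency range is meant to close.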

@seungrokj
Collaborator

/sweep test-config --config-files .github/configs/amd-master.yaml --runner-config .github/configs/runners.yaml --config-keys dsr1-fp4-mi355x-atom-mtp dsr1-fp4-mi355x-atom

@github-actions
Contributor

@seungrokj Kicking off a sweep.

Run: https://github.com/InferenceMAX/InferenceMAX/actions/runs/21997709058
Command: test-config --config-files .github/configs/amd-master.yaml --runner-config .github/configs/runners.yaml --config-keys dsr1-fp4-mi355x-atom-mtp dsr1-fp4-mi355x-atom
Pinned ref: 4c44854
Approval: not required (trusted collaborator).

@functionstackx
Contributor

@pr-claude review plz

Reduce the search space to depend less on compute, as most of the Pareto frontier falls under TP=8.
@functionstackx
Contributor

@claude add perf changelog.yaml to dsr1-fp4-mi355x-atom & dsr1-fp4-mi355x-atom-mtp and add a description of the change and then review the PR plz. lgtm to me but want ur second eye claude please

@Klaud-Cold
Collaborator

Claude Code is working…

I'll analyze this and get back to you.

View job run

@cquil11
Collaborator

cquil11 commented Feb 23, 2026

@claude add perf changelog.yaml to dsr1-fp4-mi355x-atom & dsr1-fp4-mi355x-atom-mtp and add a description of the change and then review the PR plz. lgtm to me but want ur second eye claude please

@Klaud-Cold
Collaborator

Claude Code is working…

I'll analyze this and get back to you.

View job run

@cquil11
Collaborator

cquil11 commented Feb 23, 2026

@seungrokj @ChuanLi1101 Was it your intention to comment out TP 4 scenarios??

@cquil11 changed the title from "Update search-space configurations in amd-master.yaml" to "[AMD] Update search-space for ATOM DSR1 configs" on Feb 23, 2026

@seungrokj
Collaborator

> @seungrokj @ChuanLi1101 Was it your intention to comment out TP 4 scenarios??

I left a comment inline. I think we should keep TP=4 for certain concurrencies.

@ChuanLi1101 @cquil11

Revise the changes to make sure we capture the Pareto frontier.
Update the search space.
Contributor Author

@ChuanLi1101 ChuanLi1101 left a comment

Reduce the search space to keep only the points on the Pareto frontier.

Collaborator

@cquil11 cquil11 left a comment

lgtm

@cquil11 cquil11 merged commit dee9cf7 into SemiAnalysisAI:main Feb 25, 2026
cquil11 added a commit that referenced this pull request Feb 25, 2026
cquil11 added a commit that referenced this pull request Feb 25, 2026
cquil11 added a commit that referenced this pull request Feb 25, 2026
cquil11 added a commit that referenced this pull request Feb 25, 2026
* Revert "Revert "[AMD] Update search-space for ATOM DSR1 configs (#699)" (#791)"

This reverts commit 96e54ca.

* Update amd-master.yaml

5 participants