Adjust MiniMax MI355X block size for TP8 EP8#1228

Closed
jiacao-amd wants to merge 3 commits into SemiAnalysisAI:main from jiacao-amd:minimax-block16-tp8ep8-block32
Conversation

@jiacao-amd
Collaborator

Summary

  • default MiniMax MI355X vLLM runs use block size 16 with the shuffled KV cache layout enabled
  • special-case TP8/EP8 to disable the shuffled KV cache layout and use block size 32
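The special case above amounts to a branch in the benchmark script's configuration. A minimal sketch of that logic follows; `TP_SIZE`, `EP_SIZE`, `USE_SHUFFLED_KV_CACHE`, and `BLOCK_SIZE` are illustrative names for this example, not the variables actually used in `benchmarks/single_node/minimaxm2.5_fp8_mi355x.sh`:

```shell
#!/usr/bin/env bash
# Sketch of the TP8/EP8 special case described in the summary.
# Variable names here are hypothetical, chosen for illustration only.
TP_SIZE="${TP_SIZE:-8}"
EP_SIZE="${EP_SIZE:-8}"

if [ "${TP_SIZE}" -eq 8 ] && [ "${EP_SIZE}" -eq 8 ]; then
  # TP8/EP8: disable the shuffled KV cache layout and use block size 32
  USE_SHUFFLED_KV_CACHE=0
  BLOCK_SIZE=32
else
  # Default path: shuffled KV cache layout enabled, block size 16
  USE_SHUFFLED_KV_CACHE=1
  BLOCK_SIZE=16
fi

echo "shuffled_kv=${USE_SHUFFLED_KV_CACHE} block_size=${BLOCK_SIZE}"
```

With no overrides, the script takes the TP8/EP8 branch; the `bash -n` check in the Testing section would only verify the syntax of such a script, not execute it.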

Testing

  • bash -n benchmarks/single_node/minimaxm2.5_fp8_mi355x.sh

Contributor

@claude Bot left a comment


Claude Code Review

This pull request is from a fork — automated review is disabled. A repository maintainer can comment @claude review to run a one-time review.

@jiacao-amd force-pushed the minimax-block16-tp8ep8-block32 branch from c01e0b6 to d66409b on April 29, 2026 16:41
@github-actions
Contributor

@jiacao-amd Kicking off a sweep.

Run: https://github.com/SemiAnalysisAI/InferenceX/actions/runs/25121853688
Command: test-config --config-files .github/configs/amd-master.yaml --config-keys minimaxm2.5-fp8-mi355x-vllm
Pinned ref: d66409b
Approval: not required (trusted collaborator).

@jiacao-amd
Collaborator Author

/sweep test-config --config-files .github/configs/amd-master.yaml --config-keys minimaxm2.5-fp8-mi355x-vllm

@github-actions
Contributor

github-actions Bot commented May 4, 2026

@jiacao-amd Kicking off a sweep.

Run: https://github.com/SemiAnalysisAI/InferenceX/actions/runs/25330781481
Command: test-config --config-files .github/configs/amd-master.yaml --config-keys minimaxm2.5-fp8-mi355x-vllm
Pinned ref: a34fb25
Approval: not required (trusted collaborator).

@jiacao-amd force-pushed the minimax-block16-tp8ep8-block32 branch from 9c6bfd2 to 37962d4 on May 4, 2026 18:32
@jiacao-amd
Collaborator Author

/sweep test-config --config-files .github/configs/amd-master.yaml --config-keys minimaxm2.5-fp8-mi355x-vllm

@github-actions
Contributor

github-actions Bot commented May 4, 2026

@jiacao-amd Kicking off a sweep.

Run: https://github.com/SemiAnalysisAI/InferenceX/actions/runs/25336346089
Command: test-config --config-files .github/configs/amd-master.yaml --config-keys minimaxm2.5-fp8-mi355x-vllm
Pinned ref: 37962d4
Approval: not required (trusted collaborator).

@jiacao-amd
Collaborator Author

Superseded by #1276. The replacement PR carries the same MiniMax MI355X vLLM scheduling change, but its branch is pushed directly to SemiAnalysisAI/InferenceX rather than the fork, so CI automation avoids the fork-PR permission issues.
