Skip to content

Add inferencex-pr skill for ATOM benchmark comparison and PR creation #795

Merged
valarLip merged 2 commits into
ROCm:mainfrom
seungrokj:main
May 15, 2026
Merged

Add inferencex-pr skill for ATOM benchmark comparison and PR creation #795
valarLip merged 2 commits into
ROCm:mainfrom
seungrokj:main

Conversation

@seungrokj
Copy link
Copy Markdown
Contributor

Summary

  • Adds .claude/commands/inferencex-pr.md — a Claude Code skill that automates comparing ATOM upstream benchmark throughput against InferenceX
    results and creating PRs to InferenceX when ATOM performance leads.

What the skill does

  1. Prompts the user to select a target model (Kimi-K2.5, DeepSeek-V4-Pro, DeepSeek-R1-0528, Qwen-3.5-397B-A17B, GLM-5, MiniMax-M2.5,
    gpt-oss-120b)
  2. Fetches data from InferenceX API and the latest successful ATOM benchmark GitHub Actions run
  3. Builds a throughput regression table (tput_per_gpu) comparing matching ISL/OSL/concurrency/TP/precision configurations
  4. If ATOM outperforms InferenceX, creates a PR to SemiAnalysisAI/InferenceX with updated docker image and serve arguments

seungrokj and others added 2 commits May 15, 2026 12:39
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
Renames the skill to better reflect its purpose: comparing ATOM upstream
benchmark results against InferenceX and reporting regression/improvement.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
@valarLip valarLip merged commit 7516968 into ROCm:main May 15, 2026
2 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants