-
Notifications
You must be signed in to change notification settings - Fork 152
Pull requests: SemiAnalysisAI/InferenceX
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
dsv4-fp4-b300-vllm: bump to vllm v0.20.0, deep_gemm_mega_moe MoE
sweep-enabled
#1206
opened Apr 28, 2026 by
functionstackx
Contributor
Loading…
3 tasks
dsv4-fp4-b200-vllm: bump to vllm v0.20.0, deep_gemm_mega_moe MoE
full-sweep-enabled
#1204
opened Apr 28, 2026 by
functionstackx
Contributor
Loading…
4 tasks
Add dsv4-fp4-b300-vllm-mtp config (DSv4 vLLM B300 + MTP)
sweep-enabled
#1203
opened Apr 28, 2026 by
Oseltamivir
Collaborator
Loading…
6 tasks
Aiter MHC fix and keep DSv4 ATOM conc1
full-sweep-enabled
#1202
opened Apr 27, 2026 by
Oseltamivir
Collaborator
Loading…
[AMD/ROCM] Update gptoss-fp4-mi355x-atom config
AMD
#1195
opened Apr 27, 2026 by
seungrokj
Collaborator
Loading…
1 task
[AMD/ROCM] Update minimaxm2.5-fp8-mi355x-atom config
AMD
#1194
opened Apr 27, 2026 by
seungrokj
Collaborator
Loading…
1 task
dsv4-b300-sglang: update points
sweep-enabled
#1179
opened Apr 26, 2026 by
yhyang201
Collaborator
Loading…
gb300 1k1k sglang
sweep-enabled
#1169
opened Apr 26, 2026 by
Oseltamivir
Collaborator
Loading…
4 of 5 tasks
[DON'T MERGE] [NV] dsv4-fp4-gb200-dynamo-vllm
full-sweep-enabled
#1163
opened Apr 26, 2026 by
Ankur-singh
Collaborator
Loading…
Day 0 DeepSeek V4 Pro FP4 GB200 disaggregated SGLang benchmarks
sweep-enabled
#1157
opened Apr 25, 2026 by
Oseltamivir
Collaborator
Loading…
4 of 5 tasks
Day 0 GB300 DeepSeek-V4-Pro FP4 vLLM disagg
sweep-enabled
#1150
opened Apr 25, 2026 by
Oseltamivir
Collaborator
Loading…
3 tasks
[AMD/ROCM] Qwen3.5-397B-A17B BF16 MI355X Atom benchmarks
#1149
opened Apr 25, 2026 by
seungrokj
Collaborator
Loading…
[NVIDIA] chore: B200 single node DeepSeek v4 SGLang MTP
NVIDIA
sweep-enabled
#1145
opened Apr 24, 2026 by
cquil11
Collaborator
Loading…
1 task
Add H100 config: dsv4-fp8-dynamo-vllm (DeepSeek-V4-Pro multinode disagg)
sweep-enabled
#1142
opened Apr 24, 2026 by
Oseltamivir
Collaborator
Loading…
Add DeepSeek-V4-Pro SGLang aggregated GB200 benchmarks (NVIDIA srt-slurm PR #69)
sweep-enabled
#1137
opened Apr 24, 2026 by
Oseltamivir
Collaborator
Loading…
3 of 5 tasks
[AMD/ROCM] atom qwen3.5 fp4 on mi355x
AMD
#1133
opened Apr 24, 2026 by
seungrokj
Collaborator
Loading…
1 task
[AMD/ROCM] atom glm5 fp8 on mi355x
AMD
#1126
opened Apr 24, 2026 by
seungrokj
Collaborator
Loading…
2 tasks
[AMD/ROCM] GLM5.1 FP8 MTP Support on MI355X
AMD
#1122
opened Apr 23, 2026 by
ajith-sirra-amd
Contributor
Loading…
[WIP] Allow overriding srt-slurm repo/ref at the launcher level
#1118
opened Apr 22, 2026 by
Oseltamivir
Collaborator
Loading…
[AMD/ROCm] Add Kimi-K2.5 FP4 vLLM Eagle3 speculative decoding config for MI355X
#1116
opened Apr 22, 2026 by
chunfangamd
Collaborator
•
Draft
[AMD/Hyperloom] Tune dsr1-fp8-mi355x-sglang: --num-continuous-decode-steps 4 → 8
#1109
opened Apr 21, 2026 by
lishuoshuo-amd
Loading…
4 tasks done
Previous Next
ProTip!
Filter pull requests by the default branch with base:main.