Conversation

@huydhn huydhn commented Sep 11, 2025

The main change adds DeepSeek-V3 and DeepSeek-R1 on B200. To achieve this, I introduce a PLATFORM_SKIPS configuration that skips these models on A100 and H100. I also use this new configuration to shift Llama4 and the bigger Gemma3 and Qwen3 variants to B200, while keeping Llama3 and the smaller variants of these two on A100/H100 (for more coverage).

For reference: configs for internal jobs for these models P1930074197
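A skip configuration like the one described might look roughly like the sketch below. This is a minimal illustration, not the actual PR code: the dictionary shape, the `should_run` helper, and all model/platform names here are assumptions for illustration only.

```python
# Hypothetical sketch of a PLATFORM_SKIPS-style configuration (names are
# placeholders, not taken from the actual PR). Each platform maps to the
# set of models that should NOT be benchmarked there, so the largest
# models run only on B200.
PLATFORM_SKIPS = {
    "a100": {"deepseek-v3", "deepseek-r1", "llama4", "gemma3-27b", "qwen3-32b"},
    "h100": {"deepseek-v3", "deepseek-r1", "llama4", "gemma3-27b", "qwen3-32b"},
    "b200": set(),  # B200 runs everything, including the biggest models
}

def should_run(model: str, platform: str) -> bool:
    """Return True if the benchmark for `model` should run on `platform`.

    Unknown platforms skip nothing, so new hardware runs the full suite
    by default.
    """
    return model not in PLATFORM_SKIPS.get(platform, set())
```

With this shape, moving a model between platforms is a one-line change to the skip sets rather than an edit to each benchmark job definition.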

Signed-off-by: Huy Do <huydhn@gmail.com>
@huydhn huydhn requested a review from BoyuanFeng September 11, 2025 01:54
@meta-cla meta-cla bot added the cla signed label Sep 11, 2025
@huydhn huydhn requested a deployment to pytorch-x-vllm September 11, 2025 01:57 — with GitHub Actions In progress
@huydhn huydhn commented Sep 11, 2025

The benchmark surfaces this recent error https://github.com/pytorch/pytorch-integration-testing/actions/runs/17631888148/job/50100699186#step:16:5520 from vLLM vllm-project/vllm#23582 (only on B200, I think). As a result, some metrics like latency or tokens/s are missing for now. No action is likely needed on the CI side.

@huydhn huydhn merged commit 963053c into main Sep 11, 2025
1 check passed