[Benchmark] Adding "facebook/opt-125m" model in the benchmarking tests for both vLLM and SGLang #76

namanlalitnyu · 2025-09-10T16:44:59Z

Changes:

This PR involves adding facebook/opt-125m model in the benchmarking tests (serving, latency, and throughput) for both vLLM and SGLang frameworks.
Based on our discussion, we need to remove the serving tests for qwen model for SGLang benchmarking.
Fixing issue related to the SGLang benchmarks failing.

Testing:

Github Action for SGLang benchmarking containing the facebook model: link
Github Action for vLLM benchmarking containing the facebook model: link
HUD Dashboard also showing the facebook model benchmarking results: link

namanlalitnyu · 2025-09-12T06:23:57Z

One of the Github actions for qwen model is failing, and its unrelated to our changes, as this PR doesn't touch the vllm-benchmark workflow, and also the test fails due to the vllm server not starting. Hence, we are good with merging these changes.

Added facebook model in the benchmark tests

b4183b8

meta-cla bot added the cla signed label Sep 10, 2025

facebook-github-bot added the module: rocm label Sep 10, 2025

namanlalitnyu temporarily deployed to pytorch-x-vllm September 10, 2025 16:45 — with GitHub Actions Inactive

namanlalitnyu had a problem deploying to pytorch-x-vllm September 10, 2025 16:45 — with GitHub Actions Error

namanlalitnyu temporarily deployed to pytorch-x-vllm September 10, 2025 16:45 — with GitHub Actions Inactive

namanlalitnyu had a problem deploying to pytorch-x-vllm September 10, 2025 16:45 — with GitHub Actions Error

namanlalitnyu temporarily deployed to pytorch-x-vllm September 10, 2025 16:45 — with GitHub Actions Inactive

namanlalitnyu had a problem deploying to pytorch-x-vllm September 10, 2025 16:45 — with GitHub Actions Error

namanlalitnyu temporarily deployed to pytorch-x-vllm September 10, 2025 16:45 — with GitHub Actions Inactive

namanlalitnyu had a problem deploying to pytorch-x-vllm September 10, 2025 16:45 — with GitHub Actions Error

fixing model length issue

6399962

namanlalitnyu had a problem deploying to pytorch-x-vllm September 10, 2025 17:29 — with GitHub Actions Error

namanlalitnyu temporarily deployed to pytorch-x-vllm September 10, 2025 17:29 — with GitHub Actions Inactive

namanlalitnyu temporarily deployed to pytorch-x-vllm September 11, 2025 20:30 — with GitHub Actions Inactive

namanlalitnyu had a problem deploying to pytorch-x-vllm September 11, 2025 20:30 — with GitHub Actions Error

namanlalitnyu temporarily deployed to pytorch-x-vllm September 11, 2025 20:30 — with GitHub Actions Inactive

namanlalitnyu had a problem deploying to pytorch-x-vllm September 11, 2025 20:30 — with GitHub Actions Error

namanlalitnyu temporarily deployed to pytorch-x-vllm September 11, 2025 20:30 — with GitHub Actions Inactive

namanlalitnyu temporarily deployed to pytorch-x-vllm September 12, 2025 02:30 — with GitHub Actions Inactive

namanlalitnyu had a problem deploying to pytorch-x-vllm September 12, 2025 02:30 — with GitHub Actions Failure

namanlalitnyu had a problem deploying to pytorch-x-vllm September 12, 2025 05:38 — with GitHub Actions Failure

namanlalitnyu merged commit 774075f into main Sep 12, 2025
76 of 80 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[Benchmark] Adding "facebook/opt-125m" model in the benchmarking tests for both vLLM and SGLang #76

[Benchmark] Adding "facebook/opt-125m" model in the benchmarking tests for both vLLM and SGLang #76

Uh oh!

namanlalitnyu commented Sep 10, 2025 •

edited

Loading

Uh oh!

namanlalitnyu commented Sep 12, 2025

Uh oh!

Uh oh!

Uh oh!

[Benchmark] Adding "facebook/opt-125m" model in the benchmarking tests for both vLLM and SGLang #76

[Benchmark] Adding "facebook/opt-125m" model in the benchmarking tests for both vLLM and SGLang #76

Uh oh!

Conversation

namanlalitnyu commented Sep 10, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Changes:

Testing:

Uh oh!

namanlalitnyu commented Sep 12, 2025

Uh oh!

Uh oh!

Uh oh!

namanlalitnyu commented Sep 10, 2025 •

edited

Loading