[SGLang Workflow] Upload benchmark results to AWS S3 #69
Conversation
echo "⚠️ No benchmark results found in ${BENCHMARK_RESULTS}" >> $GITHUB_STEP_SUMMARY | ||
fi | ||
python3 .github/scripts/upload_benchmark_results.py \ |
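For context, the step this diff touches might look roughly like the sketch below. Only the three lines shown in the diff are from the PR; the step name, the existence check, and the value of BENCHMARK_RESULTS are assumptions, and the upload script's arguments are elided in the diff so none are shown.

```yaml
# Rough reconstruction of the upload step; everything beyond the three diff lines is assumed.
- name: Upload benchmark results
  env:
    BENCHMARK_RESULTS: benchmark_results   # assumed directory name
  run: |
    if [ ! -d "${BENCHMARK_RESULTS}" ]; then
      echo "⚠️ No benchmark results found in ${BENCHMARK_RESULTS}" >> $GITHUB_STEP_SUMMARY
    fi
    # The script's arguments are elided in the diff, so none are shown here
    python3 .github/scripts/upload_benchmark_results.py
```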
I notice that this workflow doesn't have a schedule yet. Are you planning to add one? I'm thinking daily, biweekly, or weekly, depending on how frequently we need to look at SGLang results.
Thanks for pointing it out. I have added it as weekly for now, as we discussed initially. SGLang releases new stable versions monthly, but their pre-release versions mostly come out every week, so I think weekly is fine for now. In the future, we can make it less or more frequent based on feedback.
I think weekly is a good starting point
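For reference, a weekly trigger in the workflow could look something like the sketch below; the exact day and time are assumptions, not necessarily what was committed in this PR.

```yaml
# Illustrative weekly trigger; the actual cron expression in the PR may differ.
on:
  schedule:
    - cron: "0 0 * * 1"   # Mondays at 00:00 UTC in this example
  workflow_dispatch: {}    # keep manual runs possible (assumption)
```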
"dataset_path": "./ShareGPT_V3_unfiltered_cleaned_split.json", | ||
"num_prompts": 200 | ||
} | ||
}, |
These changes upload the SGLang benchmark results to the AWS S3 bucket, which in turn triggers the AWS Lambda function that loads them into the ClickHouse database, from where they are eventually rendered in the HUD dashboard.

For SGLang benchmarking, the `vllm bench serve` command currently generates a `test.pytorch.json` file containing the PyTorch-formatted benchmark results, but it hardcodes the benchmark name to `vllm benchmark`. Because of that, every SGLang benchmarking run was getting the same benchmark name. This caused issues in the HUD dashboard, where the ClickHouse SQL queries look for the benchmark name `SGLang benchmark`, so the results came back as an empty array. To fix this, a check was added in the bash script to replace the benchmark name with the correct one, since the original implementation lives in the vLLM code repository.
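For illustration, the kind of check described above could look something like the sketch below. The file name and the two benchmark names come from this description; the step layout and the use of `sed` are assumptions rather than the actual change.

```yaml
# Illustrative only: rewrite the hardcoded benchmark name in the results file
# before uploading; the sed-based approach here is an assumption.
- name: Fix benchmark name in results
  run: |
    if [ -f test.pytorch.json ]; then
      # Replace the name hardcoded by `vllm bench serve` with the one the
      # ClickHouse queries expect (values taken from the PR description).
      sed -i 's/vllm benchmark/SGLang benchmark/g' test.pytorch.json
    fi
```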
Testing:
Verified from the GitHub workflow that the results are uploaded correctly to AWS S3.
Verified from the flambeau dashboard that the results are uploaded to the ClickHouse database.