Add instructions to run benchmarks #480
Conversation
✅ Deploy Preview for gateway-api-inference-extension ready!
/hold
```
@@ -0,0 +1,60 @@
apiVersion: apps/v1
kind: Deployment
```
@achandrasekar How would one start another run? Should we use a Job here instead, something that runs to completion?
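For illustration, a Job that runs to completion could look roughly like the sketch below. This is not the manifest from this PR; the name and image are placeholders.

```sh
# Sketch only: a Job alternative to the Deployment discussed above.
# The name and image are placeholders, not taken from this PR.
kubectl apply -f - <<EOF
apiVersion: batch/v1
kind: Job
metadata:
  name: benchmark-run
spec:
  backoffLimit: 0            # do not retry a failed benchmark run
  template:
    spec:
      restartPolicy: Never   # run once to completion
      containers:
      - name: benchmark
        image: BENCHMARK_IMAGE   # placeholder for the lpg benchmark image
EOF
```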
I thought about this as well. A Deployment is convenient in that it keeps the pods running so we can download the result files from the pod; otherwise we would need to set up persistent storage such as S3 or GCS, and not every user has access to those. This also aligns with the user guide of the lpg tool.
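For context, keeping the pod running is what makes a direct copy possible. A minimal sketch, assuming an `app=benchmark-tool` label and a `/results` directory (both illustrative):

```sh
# Copy results out of the still-running benchmark pod.
# The label selector and results path are assumptions for illustration.
POD=$(kubectl get pods -l app=benchmark-tool -o jsonpath='{.items[0].metadata.name}')
kubectl cp "${POD}:/results" ./benchmark-results   # kubectl cp needs a running container
```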
You can give users an option to export the results to S3 or GCS in the job.
I think the pod/job/files stay around after it completes, so we should still be able to download the results?
> You can give users an option to export the results to S3 or GCS in the job.

I took the approach that requires minimal dependencies. Yes, using persistent storage such as S3 works as well, but it requires additional configuration. We can add that option later.

> I think the pod/job/files stay around after it completes, so we should still be able to download the results?

You will need some persistent volume.
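As a sketch of the export option discussed above: a Job could run an upload step after the benchmark finishes. The bucket names are placeholders, and the pod would need credentials for the bucket.

```sh
# Optional upload step a benchmark Job could run after it finishes.
# Bucket names are placeholders; requires GCS or S3 credentials in the pod.
gsutil cp -r /results "gs://YOUR_BUCKET/benchmark-$(date +%Y%m%d-%H%M%S)/"
# or, for S3:
# aws s3 cp --recursive /results "s3://YOUR_BUCKET/benchmark-results/"
```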
I updated the download-benchmark-result.sh script to tear down the deployment after it downloads the results.
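The flow that script implements is roughly the following; this is an outline with illustrative names, see download-benchmark-result.sh in the PR for the real version.

```sh
# Rough outline of the download-then-teardown flow; names are illustrative.
POD=$(kubectl get pods -l app=benchmark-tool -o jsonpath='{.items[0].metadata.name}')
kubectl cp "${POD}:/results" ./benchmark-results
kubectl delete deployment benchmark-tool   # tear down once results are saved
```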
This worked for me, thanks! A couple of things to improve:
A couple more things:
We can now link to the configuration options: https://github.com/AI-Hypercomputer/inference-benchmark?tab=readme-ov-file#configuring-the-benchmark
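For example, benchmark parameters could be overridden on the Deployment via environment variables. The variable names below are hypothetical; see the linked README for the actual options.

```sh
# Hypothetical parameter names, for illustration only; see the README above
# for the real configuration options.
kubectl set env deployment/benchmark-tool REQUEST_RATE=10 NUM_PROMPTS=1000
kubectl rollout status deployment/benchmark-tool   # wait for the updated pod
```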
You need to re-deploy the benchmark tool. Note that further orchestration of the tool, such as automating multiple runs, is out of scope for this PR, though we can revisit.
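Re-deploying could look roughly like this; the manifest path is illustrative.

```sh
# Start another run by re-creating the benchmark Deployment.
# The manifest path is illustrative.
kubectl delete -f benchmark/benchmark.yaml --ignore-not-found
kubectl apply -f benchmark/benchmark.yaml
```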
Updated README
Benchmark works for me! Thanks! Some smaller items to consider:
I've taken the files and run them. Seems to work. Left some comments. Thanks!
```
@@ -0,0 +1,60 @@
apiVersion: apps/v1
kind: Deployment
```
> I think the pod/job/files stay around after it completes, so we should still be able to download the results?
Thanks! Updated as you suggested.
/approve

One nit to fix the link. However, this is still not very practical; we need a published Helm chart so that users can run the benchmark without needing to fork.
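A published chart would reduce the setup to something like the following sketch. The repo URL, chart name, and value key are hypothetical; no such chart exists yet (see the follow-up below).

```sh
# Hypothetical chart coordinates; no published chart exists yet (see #528 below).
helm repo add inference-extension https://example.com/charts
helm install benchmark inference-extension/benchmark \
  --set benchmark.requestRate=10   # hypothetical value key
```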
[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: ahg-g, liu-cong

The full list of commands accepted by this bot can be found here. The pull request process is described here.

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment.
Makes sense, created #528 to follow up on this.
/unhold
/lgtm |