@Xunzhuo Xunzhuo commented Nov 14, 2025

Description

This PR reduces initialDelaySeconds for both the liveness and readiness probes in the Helm chart configuration, so that Kubernetes starts probing sooner after container start and a healthy pod is marked ready earlier.

Changes

  • Reduced liveness probe initialDelaySeconds from 120s to 30s
  • Reduced readiness probe initialDelaySeconds from 180s to 30s
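
For illustration, the change to the values file might look like the sketch below. The key names and nesting are assumptions, since the actual layout of deploy/helm/semantic-router/values.yaml is not shown in this thread; only the initialDelaySeconds values come from this PR.

```yaml
# Hypothetical excerpt of deploy/helm/semantic-router/values.yaml.
# Key names and nesting are assumed; only the delay values are from this PR.
livenessProbe:
  initialDelaySeconds: 30   # previously 120
readinessProbe:
  initialDelaySeconds: 30   # previously 180
```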

Motivation

Shorter initial delays allow Kubernetes to detect service availability faster, reducing the time before the service can start receiving traffic. The previous values (120s and 180s) were overly conservative for most deployment scenarios.
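
Deployments that do need a longer warm-up could restore the previous delays with an override file. The following is a minimal sketch; the file name is hypothetical and the keys are assumed to match the chart's values.yaml.

```yaml
# values-slow-start.yaml (hypothetical override file)
# Restores the pre-PR delays for workloads that need a longer warm-up.
# Key names are assumed to match the chart's values.yaml.
livenessProbe:
  initialDelaySeconds: 120
readinessProbe:
  initialDelaySeconds: 180
```

Such a file would be passed to Helm with the -f (--values) flag at install or upgrade time.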


  • Make sure the code changes pass the pre-commit checks.
  • Sign off your commit by using -s when running git commit.
  • Try to classify PRs for easy understanding of the type of changes, such as [Bugfix], [Feat], and [CI].

Reduce initialDelaySeconds from 120s to 30s for liveness probe
and from 180s to 30s for readiness probe to improve startup time
and faster detection of service availability.

Signed-off-by: bitliu <bitliu@tencent.com>
@Xunzhuo Xunzhuo requested a review from rootfs as a code owner November 14, 2025 08:46
netlify bot commented Nov 14, 2025

Deploy Preview for vllm-semantic-router ready!

Name Link
🔨 Latest commit 4b41bdd
🔍 Latest deploy log https://app.netlify.com/projects/vllm-semantic-router/deploys/6916ec841d14030008403e34
😎 Deploy Preview https://deploy-preview-652--vllm-semantic-router.netlify.app

@github-actions

👥 vLLM Semantic Team Notification

The following members have been identified for the changed files in this PR and have been automatically assigned:

📁 deploy

Owners: @rootfs, @Xunzhuo
Files changed:

  • deploy/helm/semantic-router/values.yaml

🎉 Thanks for your contributions!

This comment was automatically generated based on the OWNER files in the repository.

@JaredforReal JaredforReal merged commit bb68733 into main Nov 14, 2025
16 checks passed
@Xunzhuo Xunzhuo deleted the chore/reduce-probe-initial-delays branch November 16, 2025 15:02
szedan-rh pushed a commit to szedan-rh/semantic-router that referenced this pull request Nov 19, 2025
…roject#652)
