Skip to content

EPP upgrade/downgrade guide #693

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
Tracked by #681
liu-cong opened this issue Apr 14, 2025 · 0 comments
Open
Tracked by #681

EPP upgrade/downgrade guide #693

liu-cong opened this issue Apr 14, 2025 · 0 comments
Labels
triage/accepted Indicates an issue or PR is ready to be actively worked on.

Comments

@liu-cong
Copy link
Contributor

What would you like to be added:

We should provide guidance on upgrading/downgrading EPP versions to minimize downtime. Depending on the tradeoff between risk and speed, I imagine there can be different upgrade strategies:

  1. Simply do a rolling update of the EPP deployment. With the active-passive HA mode EPP HA deployment #692, a minimal disruption to traffic may be expected when switching the leader.
  2. Blue-green upgrade. Deploy a new EPP deployment, verify it works, and update the ExtensionRef.
  3. Create a new InferencePool (still pointing to the same model servers) with new EPP, and configure HTTPRoutes to split traffic between new and old inference pools.

Why is this needed:

@kfswain kfswain mentioned this issue Apr 23, 2025
17 tasks
@kfswain kfswain added the triage/accepted Indicates an issue or PR is ready to be actively worked on. label Apr 24, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
triage/accepted Indicates an issue or PR is ready to be actively worked on.
Projects
None yet
Development

No branches or pull requests

2 participants