You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
We should provide guidance on upgrading/downgrading EPP versions to minimize downtime. Depending on the tradeoff between risk and speed, I imagine there can be different upgrade strategies:
Simply do a rolling update of the EPP deployment. With the active-passive HA mode EPP HA deployment #692, a minimal disruption to traffic may be expected when switching the leader.
Blue-green upgrade. Deploy a new EPP deployment, verify it works, and update the ExtensionRef.
Create a new InferencePool (still pointing to the same model servers) with new EPP, and configure HTTPRoutes to split traffic between new and old inference pools.
Why is this needed:
The text was updated successfully, but these errors were encountered:
What would you like to be added:
We should provide guidance on upgrading/downgrading EPP versions to minimize downtime. Depending on the tradeoff between risk and speed, I imagine there can be different upgrade strategies:
Why is this needed:
The text was updated successfully, but these errors were encountered: