Please assist :)
Hey, I'm trying to reverse engineer the KPA to understand which API calls it performs to scale an inference service up or down, so that I can do it manually. This inference service has a minimum of 1 replica and a maximum of 100. Let's say I want to scale it manually to 2 inference service pods. I can't figure out how to do it. Scaling the deployment that the revision created did not work. Any suggestions?
Originally posted by @daganida88 in #1990