Hey, I'm trying to reverse engineer the KPA to understand which API calls it makes to scale an inference service up or down, so that I can do it manually. This inference service has a minimum of 1 replica and a maximum of 100. Let's say I want to manually scale it to 2 inference service pods. I can't figure out how to do it. Scaling the deployment that the revision created did not work. Any suggestions? #1990

Closed
daganida88 opened this issue Jan 13, 2022 · 2 comments

Comments

@daganida88

No description provided.

@yuzisun
Member

yuzisun commented Jan 15, 2022

@daganida88 Can we move this to Discussions?

@yuzisun
Member

yuzisun commented Jan 15, 2022

see #1993

@yuzisun yuzisun closed this as completed Jan 15, 2022