Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Scheduling on basis of Scalers using KEDA #143

Open
xi2817-aajgaonkar opened this issue Jun 22, 2022 · 3 comments
Open

Scheduling on basis of Scalers using KEDA #143

xi2817-aajgaonkar opened this issue Jun 22, 2022 · 3 comments

Comments

@xi2817-aajgaonkar
Copy link

xi2817-aajgaonkar commented Jun 22, 2022

instead of just time if we can schedule our pods on basis of metrics like HTTP request on ingress, it would be better

We can use keda for it
https://medium.com/backstagewitharchitects/how-autoscaling-works-in-kubernetes-why-you-need-to-start-using-keda-b601b483d355

@xi2817-aajgaonkar
Copy link
Author

we can discuss this feature

@davidebianchi
Copy link
Member

Hi @xi2817-aajgaonkar, it would be interesting but not so easy to scale to 0 based on incoming HTTP request (and to recover the number of replicas at the first incoming call), do you have some ideas?

This operator is more intended as development support, so for the wake up we could also lose the first API calls maybe.

@jwcesign
Copy link

if want to scale to 0, check this: https://github.com/knative/serving/blob/main/docs/scaling/images/scale-from-0.png
Maybe can use a middle component to cache it, and when it's not zero, route to the service.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants