-
Notifications
You must be signed in to change notification settings - Fork 54
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Autoscale up on memory usage #160
Comments
If the k8s platform has metrics enable, this can be achieved via HorizontalPodAutoscaler.
Anyone have preferences? |
imo option 2 sounds like a winner |
This MUST be automated. So option 2 all the way |
A good starting point for working on a memory usage metric is the method calculateRequiredMinimumNumberOfNodes. From the code above we can derive some assertions:
|
@tristantarrant I tried above to resume our discussion about memory autoscaling. Please add anything you thing could be useful to the discussion. Also @danberindei, @wburns if you want to add your though, thanks! |
An invalidation and LOCAL cache don't really make sense in a server with a cluster, period. Since you can't guarantee which node you will talk to. So I am not sure if we have to worry about those at all. Replicated doesn't make sense for memory scaling as you mentioned. Requires vertical scaling, which this can't do afaik.
I guess we need to detail what we want to do. Cause you could have the issue that a single cache could run into eviction early, which might be fine? What is the goal here again, I forgot? Should it be based solely on JVM heap size for example? |
I can see two cases
I would go for 1 as first implementation. @tristantarrant? |
@anistor pr infinispan/infinispan#7857 |
I did some more tests. It seems that the standard memory metric is ok to measure offheap usage whether mem usage is increasing or decreasing. Heap memory is handled by the jvm and it's usually not released to the OS in a predictable way or not release at all due to the greedy jvm attitude. So I'll do the following: HEAP USE CASE OFFHEAP USE CASE
|
Kubernetes HPA does only linear computation: |
No description provided.
The text was updated successfully, but these errors were encountered: