-
Notifications
You must be signed in to change notification settings - Fork 2.7k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
endpointmanager: fix bpf policy pressure getting stuck. #28185
endpointmanager: fix bpf policy pressure getting stuck. #28185
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks for the fix! Just one comment.
82aa9c9
to
05aa798
Compare
/test |
05aa798
to
3f73d45
Compare
/test |
3f73d45
to
efd638c
Compare
/test |
This pull request has been automatically marked as stale because it |
Let's get this PR over the finish line @tommyp1ckles, it's basically there. Just some Go linter errors to fix up. |
@christarazi woops yeah forgot about this one, let me get it ready |
efd638c
to
9e3890a
Compare
/test |
Currently the policy map pressure metric only updates the map pressure metric when a new pressure value that is higher than the current one is set. This means that the metric can only ever go up, so when maps are shrunk (ex. such as after doing an cilium fqdn cache clean) the metric never goes down. This changes the behavior of the metric to maintain a map of map pressure values. When the trigger is invoked, it iterates all values and finds the max - updating the map_pressure gauge for policymaps to the max value. Endpoints that are shut down have their values removed. Signed-off-by: Tom Hadlaw <tom.hadlaw@isovalent.com>
9e3890a
to
5bf6820
Compare
/test |
Currently the policy map pressure metric only updates the map pressure metric when a new pressure value that is higher than the current one is set. This means that the metric can only ever go up, so when maps are shrunk (ex. such as after doing an cilium fqdn cache clean) the metric never goes down.
This changes the behaviour of the metric to maintain a map of map pressure values. When the trigger is invoked, it iterates all values and finds the max - updating the map_pressure gauge for policymaps to the max value. Endpoints that are shut down have their values removed.
CC: @christarazi
Note: This is part of a two part fix for endpoint policy map pressure, for backport purposes the other change was split into a seperate PR: #28184