Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

bug/pkg/health: Fix Nil Address Issue in Node Update Mechanism #17667

Merged

Conversation

nathanjsweet
Copy link
Member

It is possible for a node with a "" primary address to
be "added" to the health pkg cache. When this happens subsequent
updates will not flush out the bad value and thus it will
persist as a valid status for a node until the entire cilium
daemon is reset.

This fixes the bug by not caching nil values at all.

Signed-off-by: Nate Sweet nathanjsweet@pm.me

@nathanjsweet nathanjsweet added release-note/bug This PR fixes an issue in a previous release of Cilium. area/health Relates to the cilium-health component labels Oct 21, 2021
@nathanjsweet nathanjsweet requested review from joestringer and a team October 21, 2021 21:57
@maintainer-s-little-helper maintainer-s-little-helper bot added this to Needs backport from master in 1.10.6 Oct 21, 2021
@maintainer-s-little-helper maintainer-s-little-helper bot added this to Needs backport from master in 1.9.11 Oct 22, 2021
@nathanjsweet nathanjsweet force-pushed the pr/nathanjsweet/skip-nil-ips-on-health-pkg-node-update branch 3 times, most recently from 5d44fe9 to 1bface3 Compare October 25, 2021 15:13
Copy link
Member

@christarazi christarazi left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks

@nathanjsweet
Copy link
Member Author

/test

@nathanjsweet
Copy link
Member Author

I believe cilium/cilium-cli#572 fixed multi-cluster failing intermittently for CI. All tests are currently passing (including the kernel tests). I'm going to rebase and if the multi-cluster tests pass then I'm marking this PR as ready-to-merge.

It is possible for a node with a "<nil>" primary address to
be "added" to the health pkg cache. When this happens subsequent
updates **will not** flush out the bad value and thus it will
persist as a valid status for a node until the entire cilium
daemon is reset.

This fixes the bug by not caching nil values at all.

Signed-off-by: Nate Sweet <nathanjsweet@pm.me>
@nathanjsweet nathanjsweet force-pushed the pr/nathanjsweet/skip-nil-ips-on-health-pkg-node-update branch from 1bface3 to 1c9ee7c Compare October 28, 2021 18:10
@nathanjsweet nathanjsweet added the ready-to-merge This PR has passed all tests and received consensus from code owners to merge. label Oct 28, 2021
@nebril nebril merged commit c9da51c into master Oct 29, 2021
@nebril nebril deleted the pr/nathanjsweet/skip-nil-ips-on-health-pkg-node-update branch October 29, 2021 09:43
@joestringer joestringer added this to Needs backport from master in 1.9.12 Nov 5, 2021
@joestringer joestringer removed this from Needs backport from master in 1.9.11 Nov 5, 2021
@maintainer-s-little-helper maintainer-s-little-helper bot moved this from Needs backport from master to Backport done to v1.10 in 1.10.6 Nov 23, 2021
@joestringer joestringer moved this from Needs backport from master to Backport done to v1.9 in 1.9.12 Dec 6, 2021
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
area/health Relates to the cilium-health component ready-to-merge This PR has passed all tests and received consensus from code owners to merge. release-note/bug This PR fixes an issue in a previous release of Cilium.
Projects
No open projects
1.10.6
Backport done to v1.10
1.9.12
Backport done to v1.9
Development

Successfully merging this pull request may close these issues.

None yet

8 participants