Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Adjust how kube-controllers timeout works #6610

Merged

Conversation

caseydavenport
Copy link
Member

Description

  • Remove 2 second timeout for healthz call
  • Replace with exponential backoff timeout using a context
  • Retry sooner if we encounter an error

Related issues/PRs

Todos

  • Tests
  • Documentation
  • Release note

Release Note

Use exponential backoff for kube-controllers health check timeout, retry sooner if failed.

Reminder for the reviewer

Make sure that this PR has the correct labels and milestone set.

Every PR needs one docs-* label.

  • docs-pr-required: This change requires a change to the documentation that has not been completed yet.
  • docs-completed: This change has all necessary documentation completed.
  • docs-not-required: This change has no user-facing impact and requires no docs.

Every PR needs one release-note-* label.

  • release-note-required: This PR has user-facing changes. Most PRs should have this label.
  • release-note-not-required: This PR has no user-facing changes.

Other optional labels:

  • cherry-pick-candidate: This PR should be cherry-picked to an earlier release. For bug fixes only.
  • needs-operator-pr: This PR is related to install and requires a corresponding change to the operator.

@caseydavenport caseydavenport requested a review from a team as a code owner August 24, 2022 14:56
@marvin-tigera marvin-tigera added this to the Calico v3.25.0 milestone Aug 24, 2022
@marvin-tigera marvin-tigera added docs-pr-required Change is not yet documented release-note-required Change has user-facing impact (no matter how small) labels Aug 24, 2022
@@ -88,7 +88,7 @@ The default values.yaml should be suitable for most basic deployments.

```
# Image pull secrets to provision for pulling images from private registries.
# If provided, references to the secrets must also be provided in the installation section.
Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Whoops, this is unrelated but still good to merge 😅

@@ -261,9 +266,10 @@ func runHealthChecks(ctx context.Context, s *status.Status, k8sClientset *kubern
// carry on
}

// skip healthchecks if configured
// Datastore HealthCheck
healthCtx, cancel := context.WithTimeout(ctx, 10*time.Second)
Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Maybe we should have two different timeouts - EnsureInitialized can potentially make a few HTTP calls under the hood.

@caseydavenport caseydavenport added docs-not-required Docs not required for this change merge-when-ready labels Aug 26, 2022
@marvin-tigera marvin-tigera removed the docs-pr-required Change is not yet documented label Aug 26, 2022
@marvin-tigera marvin-tigera merged commit 913fedc into projectcalico:master Aug 26, 2022
@caseydavenport caseydavenport deleted the casey-fix-kc-health branch August 26, 2022 17:22
caseydavenport added a commit that referenced this pull request Aug 26, 2022
…n-release-v3.24

[release-v3.24] Auto pick #6610: Adjust how kube-controllers timeout works
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
docs-not-required Docs not required for this change merge-when-ready release-note-required Change has user-facing impact (no matter how small)
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants