Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Bug 1971808: fix local-with-fallback #320

Merged

Conversation

danwinship
Copy link
Contributor

The new local-with-fallback hack was interfering with cluster-internal traffic to local-with-fallback services (which already would have worked fine even when there were no local pods), breaking cluster-internal source IP preservation (and maybe other things).

This reverts the previous hack and replaces it with a hack that preserves almost all of the current handling of externalTrafficPolicy: Local services, and only changes the bit at the end. In particular, this preserves the short-circuit of pod-to-loadbalancer-IP for Local services, which the previous hack did not.

/cc @Miciah @smarterclayton @sgreene570

rebases to openshift/kubernetes#813

@openshift-ci openshift-ci bot requested a review from Miciah June 16, 2021 19:24
@openshift-ci
Copy link
Contributor

openshift-ci bot commented Jun 16, 2021

@danwinship: An error was encountered searching for bug 19718018 on the Bugzilla server at https://bugzilla.redhat.com. No known errors were detected, please see the full error message for details.

Full error message. code 101: Bug #19718018 does not exist.

Please contact an administrator to resolve this issue, then request a bug refresh with /bugzilla refresh.

In response to this:

WIP Bug 19718018: fix local-with-fallback

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

@openshift-ci openshift-ci bot added do-not-merge/work-in-progress Indicates that a PR should not merge because it is a work in progress. approved Indicates a PR has been approved by an approver from all required OWNERS files. labels Jun 16, 2021
@danwinship danwinship changed the title WIP Bug 19718018: fix local-with-fallback WIP Bug 1971808: fix local-with-fallback Jun 16, 2021
@openshift-ci
Copy link
Contributor

openshift-ci bot commented Jun 16, 2021

@danwinship: This pull request references Bugzilla bug 1971808, which is valid. The bug has been moved to the POST state. The bug has been updated to refer to the pull request using the external bug tracker.

3 validation(s) were run on this bug
  • bug is open, matching expected state (open)
  • bug target release (4.9.0) matches configured target release for branch (4.9.0)
  • bug is in the state NEW, which is one of the valid states (NEW, ASSIGNED, ON_DEV, POST, POST)

Requesting review from QA contact:
/cc @zhaozhanqi

In response to this:

WIP Bug 1971808: fix local-with-fallback

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

@openshift-ci openshift-ci bot added bugzilla/severity-high Referenced Bugzilla bug's severity is high for the branch this PR is targeting. bugzilla/valid-bug Indicates that a referenced Bugzilla bug is valid for the branch this PR is targeting. labels Jun 16, 2021
@openshift-ci openshift-ci bot requested a review from zhaozhanqi June 16, 2021 19:24
@danwinship
Copy link
Contributor Author

/retest

@danwinship
Copy link
Contributor Author

ok, this seems to work AFAICT:

  • The router responds via its NodePort on all node IPs, even the nodes it doesn't have an endpoint on
  • If I create an externalTrafficPolicy: Local NodePort service without the annotation, it only responds on the host that holds the endpoint. But if I then add the local-with-fallback annotation, it then responds on all nodes.
  • Following the reproducer from bz 1971808, the service gives the correct source IP from any node

@danwinship danwinship changed the title WIP Bug 1971808: fix local-with-fallback Bug 1971808: fix local-with-fallback Jun 18, 2021
@openshift-ci openshift-ci bot removed the do-not-merge/work-in-progress Indicates that a PR should not merge because it is a work in progress. label Jun 18, 2021
@danwinship
Copy link
Contributor Author

/retest

@knobunc
Copy link
Contributor

knobunc commented Jun 21, 2021

/lgtm

@openshift-ci openshift-ci bot added the lgtm Indicates that a PR is ready to be merged. label Jun 21, 2021
@openshift-ci
Copy link
Contributor

openshift-ci bot commented Jun 21, 2021

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: danwinship, knobunc

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@openshift-bot
Copy link
Contributor

/retest

Please review the full test history for this PR and help us cut down flakes.

5 similar comments
@openshift-bot
Copy link
Contributor

/retest

Please review the full test history for this PR and help us cut down flakes.

@openshift-bot
Copy link
Contributor

/retest

Please review the full test history for this PR and help us cut down flakes.

@openshift-bot
Copy link
Contributor

/retest

Please review the full test history for this PR and help us cut down flakes.

@openshift-bot
Copy link
Contributor

/retest

Please review the full test history for this PR and help us cut down flakes.

@openshift-bot
Copy link
Contributor

/retest

Please review the full test history for this PR and help us cut down flakes.

@danwinship
Copy link
Contributor Author

/test e2e-aure
/override ci/prow/e2e-gcp

@danwinship
Copy link
Contributor Author

/test e2e-azure

@openshift-ci
Copy link
Contributor

openshift-ci bot commented Jun 21, 2021

@danwinship: The specified target(s) for /test were not found.
The following commands are available to trigger jobs:

  • /test e2e-aws
  • /test e2e-aws-multitenant
  • /test e2e-aws-upgrade
  • /test e2e-gcp
  • /test images
  • /test unit
  • /test verify
  • /test verify-deps

Use /test all to run the following jobs:

  • pull-ci-openshift-sdn-master-e2e-aws
  • pull-ci-openshift-sdn-master-e2e-aws-upgrade
  • pull-ci-openshift-sdn-master-e2e-gcp
  • pull-ci-openshift-sdn-master-images
  • pull-ci-openshift-sdn-master-unit
  • pull-ci-openshift-sdn-master-verify
  • pull-ci-openshift-sdn-master-verify-deps

In response to this:

/test e2e-aure
/override ci/prow/e2e-gcp

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

@openshift-ci
Copy link
Contributor

openshift-ci bot commented Jun 21, 2021

@danwinship: The specified target(s) for /test were not found.
The following commands are available to trigger jobs:

  • /test e2e-aws
  • /test e2e-aws-multitenant
  • /test e2e-aws-upgrade
  • /test e2e-gcp
  • /test images
  • /test unit
  • /test verify
  • /test verify-deps

Use /test all to run the following jobs:

  • pull-ci-openshift-sdn-master-e2e-aws
  • pull-ci-openshift-sdn-master-e2e-aws-upgrade
  • pull-ci-openshift-sdn-master-e2e-gcp
  • pull-ci-openshift-sdn-master-images
  • pull-ci-openshift-sdn-master-unit
  • pull-ci-openshift-sdn-master-verify
  • pull-ci-openshift-sdn-master-verify-deps

In response to this:

/test e2e-azure

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

@openshift-ci
Copy link
Contributor

openshift-ci bot commented Jun 21, 2021

@danwinship: Overrode contexts on behalf of danwinship: ci/prow/e2e-gcp

In response to this:

/test e2e-aure
/override ci/prow/e2e-gcp

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

@danwinship
Copy link
Contributor Author

Whatever. GCP failures are infrastructural, and I was testing on GCP when I wrote #320 (comment) so I know it would work if the cluster actually came up.

@openshift-merge-robot openshift-merge-robot merged commit 3ba568f into openshift:master Jun 21, 2021
@openshift-ci
Copy link
Contributor

openshift-ci bot commented Jun 21, 2021

@danwinship: All pull requests linked via external trackers have merged:

Bugzilla bug 1971808 has been moved to the MODIFIED state.

In response to this:

Bug 1971808: fix local-with-fallback

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
approved Indicates a PR has been approved by an approver from all required OWNERS files. bugzilla/severity-high Referenced Bugzilla bug's severity is high for the branch this PR is targeting. bugzilla/valid-bug Indicates that a referenced Bugzilla bug is valid for the branch this PR is targeting. lgtm Indicates that a PR is ready to be merged.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

4 participants