Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Bug 1882209: baremetal & friends: Set coredns forward policy to sequential #2122

Merged
merged 1 commit into from Jan 21, 2021

Conversation

cybertron
Copy link
Member

This was already done for openstack in #1527, but not for baremetal
because we don't have any separation between external and internal
hostnames. However, in environments where the DNS servers are
misconfigured such that the second one listed in resolv.conf does
not properly resolve external names, this can cause intermittent
resolution failures from our coredns because it randomly chooses to
use the broken server.

Switching the forward policy to sequential will avoid potentially
confusing issues where a set of DNS servers works fine standalone
but breaks when configured as the forwarding upstreams in coredns.

- What I did

- How to verify it

- Description for the changelog

@openshift-ci-robot
Copy link
Contributor

@cybertron: This pull request references Bugzilla bug 1882209, which is invalid:

  • expected the bug to target the "4.6.0" release, but it targets "---" instead

Comment /bugzilla refresh to re-evaluate validity if changes to the Bugzilla bug are made, or edit the title of this pull request to link to a different bug.

In response to this:

Bug 1882209: baremetal: Set coredns forward policy to sequential

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

@openshift-ci-robot openshift-ci-robot added bugzilla/severity-high Referenced Bugzilla bug's severity is high for the branch this PR is targeting. bugzilla/invalid-bug Indicates that a referenced Bugzilla bug is invalid for the branch this PR is targeting. labels Sep 28, 2020
@cybertron
Copy link
Member Author

/bugzilla refresh

@openshift-ci-robot openshift-ci-robot added bugzilla/severity-medium Referenced Bugzilla bug's severity is medium for the branch this PR is targeting. bugzilla/valid-bug Indicates that a referenced Bugzilla bug is valid for the branch this PR is targeting. and removed bugzilla/severity-high Referenced Bugzilla bug's severity is high for the branch this PR is targeting. labels Oct 23, 2020
@openshift-ci-robot
Copy link
Contributor

@cybertron: This pull request references Bugzilla bug 1882209, which is valid. The bug has been moved to the POST state. The bug has been updated to refer to the pull request using the external bug tracker.

3 validation(s) were run on this bug
  • bug is open, matching expected state (open)
  • bug target release (4.7.0) matches configured target release for branch (4.7.0)
  • bug is in the state NEW, which is one of the valid states (NEW, ASSIGNED, ON_DEV, POST, POST)

In response to this:

/bugzilla refresh

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

@openshift-ci-robot openshift-ci-robot removed the bugzilla/invalid-bug Indicates that a referenced Bugzilla bug is invalid for the branch this PR is targeting. label Oct 23, 2020
@cybertron
Copy link
Member Author

/test e2e-metal-ipi

@kikisdeliveryservice
Copy link
Contributor

/assign @yboaron

@runcom
Copy link
Member

runcom commented Nov 2, 2020

ping @cybertron

@cybertron
Copy link
Member Author

Sorry, I missed that this was in merge conflict. Should be ready to go now.

@kikisdeliveryservice
Copy link
Contributor

@yboaron @celebdor PTAL

@dougsland
Copy link
Contributor

This patch make sense for ovirt as well. Going to give a try.

This was already done for openstack in openshift#1527, but not for baremetal
because we don't have any separation between external and internal
hostnames. However, this behavior is unintuitive and we've found a
number of instances where it has caused problems on the other
platforms too.

Switching the forward policy to sequential will avoid potentially
confusing issues where a set of DNS servers works fine standalone
but breaks when configured as the forwarding upstreams in coredns.
@cybertron
Copy link
Member Author

Okay, I've rebased this to pick up the dedupe changes and to apply it to the other on-prem platforms. It's causing confusing issues on all platforms.

@mandre @rgolangh @jcpowermac @patrickdillon @bcrochet I think with this we can dedupe the corefile template too. I believe the only differences between them were this sequential change and whether they use the template or a db file. The latter is an implementation detail that yields effectively the same results so it shouldn't block us.

Is anyone aware of any other differences I may have missed that would cause issues if we consolidated the coredns config?

@cybertron
Copy link
Member Author

/test e2e-ovirt
/test e2e-vsphere

@dougsland
Copy link
Contributor

/retest

@mandre
Copy link
Member

mandre commented Nov 10, 2020

Is anyone aware of any other differences I may have missed that would cause issues if we consolidated the coredns config?

OpenStack would be fine switching over to the on-prem template for corefile, at least I don't foresee any issue.

@cybertron
Copy link
Member Author

Looks like the ovirt job has been red for nearly two weeks. It's unlikely that failure is related to this patch.

@dougsland
Copy link
Contributor

tested in my lab, it seems resolved the issue reported on ovirt
/lgtm

@openshift-ci-robot openshift-ci-robot added the lgtm Indicates that a PR is ready to be merged. label Nov 10, 2020
@dougsland
Copy link
Contributor

/retest

1 similar comment
@cybertron
Copy link
Member Author

/retest

@yboaron
Copy link
Contributor

yboaron commented Nov 18, 2020

/lgtm

@cybertron
Copy link
Member Author

/retest

@kikisdeliveryservice
Copy link
Contributor

Are any other sign offs needed based on the affected platforms?

@cybertron
Copy link
Member Author

Maybe @jcpowermac or @patrickdillon . This is a pretty safe change though. It's just making coredns behave the same way the system resolver would.

@rgolangh
Copy link
Contributor

lgtm for kubevirt platform

@dougsland
Copy link
Contributor

/retest

@dougsland
Copy link
Contributor

Would be nice to merge this one soon.

@kikisdeliveryservice
Copy link
Contributor

/assign @jcpowermac @patrickdillon

PTAL

/retest

@kikisdeliveryservice kikisdeliveryservice changed the title Bug 1882209: baremetal: Set coredns forward policy to sequential Bug 1882209: baremetal & friends: Set coredns forward policy to sequential Dec 3, 2020
@jcpowermac
Copy link
Contributor

/lgtm

@cybertron
Copy link
Member Author

@kikisdeliveryservice I think we have what we need to merge this now.

@dougsland
Copy link
Contributor

just a friendly ping for the merge :) @mandre @kikisdeliveryservice

@sandrobonazzola
Copy link

@ashcrow, @cgwalters, @darkmuggle, @kikisdeliveryservice , @runcom , @sinnykumari , @yuqi-zhang anything pending for getting this in?

Copy link
Contributor

@sinnykumari sinnykumari left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM
All tests are green and we have agreement between different platform teams

@openshift-ci-robot
Copy link
Contributor

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: cybertron, dougsland, jcpowermac, sinnykumari, yboaron

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@openshift-ci-robot openshift-ci-robot added the approved Indicates a PR has been approved by an approver from all required OWNERS files. label Jan 21, 2021
@dougsland
Copy link
Contributor

/test e2e-aws-serial

@openshift-bot
Copy link
Contributor

/retest

Please review the full test history for this PR and help us cut down flakes.

@openshift-merge-robot openshift-merge-robot merged commit 2c59140 into openshift:master Jan 21, 2021
@openshift-ci-robot
Copy link
Contributor

@cybertron: All pull requests linked via external trackers have merged:

Bugzilla bug 1882209 has been moved to the MODIFIED state.

In response to this:

Bug 1882209: baremetal & friends: Set coredns forward policy to sequential

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
approved Indicates a PR has been approved by an approver from all required OWNERS files. bugzilla/severity-medium Referenced Bugzilla bug's severity is medium for the branch this PR is targeting. bugzilla/valid-bug Indicates that a referenced Bugzilla bug is valid for the branch this PR is targeting. lgtm Indicates that a PR is ready to be merged.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet