Skip to content
This repository has been archived by the owner on Jun 20, 2024. It is now read-only.

[dns] failure to resolve some names #632

Closed
awh opened this issue May 5, 2015 · 4 comments
Closed

[dns] failure to resolve some names #632

awh opened this issue May 5, 2015 · 4 comments

Comments

@awh
Copy link
Contributor

awh commented May 5, 2015

Observed on a Weave 0.10.0 test system owned by @yaronr:

  • Three CoreOS hosts running weave and weaveDNS
  • A single subnet (10.0.0.0/8) used for both weaveDNS and application containers
  • Three named application containers (zoo1, zoo2, zoo3) running Zookeeper, using weaveDNS for discovery

The problem manifested as Zookeeper nodes being unable to peer with each other correctly due to name resolution errors. The following diagnostic steps were performed:

  • weave status on each host confirmed that weave had peered correctly, and that the correct names were registered in each weaveDNS peer
  • docker exec was used to confirm that the resolv.conf was configured correctly in each Zookeeper container, that weaveDNS was reachable, and that the other Zookeeper containers were reachable
  • ip netns exec was used to confirm IP reachability between the weaveDNS containers
  • docker exec getent was used to perform lookups inside each Zookeeper container. The following results were observed:
    • zoo1 could resolve zoo3 but not zoo2
    • zoo3 could resolve zoo1 and zoo2
    • zoo3 could resolve zoo2 but not zoo1

Furthermore, in the course of our investigation, the problem resolved itself and did not recur. The only change we had deliberately made in the meantime was to shut down some other containers that were generating DNS traffic that was hampering our ability to analyse the weaveDNS logs in real time. ps confirmed that none of the weaveDNS containers had been restarted.

@rade rade added the bug label May 5, 2015
@inercia
Copy link
Contributor

inercia commented May 5, 2015

@awh Do you have any logs we could use from the WeaveDNS containers?

@awh
Copy link
Contributor Author

awh commented May 5, 2015

Unfortunately not, we were in the process of shutting down other containers to obtain some clean traces when the problem resolved itself.

@rade rade modified the milestone: 0.11.0 May 12, 2015
@rade rade changed the title WeaveDNS 0.10.0 not resolving all names [dns] failure to resolve some names May 12, 2015
@rade rade modified the milestones: 0.11.0, next May 26, 2015
@awh
Copy link
Contributor Author

awh commented Jun 2, 2015

Closing due to inability to reproduce; it's likely that this defect is obsoleted by the upcoming mDNS->gossip transition anyway.

@awh awh closed this as completed Jun 2, 2015
@rade rade modified the milestones: n/a, 1.0 Jun 16, 2015
@yaronr
Copy link

yaronr commented Aug 23, 2015

Update:
Probably related to https://issues.apache.org/jira/browse/ZOOKEEPER-1506

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Projects
None yet
Development

No branches or pull requests

4 participants