
coreDNS unable to resolve upstream #53

Closed
latchmihay opened this issue Feb 26, 2019 · 11 comments

@latchmihay

latchmihay commented Feb 26, 2019

Hello, I have a plain installation of k3s on Ubuntu 18.04.

I am running a container which is failing to resolve DNS:

# nslookup index.docker.io 10.43.0.10
Server:    10.43.0.10
Address 1: 10.43.0.10 kube-dns.kube-system.svc.cluster.local

nslookup: can't resolve 'index.docker.io': Try again

# nslookup quay.io 10.43.0.10
Server:    10.43.0.10
Address 1: 10.43.0.10 kube-dns.kube-system.svc.cluster.local

nslookup: can't resolve 'quay.io': Try again
# k3s kubectl logs -f pod/coredns-7748f7f6df-8htwl -n kube-system
2019-02-26T22:52:50.556Z [ERROR] plugin/errors: 2 index.docker.io. AAAA: unreachable backend: read udp 10.42.0.6:50878->1.1.1.1:53: i/o timeout
2019-02-26T22:52:50.556Z [ERROR] plugin/errors: 2 index.docker.io. A: unreachable backend: read udp 10.42.0.6:38587->1.1.1.1:53: i/o timeout
2019-02-26T22:53:18.425Z [ERROR] plugin/errors: 2 quay.io. AAAA: unreachable backend: read udp 10.42.0.6:48427->1.1.1.1:53: i/o timeout
2019-02-26T22:53:18.425Z [ERROR] plugin/errors: 2 quay.io. A: unreachable backend: read udp 10.42.0.6:53214->1.1.1.1:53: i/o timeout

I am not sure what 1.1.1.1 is or where it's coming from.

@jadsonlourenco

I am not sure what 1.1.1.1 is or where it's coming from.

That is Cloudflare's public DNS service, like Google's public DNS at 8.8.8.8.

@latchmihay
Author

Hmm, it's probably being blocked on my network. Any idea how it's being configured and how I could change it?

@ibuildthecloud
Contributor

@latchmihay We may have hardcoded 1.1.1.1; we will make that configurable. The default behavior of Kubernetes is to use the host's /etc/resolv.conf as the upstream DNS, but because systemd-resolved is the default these days (and older setups use dnsmasq), that file typically points at a 127.0.0.x IP and then things break. It's genuinely hard to figure out what the upstream DNS should actually be, so we probably hardcoded it to 1.1.1.1.

We will add this as an option to the agent and also document it.
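
For context, this is roughly what a stock systemd-resolved host (e.g. Ubuntu 18.04) exposes, which is why the host's /etc/resolv.conf can't simply be handed to CoreDNS as an upstream. The output below is illustrative, not taken from this cluster:

# cat /etc/resolv.conf
# This file is managed by man:systemd-resolved(8). Do not edit.
nameserver 127.0.0.53
options edns0

# cat /run/systemd/resolve/resolv.conf
nameserver 192.168.0.1

The real upstream servers live in /run/systemd/resolve/resolv.conf; the 192.168.0.1 entry is just an example.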

@jadsonlourenco

@ibuildthecloud Please keep the current behavior and just make it configurable. Keeping it saves a lot of time precisely because of the issue you described; I've hit it many times, and otherwise it would become an extra step on every new server installation.
Anyway, thank you. I was hoping to migrate to the new Rancher v2.

@bechampion

I fixed it by changing the coredns ConfigMap from 1.1.1.1 to 8.8.8.8 ... for whatever reason I could not reach 1.1.1.1:53.

@DMW007

DMW007 commented Mar 3, 2019

This can be done by replacing proxy . 1.1.1.1 with your own DNS server in the coredns ConfigMap. I wrote a detailed guide on how to change this manually and in an automated way for tools like Ansible here: https://devops.stackexchange.com/a/6521/6923
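
For example, a one-liner along these lines does the replacement (192.168.0.19 is a placeholder for your own DNS server, and this assumes the Corefile still uses the proxy plugin, as it did at the time):

k3s kubectl -n kube-system get configmap coredns -o yaml \
  | sed 's/proxy . 1.1.1.1/proxy . 192.168.0.19/' \
  | k3s kubectl -n kube-system apply -f -

Depending on whether the reload plugin is enabled in the Corefile, you may also need to restart the coredns pod for the change to take effect.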

@erikwilson
Contributor

We have created a release candidate v0.3.0-rc3 which will hopefully fix these DNS issues. Please try it out and let me know if it helps!

The setting is configurable: we accept a --resolv-conf flag that is passed down to the kubelet, and a K3S_RESOLV_CONF environment variable works as well. We now try to use the system resolv.conf files (from /etc and systemd), and will create a /tmp/k3s-resolv.conf file with nameserver 8.8.8.8 if the nameservers in the system files are not global unicast IPs.
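
A quick sketch of both options, assuming a custom resolv.conf at /etc/k3s-resolv.conf that points at a reachable upstream (the path and nameserver here are examples, not defaults):

# Write a resolv.conf with a reachable upstream
cat > /etc/k3s-resolv.conf <<'EOF'
nameserver 192.168.0.19
EOF

# Option 1: pass the flag, which is handed down to the kubelet
k3s server --resolv-conf /etc/k3s-resolv.conf

# Option 2: let the install script pick it up from the environment
curl -sfL https://get.k3s.io | K3S_RESOLV_CONF=/etc/k3s-resolv.conf sh -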

@DMW007

DMW007 commented Mar 30, 2019

I tried it out on Ubuntu 16.04.6 LTS with v0.3.0 (9a1a1ec), since the final 0.3 got released a few hours ago. Using curl -sfL https://get.k3s.io | K3S_RESOLV_CONFIG=192.168.0.19 sh - and removing my sed workaround from cm/coredns, it works, but only without providing a custom TLD:

root@rocket-chat:/# ping my-pc
PING my-pc.fritz.box (192.168.0.20) 56(84) bytes of data.
64 bytes from my-PC.fritz.box (192.168.0.20): icmp_seq=1 ttl=61 time=0.787 ms

But when I try ping my-pc.fritz.box, the name can't be resolved. nslookup also times out:

root@rocket-chat:/# nslookup my-pc.fritz.box
;; connection timed out; no servers could be reached

On other machines in the same network that use 192.168.0.19 as their DNS server, both names resolve successfully. Although I'm able to resolve my-pc.fritz.box inside Vagrant itself, it may have something to do with the fact that I'm trying this in Vagrant on Ubuntu 18.04. Content of /etc/resolv.conf inside Vagrant:

nameserver 10.0.2.2
search fritz.box

Update: It's a Kubernetes issue

Found out that this was caused by Kubernetes' ndots config. By default, pods have options ndots:5 set in their resolv.conf. This means that DNS names must contain at least five dots before they are treated as absolute names. my-pc doesn't contain any dots, so it ends up being resolved via our upstream 192.168.0.19, where an alias without the .fritz.box suffix exists by default.

But my-pc.fritz.box contains two dots. The usual resolver default is ndots:1, so any DNS name with at least one dot would be resolved as an absolute name. Since Kubernetes uses ndots:5, my-pc.fritz.box is treated as a relative name, so all suffixes from the search list are applied. This can't work, because another .fritz.box suffix gets appended and my-pc.fritz.box becomes my-pc.fritz.box.fritz.box.

I assume this is meant to speed things up for internal cluster DNS entries, but for external DNS it can slow things down. Using apt-get to install some debug packages like netutils was very slow; since I switched away from the default ndots:5 it got pretty fast, like on my working machine. You can also find blog posts about this issue. In my case, though, the primary problem was that it breaks my absolute external DNS entries.
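
To make this concrete, the resolver config inside such a pod looks roughly like this with the Kubernetes defaults (the search entries are illustrative and depend on the namespace and the host's search domains):

# /etc/resolv.conf inside the pod
nameserver 10.43.0.10
search default.svc.cluster.local svc.cluster.local cluster.local fritz.box
options ndots:5

Because my-pc.fritz.box has only two dots, fewer than ndots:5, the search suffixes are tried first, producing lookups such as my-pc.fritz.box.fritz.box.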

To solve this, customize the pod's DNS configuration by setting dnsConfig in the pod spec:

spec:
  containers:
  # ...
  dnsConfig:
    options:
      - name: ndots
        value: "1"
But regarding Kubernetes' own DNS, I'd consider this a workaround for local use, since I'm not completely sure about the performance impact in production yet. As another solution, we could force absolute domain names by appending a trailing dot (e.g. my-pc.fritz.box.).

Currently I'm using the dnsConfig entry, and DNS works well with my custom server. So this problem wasn't related to k3s directly, and the fix in 0.3 works well :)

@lindhe

lindhe commented Sep 25, 2020

Is there any way to set the dnsConfig options globally instead of on a per-pod basis?

@brettinternet

For anyone arriving here from a search engine, I was able to resolve my cluster's DNS issues by

(a) using the legacy iptables backend rather than nftables, (b) ensuring the CNI is correctly installed (I use Calico on hardware with multiple NICs, which requires additional setup for IP detection), and (c) flushing the iptables rules left over from the CNI between cluster installs.

iptables --version
# iptables v1.8.7 (legacy)
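# Reset default policies to ACCEPT and flush all rules and user-defined chains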
iptables -P INPUT ACCEPT
iptables -P FORWARD ACCEPT
iptables -P OUTPUT ACCEPT
iptables -t nat -F
iptables -t mangle -F
iptables -F
iptables -X
# ... Install k3s
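
If point (a) is what you need, on Debian/Ubuntu hosts the switch to the legacy backend is typically done with update-alternatives (a sketch; verify the paths on your distro and restart k3s or reboot afterwards):

update-alternatives --set iptables /usr/sbin/iptables-legacy
update-alternatives --set ip6tables /usr/sbin/ip6tables-legacy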

@akelge

akelge commented Jan 19, 2024

Not completely on topic, but the fact that issue #53 is about DNS seems done on purpose :)
