dnsmask failed to create inotify #2709

pierreozoux · 2017-06-12T12:02:27Z

kops version: 1.6
kubernetes version: 1.6.1
Networking: canal
Cloud: AWS
Node age: 26d

Here are the logs we saw:

I0612 11:07:05.749783       1 main.go:76] opts: {{/usr/sbin/dnsmasq [-k --cache-size=1000 --log-facility=- --server=/cluster.local/127.0.0.1#10053 --server=/in-addr.arpa/127.0.0.1#10053 --server=/in6.arpa/127.0.0.1#10053] true} /etc/k8s/dns/dnsmasq-nanny 10000000000}
I0612 11:07:05.750284       1 nanny.go:86] Starting dnsmasq [-k --cache-size=1000 --log-facility=- --server=/cluster.local/127.0.0.1#10053 --server=/in-addr.arpa/127.0.0.1#10053 --server=/in6.arpa/127.0.0.1#10053]
I0612 11:07:05.822074       1 nanny.go:108] 
I0612 11:07:05.822091       1 nanny.go:108] dnsmasq: failed to create inotify: No file descriptors available
I0612 11:07:05.822117       1 nanny.go:111] 
W0612 11:07:05.822124       1 nanny.go:112] Got EOF from stderr
I0612 11:07:05.822148       1 nanny.go:111] 
W0612 11:07:05.822161       1 nanny.go:112] Got EOF from stdout
F0612 11:07:05.822175       1 nanny.go:182] dnsmasq exited: exit status 5

We upgraded the cluster, and the error is gone.

This looks like related to: kubernetes/kubernetes#32526

Kops indeed has this setting on the node:

cat /proc/sys/fs/inotify/max_user_instances
128

Would it be beneficial to update this number? We can PR if necessary?

As a side note, since the beginning of the cluster, I can't tail pods logs with this error message:

failed to create fsnotify watcher: too many open files%

I'm not sure it is related, but I though it is worth to mention.

As a second side note, inodes there are 4.33M free inodes.

The text was updated successfully, but these errors were encountered:

eigokor · 2017-06-12T12:26:03Z

Just for information, in order to finish upgrade we had to kill failing pod each time after next node were recreated. Since cops were reporting
0612 14:00:48.645469 41005 rollingupdate_cluster.go:430] Cluster did not validate, and waiting longer: your kube-system pods are NOT healthy

Once failing pod deleted, rolling update can be started again.

chrislovecnm · 2017-06-12T20:34:05Z

@pierreozoux

Would it be beneficial to update this number? We can PR if necessary?

Does updating this number fix the issue? We support a bunch of different operating systems and I am uncertain which OS's this impacts.

If we can figure out a solution, please PR ;)

pierreozoux · 2017-07-18T14:12:47Z

Closing in favor of #2912

pierreozoux mentioned this issue Jun 12, 2017

/proc/sys/fs/inotify/max_user_watches is too low kubernetes/kubernetes#46230

Closed

chrislovecnm mentioned this issue Jul 12, 2017

Increase fs.inotify.max_user_instances limit. Fixes #2912 #2913

Merged

pierreozoux closed this as completed Jul 18, 2017

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

dnsmask failed to create inotify #2709

dnsmask failed to create inotify #2709

pierreozoux commented Jun 12, 2017

eigokor commented Jun 12, 2017 •

edited

chrislovecnm commented Jun 12, 2017

pierreozoux commented Jul 18, 2017

dnsmask failed to create inotify #2709

dnsmask failed to create inotify #2709

Comments

pierreozoux commented Jun 12, 2017

eigokor commented Jun 12, 2017 • edited

chrislovecnm commented Jun 12, 2017

pierreozoux commented Jul 18, 2017

eigokor commented Jun 12, 2017 •

edited