
Best method to permanently modify kube-dns configuration? #1089

Closed
pedrobizzotto opened this issue Dec 22, 2017 · 9 comments
Labels
lifecycle/rotten Denotes an issue or PR that has aged beyond stale and will be auto-closed.

Comments

@pedrobizzotto

pedrobizzotto commented Dec 22, 2017

Hello all,

kube-aws 0.9.8 user here

To solve some issues we encountered with name resolution, I applied two modifications to kube-dns: a custom configmap pointing our most used external domains at the VPC DNS endpoint, and a change to the dnsmasq options so that it no longer caches negative responses.

The problem I'm having is that whenever I replace a controller instance, or make any change that results in a new controller instance coming up, both the configmap and the deployment are reset to the default values. I've traced this to the script /opt/bin/install-kube-system, which applies the deployment and configmap files present in /srv/kubernetes/manifest every time a controller instance is created.

Is there a way to make the changes permanent without modifying the cloud-init file for the controller?

Thanks,
Pedro S. Bizzotto

@mumoshu
Contributor

mumoshu commented Dec 26, 2017

@pedrobizzotto Hi, thanks for trying kube-aws!

Is there a way to make the changes permanent without modifying the cloud-init file for the controller?

Unfortunately, no - would you like to make it a feature request?
It would also help to know the specific reason you don't want to modify cloud-config-controller to persist the kube-dns configmap content!

@pedrobizzotto
Author

@mumoshu Hello, thanks for the answer.
The environment I'm working on has an AD server configured as the DNS in the DHCP options for the VPC.
We have issues where that DNS server returns empty responses for some queries, especially non-authoritative ones, and those empty responses get cached by the dnsmasq container of the kube-dns pod.
My workaround was to add a custom configmap pointing the most used external domains at the VPC DNS endpoint, and to change the dnsmasq options so it does not cache negative responses.
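
For reference, this is roughly the shape of what I applied. The domain, the resolver IP and the container index below are placeholders/assumptions, not the exact values from our environment:

# Stub-domain entries forwarding our internal zone to the VPC/AD resolver
# (domain and IP below are placeholders).
kubectl -n kube-system apply -f - <<'EOF'
apiVersion: v1
kind: ConfigMap
metadata:
  name: kube-dns
  namespace: kube-system
data:
  stubDomains: |
    {"corp.example.local": ["10.0.0.2"]}
  upstreamNameservers: |
    ["10.0.0.2"]
EOF

# On the dnsmasq side, append --no-negcache to the dnsmasq container's args so
# negative (empty) answers are not cached. This assumes dnsmasq is the second
# container in the kube-dns deployment; adjust the index if yours differs.
kubectl -n kube-system patch deployment kube-dns --type='json' \
  -p='[{"op": "add", "path": "/spec/template/spec/containers/1/args/-", "value": "--no-negcache"}]'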

The reason I want to make the changes without altering cloud-config-controller is that any modification there requires replacing the controller instances before it takes effect. In a dev/QA setup that isn't an issue, but in a production setup it is more serious.

I also just had a case where the deployment and configmap were reset even though we apparently had not replaced any of the controller instances; I'll try to gather more info on this.

Thanks again!

@pedrobizzotto
Author

@mumoshu, hello again,
It seems the systemd unit install-kube-system runs on every reboot of the controller.
Here is a snippet of the output of journalctl -u install-kube-system:

-- Logs begin at Tue 2017-12-19 18:17:04 UTC, end at Thu 2017-12-28 12:19:14 UTC. --
Dec 19 18:17:52 ip-xxx-xxx-xxx-xxx.somedomain.local systemd[1]: Starting install-kube-system.service...
Dec 19 18:17:52 ip-xxx-xxx-xxx-xxx.somedomain.local bash[1041]: activating
...snip...
Dec 19 18:21:02 ip-xxx-xxx-xxx-xxx.somedomain.local retry[2292]: rolebinding "heapster-nanny" created
Dec 19 18:21:02 ip-xxx-xxx-xxx-xxx.somedomain.local systemd[1]: Started install-kube-system.service.
-- Reboot --
Dec 22 15:02:18 ip-xxx-xxx-xxx-xxx.somedomain.local systemd[1]: Starting install-kube-system.service...
Dec 22 15:02:18 ip-xxx-xxx-xxx-xxx.somedomain.local bash[1042]: activating
...snip...
Dec 22 15:02:51 ip-xxx-xxx-xxx-xxx.somedomain.local retry[2257]: configmap "kube-dns" configured
...snip...
Dec 22 15:02:53 ip-xxx-xxx-xxx-xxx.somedomain.local retry[2257]: deployment "kube-dns" configured

Is this the intended behavior?

Thanks!

@mumoshu
Contributor

mumoshu commented Apr 25, 2018

@pedrobizzotto Hi! Sorry for the late reply.
Yes, it is definitely the expected behavior, but I'm not satisfied with it either.
May I ask for your ideas?

Like:

  • kube-aws update updates just the controller nodes; kube-system components are left untouched.
    • You run kubectl apply ... yourself afterwards (see the sketch after this list).
  • A CloudFormation custom resource to literally run kubectl apply from outside the controller nodes?
    • Sounds a bit overkill.
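
For the first option, the manual step afterwards would look roughly like this; the file names are just placeholders for wherever you keep your customized manifests:

# After the stack update finishes, re-apply your own customized manifests.
kube-aws update
kubectl -n kube-system apply -f my-kube-dns-configmap.yaml    # placeholder file name
kubectl -n kube-system apply -f my-kube-dns-deployment.yaml   # placeholder file name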

@pedrobizzotto
Author

Hello, thanks for the response,

I don't know yet how hard it would be to implement, but maybe the script called by the systemd unit should check for the existence of the components and only apply them if they are missing?
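
Just to sketch the idea (this is not the actual install-kube-system code, and the manifest file names are made up):

# Create kube-system components only when they don't exist yet;
# leave anything that already exists untouched.
if ! kubectl -n kube-system get configmap kube-dns >/dev/null 2>&1; then
  kubectl -n kube-system apply -f /srv/kubernetes/manifest/kube-dns-configmap.yaml
fi
if ! kubectl -n kube-system get deployment kube-dns >/dev/null 2>&1; then
  kubectl -n kube-system apply -f /srv/kubernetes/manifest/kube-dns-deployment.yaml
fi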

This can, of course, leave the components in a non-optimal state, for example if you mess up the configuration of the deployments after the cluster has booted, but I think that's something you can fix manually later with kubectl apply.

@fejta-bot

Issues go stale after 90d of inactivity.
Mark the issue as fresh with /remove-lifecycle stale.
Stale issues rot after an additional 30d of inactivity and eventually close.

If this issue is safe to close now please do so with /close.

Send feedback to sig-testing, kubernetes/test-infra and/or fejta.
/lifecycle stale

@k8s-ci-robot k8s-ci-robot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Apr 23, 2019
@fejta-bot

Stale issues rot after 30d of inactivity.
Mark the issue as fresh with /remove-lifecycle rotten.
Rotten issues close after an additional 30d of inactivity.

If this issue is safe to close now please do so with /close.

Send feedback to sig-testing, kubernetes/test-infra and/or fejta.
/lifecycle rotten

@k8s-ci-robot k8s-ci-robot added lifecycle/rotten Denotes an issue or PR that has aged beyond stale and will be auto-closed. and removed lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. labels May 23, 2019
@fejta-bot

Rotten issues close after 30d of inactivity.
Reopen the issue with /reopen.
Mark the issue as fresh with /remove-lifecycle rotten.

Send feedback to sig-testing, kubernetes/test-infra and/or fejta.
/close

@k8s-ci-robot
Contributor

@fejta-bot: Closing this issue.

In response to this:

Rotten issues close after 30d of inactivity.
Reopen the issue with /reopen.
Mark the issue as fresh with /remove-lifecycle rotten.

Send feedback to sig-testing, kubernetes/test-infra and/or fejta.
/close

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.
