Container-in-VM: put DNS servers from all ports to resolv.conf #3826

Merged: 1 commit merged into lf-edge:master on Apr 9, 2024

Conversation

milan-zededa
Contributor

@milan-zededa commented Mar 19, 2024

When a container application has multiple network interfaces, it should be configured to fail over between DNS servers collected from all of them.

The current behaviour is that we only put DNS servers from the first (eth0) interface into resolv.conf. However, if the uplink port corresponding to the first app interface loses connectivity, name resolution stops working and the app will not try DNS servers from the other interfaces (which could potentially be using different uplinks). Moreover, nothing in the EVE API declares the first app interface as special and exclusively used for DNS.

Comparing this to Linux or Windows (i.e. VM apps), the default behaviour of the resolver is to iterate over all ports and try every DNS server until one responds. We should therefore replicate the same behaviour in the shim VM created for container applications.
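For illustration, a shim-VM resolv.conf aggregating servers from two app interfaces could look like the sketch below (all addresses are invented). One caveat worth noting: the glibc resolver only honours the first three nameserver entries (MAXNS) and tries them in listed order; the options line is an optional way to make failover faster:

```
# Servers learned via DHCP on eth0 (example addresses)
nameserver 10.11.12.1
# Servers learned via DHCP on eth1
nameserver 10.11.13.1
# Optional: shorter timeout and fewer attempts speed up failover
options timeout:2 attempts:2
```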

Additionally, to make sure that a query destined to a user-configured DNS server is sent out through the appropriate application interface, for every NI we use DHCP to propagate host routes (/32) for all DNS (and also NTP) servers into applications, with the NI bridge IP as the gateway.
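A minimal sketch of the resulting routes, with invented addresses (the real implementation delivers these via DHCP options rather than running commands inside the app):

```shell
# Sketch (not EVE's actual code): one /32 host route per user-configured
# DNS/NTP server, with the NI bridge IP as the gateway.
bridge_ip="10.11.12.1"              # assumed NI bridge address (example)
set -- 8.8.8.8 132.163.97.1         # user-configured DNS and NTP IPs (examples)
routes=""
for srv in "$@"; do
    routes="${routes}${srv}/32 via ${bridge_ip}
"
done
printf '%s' "$routes"
# Prints:
#   8.8.8.8/32 via 10.11.12.1
#   132.163.97.1/32 via 10.11.12.1
```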

@milan-zededa
Contributor Author

@naiming-zededa @gkodali-zededa @eriknordmark I'm not completely sure about this and would like to hear your opinion. Maybe there are some reasons why we pick DNS servers only from eth0 that I'm not aware of.


codecov bot commented Mar 19, 2024

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 17.51%. Comparing base (e4f2710) to head (67242c0).
Report is 6 commits behind head on master.

Additional details and impacted files
@@           Coverage Diff           @@
##           master    #3826   +/-   ##
=======================================
  Coverage   17.51%   17.51%           
=======================================
  Files           3        3           
  Lines         805      805           
=======================================
  Hits          141      141           
  Misses        629      629           
  Partials       35       35           


Contributor

@eriknordmark left a comment

I don't think there was a particular reason to only use eth0.

But as we use DNS servers from multiple interfaces, we depend even more on the app instance routing working correctly, so that the source IP address of DNS requests matches the interface over which the request is sent. Does the resolver code always leave the source IP address unset and let the kernel pick it based on the route?
If the DNS server is multiple hops away, then we can't rely on matching a default route, since we might have multiple default routes for different interfaces. I don't know which part of this we should document (asking the user to configure static routes to reach the DNS servers) versus adding code to set it up.

Some shellcheck issues to fix.

@milan-zededa
Contributor Author

milan-zededa commented Mar 21, 2024

> But as we use DNS servers from multiple interfaces, we depend even more on the app instance routing working correctly, so that the source IP address of DNS requests matches the interface over which the request is sent. Does the resolver code always leave the source IP address unset and let the kernel pick it based on the route?

Yes, the source IP is left unset and the routing table decides the next hop and the output interface.
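A generic way to observe this kernel behaviour (plain iproute2, nothing EVE-specific) is `ip route get`, which reports the output device and the source address the kernel would select for a destination, without sending a packet; loopback is used here only so the sketch works on any Linux box:

```shell
# With the source IP left unset, the resolver's queries get exactly the
# "src" address the kernel reports here for the matching route.
ip route get 127.0.0.1
```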

> But as we use DNS servers from multiple interfaces, we depend even more on the app instance routing working correctly, so that the source IP address of DNS requests matches the interface over which the request is sent.

In most cases the DNS IP will be the NI bridge IP, so routing will work correctly. If the user instead configures DNS server(s) from external networks, we could automatically generate static routes for them and propagate them into the connected apps. Actually, we already have the same situation with NTP servers. My only worry is that while users can add their own static routes, they cannot delete the automatically created ones, so they would not be able to remove or edit the routes towards DNS/NTP servers that we add automatically.

> Some shellcheck issues to fix.

Fixed

@eriknordmark
Contributor

> In most cases the DNS IP will be the NI bridge IP, so routing will work correctly. If the user instead configures DNS server(s) from external networks, we could automatically generate static routes for them and propagate them into the connected apps. Actually, we already have the same situation with NTP servers. My only worry is that while users can add their own static routes, they cannot delete the automatically created ones, so they would not be able to remove or edit the routes towards DNS/NTP servers that we add automatically.

They can choose to explicitly list the static routes and not import the ones from DHCP into the NI, right?

We should probably provide some examples for how to configure this with off-subnet ntp/dns/other servers.

@milan-zededa
Contributor Author

> They can choose to explicitly list the static routes and not import the ones from DHCP into the NI, right?

EVE currently propagates to applications the static IP routes (if any are configured) plus connected routes (if enabled). Routes from DHCP on the uplink port are not propagated to applications (they are only added to the NI routing table on the host side).
If we automatically generated routes for DNS/NTP and also propagated them to apps, these routes would not be editable by the user (unless we add some API knobs).

@naiming-zededa
Contributor

> My only worry is that while users can add their own static routes, they cannot delete the automatically created ones, so they would not be able to remove or edit the routes towards DNS/NTP servers that we add automatically.

The user can add a /32 route to their DNS/NTP destination, which should take precedence unless the DHCP-propagated route is also a /32, which is unlikely.

@milan-zededa
Contributor Author

> My only worry is that while users can add their own static routes, they cannot delete the automatically created ones, so they would not be able to remove or edit the routes towards DNS/NTP servers that we add automatically.

> The user can add a /32 route to their DNS/NTP destination, which should take precedence unless the DHCP-propagated route is also a /32, which is unlikely.

But in this case we are talking about EVE automatically adding routes for the DNS and NTP servers configured by the user for the network instance. Since these are IP addresses without a subnet prefix, we would use exactly the /32 prefix for the generated routes. What we could perhaps do is: if the user also creates /32 static routes for those IPs, then EVE will not generate its own routes and will install the user's instead.
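To illustrate the longest-prefix-match point with invented addresses: in a routing table like the one below, a /32 entry always wins over anything broader for that destination, so a conflict can only arise between two /32 routes for the same IP:

```
default via 10.11.12.1 dev eth0         # broad route (prefix length 0)
10.11.13.0/24 via 10.11.13.1 dev eth1   # subnet route (prefix length 24)
8.8.8.8 via 10.11.13.1 dev eth1         # /32 host route: most specific, wins
```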

@milan-zededa marked this pull request as ready for review on March 26, 2024 11:15
@milan-zededa
Contributor Author

milan-zededa commented Mar 26, 2024

@eriknordmark @naiming-zededa I have added propagation of /32 routes for the user-configured DNS and NTP servers. I cannot envision a realistic scenario where this could cause problems for the user. After all, it would not make much sense if a query towards a DNS/NTP server was sent through a different network instance than the one it is configured for.

Contributor

@eriknordmark left a comment

Do we allow users to specify NTP servers using names, or do we require that they be IP addresses?

@milan-zededa
Contributor Author

> Do we allow users to specify NTP servers using names, or do we require that they be IP addresses?

We require IP addresses for NTP servers: https://github.com/lf-edge/eve/blob/master/pkg/pillar/cmd/zedagent/parseconfig.go#L2052-L2058

Contributor

@eriknordmark left a comment

LGTM

@eriknordmark merged commit 5477648 into lf-edge:master on Apr 9, 2024
32 of 36 checks passed