intermittent failure to resolve DNS records for services #4222
Comments
Thank you for reporting this, @javacruft. Is it possible to have an inspection report?
I was looking into a failed deployment and seemed to hit this issue:
@gnuoy thanks for the report, we'll have a look!
I have another deployment with this symptom and I've narrowed it down a bit. I've taken a look at the DNS query OpenStack uses, and the interesting bits are that it uses UDP and enables the search option. On a working system:
On a broken system:
(Note the trailing '.' on the FQDN in the second query.) I can also 'fix' the broken system by updating the ndots option in /etc/resolv.conf from
Inspection report from the broken deployment mentioned in the previous comment:
Using
On the broken system :
On the broken system:
I wonder if .maas is getting appended to the queries that don't have a trailing dot (so they aren't FQDNs), and that then causes (me frantically waving hands) an issue with the ndots setting. On both systems the dns plugin was enabled without specifying any nameservers.
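To make the hypothesis above concrete, here is a minimal Python sketch of how a glibc-style stub resolver orders its queries from the `search` list and `ndots` value in /etc/resolv.conf. The function name `candidate_names` is hypothetical, and the search list and `ndots:5` in the usage comment are assumptions (typical Kubernetes pod defaults plus a host-supplied `maas` domain), not values taken from the affected systems.

```python
def candidate_names(name, search, ndots=1):
    """Return the fully qualified names a glibc-style resolver tries, in order.

    name   -- the hostname being looked up
    search -- list of search domains (the "search" line in resolv.conf)
    ndots  -- the "options ndots:N" threshold (resolv.conf default is 1)
    """
    # A trailing dot marks the name as absolute: the search list is skipped.
    if name.endswith("."):
        return [name]

    as_is = name + "."
    # A search entry of "." (the root) adds nothing beyond the as-is query,
    # so it is dropped here.
    with_search = [
        f"{name}.{domain.rstrip('.')}." for domain in search if domain.strip(".")
    ]

    # With at least ndots dots, the name is tried as-is first;
    # otherwise every search domain is tried before the name itself.
    if name.count(".") >= ndots:
        return [as_is] + with_search
    return with_search + [as_is]


# Assumed pod configuration, for illustration only:
# pod_search = ["openstack.svc.cluster.local", "svc.cluster.local",
#               "cluster.local", "maas"]
# candidate_names("nova-api-mysql-router.openstack.svc.cluster.local",
#                 pod_search, ndots=5)
```

Under those assumed settings the service name has only four dots, so it is queried with every search domain appended (including `.maas`) before it is tried as-is. With an empty search list, or with a trailing dot on the name, only the as-is query is sent.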
Thanks for sharing, @gnuoy. I have a similar issue: deploying microstack on a freshly installed Ubuntu 22.04 by following the instructions at https://microstack.run/docs/multi-node always fails with "Error: Timed out while waiting for model 'openstack' to be ready". Running "juju status -m openstack" shows that the neutron and nova statuses are stuck at "blocked":
The nova-conductor logs show this error:
neutron-server shows a similar error:
Editing the ndots option in /etc/resolv.conf inside the neutron-server and nova-conductor pods gets me past that error. However, nova-conductor then hits another error:
After reinstalling Ubuntu several times, and then finding the microstack tear-down procedure (https://discourse.ubuntu.com/t/tear-down-your-openstack-lab-environment/25078/11), I finally traced the issue to the "search" option in /etc/resolv.conf (which comes from the netplan config). After setting "search" to blank ("[]" in netplan or "." in resolv.conf), the sunbeam bootstrap process finishes successfully. Summary:
Summary
During testing of OpenStack Sunbeam we occasionally see an issue where pods are unable to resolve the hostname associated with another service in the same deployment:
https://bugs.launchpad.net/snap-openstack/+bug/2033680
This typically materialises when something in the pod tries to connect to the remote service:
(2003, "Can't connect to MySQL server on 'nova-api-mysql-router.openstack.svc.cluster.local' ([Errno -2] Name or service not known)")
DNS addon is enabled.
What Should Happen Instead?
Hostname of service should be resolvable.
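One way to check this expectation from inside an affected pod is a small Python probe such as the sketch below. The helper name `can_resolve` and its injectable `resolver` parameter are hypothetical and exist only so the function can be exercised without a network; `socket.getaddrinfo` and `socket.gaierror` are the standard library calls behind the "[Errno -2] Name or service not known" error quoted above.

```python
import socket


def can_resolve(hostname, resolver=socket.getaddrinfo):
    """Return True if hostname resolves, False on a name-resolution error."""
    try:
        # The port is irrelevant for a pure lookup, so None is passed.
        resolver(hostname, None)
        return True
    except socket.gaierror:
        # Covers "[Errno -2] Name or service not known" and similar failures.
        return False
```

Calling `can_resolve` on `nova-api-mysql-router.openstack.svc.cluster.local` both with and without a trailing dot would show whether only the search-list path is failing, since the trailing dot bypasses the search list.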
Reproduction Steps
We're not able to reproduce this error consistently.
Introspection Report
Not currently collected - have requested.
Can you suggest a fix?
Not based on current information - debugging or additional log collection would be great.
Are you interested in contributing with a fix?
no