New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Make worker depend on network-online.target to avoid networking errors #3673
Conversation
* This hopefully avoids the worker being stuck with the error "Address family for hostname not supported" (see https://progress.opensuse.org/issues/78390#note-38) * According to the documentation "its primary purpose is network client software that cannot operate without network"; I suppose our worker falls into that category (from https://www.freedesktop.org/wiki/Software/systemd/NetworkTarget)
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I've looked over the systemd docs, and this seems to be the correct solution.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The correct documentation for systemd also explains that code should always be designed to cope with a startup before network is ready. This is the wrong approach. If you say "perl is to blame" then we still need to change this
Nobody said Perl was to blame. As far as we've been able to track it down so far |
I also don't like this solution. It is certainly bad that our process is stuck in an error state even though the error is gone. That's also why I took so much time to investigate this further before creating this PR. However, I'm not sure what causes this problem. Most likely it is some internal caching of glibc's By the way, as far as I understand the documentation they are mainly talking about servers here. For clients (and this PR is about a client) the quote I've put in the commit message seems more relevant. |
That's how i interpret that section as well. The possible approaches mentioned are all server specific. |
I just stumbled over |
As discussed in the last weekly meeting I think there are other potential problems that this approach could cause but if you want to try that out on o3 or osd first and have good results then we can also accept the change in openQA repo here. Otherwise I suggest to go with #3676 first |
Let's go with #3676 first and let's not try two things at the same time. So I'm closing this for now. |
I'm not sure whether this would work anyways. The documentation mentions that the right "wait" service needs to be enabled as well but they only consider |
"Address family for hostname not supported"
(see https://progress.opensuse.org/issues/78390#note-38)
software that cannot operate without network"; I suppose our worker falls
into that category
(from https://www.freedesktop.org/wiki/Software/systemd/NetworkTarget)