-
Notifications
You must be signed in to change notification settings - Fork 458
Bug 1749446: Creates alternative CNI configuration directory for the cluster network operator #1105
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Bug 1749446: Creates alternative CNI configuration directory for the cluster network operator #1105
Conversation
|
/hold |
|
This would probably be better suited as a template... Take look in https://github.com/openshift/machine-config-operator/tree/master/templates/common/_base/files and see what you think. |
|
@kikisdeliveryservice -- that's a monstrous help, thank you! I think this is just what I'm looking for, would I create a And just create it in a file like |
15f58d3 to
a1f1ec7
Compare
|
/retest |
|
e2e-aws died waiting for bootstrap completion. From the gathered tarball's Would be nice if they mentioned the condition that had timed out ;). There are a number of error lines in So I dunno what happened. We'll see how the retest goes. |
|
@wking I've been looking through the logs and haven't figured it out.. thought maybe retesting will help |
|
Oh, thinking this PR over a bit more gave me an idea to grep for in the gathered tarball ;) $ grep -r multus/cni
...
control-plane/10.0.129.21/containers/kube-multus-e7ca15b25b701bdab39b49dc5534db009ff6f72c2905573827d542966adfc33c.log:2019-09-05T20:39:29+0000 Generating Multus configuration file using files in /host/var/run/multus/cni/net.d...
...
$ cat control-plane/10.0.129.21/containers/kube-multus-e7ca15b25b701bdab39b49dc5534db009ff6f72c2905573827d542966adfc33c.log
2019-09-05T20:39:29+0000 Generating Multus configuration file using files in /host/var/run/multus/cni/net.d...
2019-09-05T20:39:29+0000 Attemping to find master plugin configuration, attempt 0
2019-09-05T20:39:34+0000 Attemping to find master plugin configuration, attempt 5
...
2019-09-05T20:49:26+0000 Attemping to find master plugin configuration, attempt 595
2019-09-05T20:49:31+0000 ERR: {Multus could not be configured: no master plugin was found.}I don't understand multus enough to know what that means. But maybe we need to drop in more than the empty directory holder? |
|
I saw multus is somehow connected to sdn: |
|
Thanks for triggering the restart. The |
|
cni-plugins containers look happy, although we don't have logs for them: $ wc -l control-plane/*/containers/cni-plugins-*.log
0 control-plane/10.0.129.21/containers/cni-plugins-2e8ba60c18074801d4f2b9847d5a74170faab910085d6e02761cdaa4289d652b.log
0 control-plane/10.0.134.250/containers/cni-plugins-fab3a0e40eefa7943a0cab0129b33bb1874ed782446b4379c53c8f7f14c7f33f.log
0 control-plane/10.0.159.6/containers/cni-plugins-a08e3f7cae4781717866a444e8da3951241eb73e17ce33cc812f004cd66f478d.log
0 total
$ jq -r '.status | .finishedAt + "\t" + .state + "\t" + (.exitCode | tostring)' control-plane/*/containers/cni-plugins-*.inspect
2019-09-05T20:29:24.00616605Z CONTAINER_EXITED 0
2019-09-05T20:29:23.816250778Z CONTAINER_EXITED 0
2019-09-05T20:29:24.107361774Z CONTAINER_EXITED 0Also install-cni-plugins containers: $ head control-plane/*/containers/install-cni-plugins-*.log
==> control-plane/10.0.129.21/containers/install-cni-plugins-9709d95c620224f019a1d7edaf07b73272bbd7d2226fc8e181427d67e967d8d7.log <==
+ cp -f /usr/src/plugins/bin/loopback /usr/src/plugins/bin/host-local /host/opt/cni/bin
==> control-plane/10.0.134.250/containers/install-cni-plugins-811d847504753eb2c69605fbdee748b74399a2d062a53bc54d189091ed843e22.log <==
+ cp -f /usr/src/plugins/bin/loopback /usr/src/plugins/bin/host-local /host/opt/cni/bin
==> control-plane/10.0.159.6/containers/install-cni-plugins-1f8b64a80db4e68a55924cff7c17960eb35feaf704d145b4b52dfc0d786b8b4f.log <==
+ cp -f /usr/src/plugins/bin/loopback /usr/src/plugins/bin/host-local /host/opt/cni/bin
$ jq -r '.status | .finishedAt + "\t" + .state + "\t" + (.exitCode | tostring)' control-plane/*/containers/install-cni-plugins-*.inspect
2019-09-05T20:29:31.662820192Z CONTAINER_EXITED 0
2019-09-05T20:29:31.718027272Z CONTAINER_EXITED 0
2019-09-05T20:29:31.726282675Z CONTAINER_EXITED 0SDN containers seem to be dying with: $ tail -n2 control-plane/10.0.129.21/containers/sdn-b43bd292f53e0c76cea4daf0d6b79d7f6d3ae6207d4122fa4df9550a9c817a75.log
I0905 20:55:27.453413 4389 healthcheck.go:42] waiting for OVS to start: dial unix /var/run/openvswitch/db.sock: connect: no such file or directory
I0905 20:55:28.453428 4389 healthcheck.go:42] waiting for OVS to start: dial unix /var/run/openvswitch/db.sock: connect: no such file or directory |
|
I think the bug should be: https://bugzilla.redhat.com/show_bug.cgi?id=1749446 the one linked above has been marked as a dupe for this one. |
|
/retitle [WIP] Bug 1749446: Creates alternative CNI configuration directory for the cluster network operator |
|
@dougbtv: This pull request references Bugzilla bug 1749446, which is valid. The bug has been moved to the POST state. The bug has been updated to refer to the pull request using the external bug tracker. In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
|
Still not back to a desk, but I'm wondering if /var/run (edit: as a parent directory of the for created by this patch) is getting created early or something (edit: with different perms?)? Otherwise, I haven't seen this out of openshift-sdn before, that error as posted. |
a1f1ec7 to
e6b29a8
Compare
|
New commit creates |
|
/retest Didn't quite look the same, API didn't come up. I'm not 100% positive, but, this looks like a potential flake. So, giving it another run. |
e6b29a8 to
c0103ab
Compare
|
I got @danwinship to take a look, and he's noted that:
I'm starting to wonder if there was any legs to my thought earlier about Edit: trying the |
c0103ab to
5c69661
Compare
|
We can't create files in You need to install a systemd tmpfiles snippet instead. |
|
It's a lot like having an RPM install files in |
|
Another alternative is to have an init container pre-create the directories. |
5c69661 to
e189be7
Compare
|
Thanks @cgwalters!! Huge help. I've got a new commit pushed where I added another instruction to Added: |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Mind changing your commit message to something like:
templates: Create /run/multus/cni/net.d/
The multus containers need this to exist.
https://bugzilla.redhat.com/show_bug.cgi?id=1749446
See https://chris.beams.io/posts/git-commit/ (and git log in this repository)
|
/approve |
The multus containers need this to exist. https://bugzilla.redhat.com/show_bug.cgi?id=1749446
e189be7 to
3e11bb7
Compare
|
Thanks @cgwalters for the comments, got an updated commit referencing |
|
/lgtm But you have both WIP and a |
kikisdeliveryservice
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
glad this got sorted out. thanks @cgwalters !!
|
[APPROVALNOTIFIER] This PR is APPROVED This pull-request has been approved by: cgwalters, dougbtv, kikisdeliveryservice The full list of commands accepted by this bot can be found here. The pull request process is described here
Needs approval from an approver in each of these files:
Approvers can indicate their approval by writing |
|
@dougbtv do we feel this is good to go now? I'll lift the hold if that's the case |
|
@runcom looking good to me! Thanks! /hold cancel |
|
@dougbtv: All pull requests linked via external trackers have merged. Bugzilla bug 1749446 has been moved to the MODIFIED state. In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
This is a WIP to look at addressing release blocker bz @ https://bugzilla.redhat.com/show_bug.cgi?id=1749448
NOTE: Originally submitted as a test. I am actively looking for feedback as to where this code to create a directory should exist. Originally, I simply found a convenient part to sketch it in (otherwise, I am somewhat unfamiliar with the MCO)