vsphere nodeip-configuration.service: run node-ip binary from container #1829

Merged

Conversation

vrutkovs
Member

- What I did
Replaced the Python nodeip-finder script with the bare-metal version, which runs in a container. This is required on FCOS, which does not ship a Python interpreter. It also means fixes for this functionality no longer have to be duplicated.
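
To illustrate the kind of decision a node-IP finder has to make, here is a minimal sketch. It is not the logic of the containerized binary, which this thread does not show. The function name `pick_node_ip` and the optional `vip` parameter are hypothetical, and the selection rule (skip loopback and link-local addresses, prefer the subnet that contains a given virtual IP) is an assumption made for the example.

```python
import ipaddress


def pick_node_ip(addresses, vip=None):
    """Pick a node IP from CIDR strings such as "139.178.94.234/25".

    Hypothetical helper: loopback and link-local addresses are skipped.
    If `vip` is given, an address whose subnet contains it is preferred;
    otherwise the first remaining candidate is returned.
    """
    candidates = []
    for cidr in addresses:
        iface = ipaddress.ip_interface(cidr)
        if iface.ip.is_loopback or iface.ip.is_link_local:
            continue
        candidates.append(iface)

    if vip is not None:
        vip_addr = ipaddress.ip_address(vip)
        for iface in candidates:
            # Membership is False when the IP versions differ.
            if vip_addr in iface.network:
                return str(iface.ip)

    return str(candidates[0].ip) if candidates else None
```

Calling `pick_node_ip(["127.0.0.1/8", "139.178.94.234/25"])` skips the loopback entry and returns the address on the second interface.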

- How to verify it
vSphere IPI install should pass

- Description for the changelog
A container is now used to determine the node IP on vSphere

/cc @jcpowermac @patrickdillon

@jcpowermac
Contributor

/test e2e-vsphere

@vrutkovs
Member Author

openshift/release#9702 would add IPI tests. The UPI flow is not affected by this change.

@vrutkovs
Member Author

/test e2e-vsphere-upi
/test e2e-vsphere

@jcpowermac
Contributor

Let's retry IPI - bootstrap didn't complete
/test e2e-vsphere

@jcpowermac
Contributor

/test e2e-vsphere

@jcpowermac
Contributor

cri-o is unhappy: it is configured with the wrong IP for the node. The interface has 139.178.94.234, but cri-o tries to bind 139.178.94.235:

2: ens192: <BROADCAST,MULTICAST,UP,LOWER_UP> mtu 1500 qdisc mq state UP group default qlen 1000
    link/ether 00:50:56:b9:5e:70 brd ff:ff:ff:ff:ff:ff
    inet 139.178.94.234/25 brd 139.178.94.255 scope global dynamic noprefixroute ens192
Jun 17 01:55:43 ci-op-jgt1ifhj-6b38f-ss4gt-master-0 crio[2805]: time="2020-06-17 01:55:43.015599455Z" level=fatal msg="Failed to start streaming server: listen tcp 139.178.94.235:10010: bind: cannot assign requested address"
Jun 17 01:55:43 ci-op-jgt1ifhj-6b38f-ss4gt-master-0 systemd[1]: crio.service: Main process exited, code=exited, status=1/FAILURE
Jun 17 01:55:43 ci-op-jgt1ifhj-6b38f-ss4gt-master-0 systemd[1]: crio.service: Failed with result 'exit-code'.
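
The mismatch in the log above can be checked mechanically. The following is a small sketch, assuming the log message keeps the exact wording shown here. The helper name `stale_bind_address` is hypothetical, and the set of assigned addresses is supplied by the caller rather than read from the host.

```python
import re

# Matches the bind failure wording seen in the cri-o log line above.
BIND_ERROR = re.compile(
    r"listen tcp (?P<ip>[0-9.]+):(?P<port>\d+): "
    r"bind: cannot assign requested address"
)


def stale_bind_address(log_line, assigned_ips):
    """Return the IP a service failed to bind if the host does not own it.

    Returns None when the line is not a bind failure, or when the address
    is in `assigned_ips` (so the failure has some other cause).
    """
    match = BIND_ERROR.search(log_line)
    if match is None:
        return None
    ip = match.group("ip")
    return None if ip in assigned_ips else ip
```

Given the cri-o line above and `{"139.178.94.234"}` as the assigned set, the helper reports `139.178.94.235`, the address recorded before the reboot.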

@vrutkovs
Member Author

/test e2e-vsphere

@vrutkovs
Member Author

Ah, the service runs right after firstboot and records that IP. Then machine-config-daemon-host.service pivots to the new machine-os-content and reboots, and the VM gets a different IP.

@vrutkovs
Member Author

/test e2e-vsphere

@vrutkovs vrutkovs force-pushed the vsphere-nodeipfinder-podman branch from 26dc1b9 to 38b2340 Compare June 17, 2020 09:41
@vrutkovs
Member Author

/test e2e-vsphere

@vrutkovs vrutkovs force-pushed the vsphere-nodeipfinder-podman branch from 38b2340 to 85441f3 Compare June 17, 2020 11:20
@vrutkovs
Member Author

/test e2e-vsphere

@vrutkovs vrutkovs force-pushed the vsphere-nodeipfinder-podman branch from 85441f3 to 1525b94 Compare June 17, 2020 11:35
@vrutkovs
Member Author

/test e2e-vsphere

@vrutkovs vrutkovs force-pushed the vsphere-nodeipfinder-podman branch from 1525b94 to ce18b1f Compare June 17, 2020 11:53
@vrutkovs
Member Author

/test e2e-vsphere

Use a containerized golang binary instead of the Python script. It also reloads the systemd daemon so that the crio settings are applied to the loaded unit.
@vrutkovs vrutkovs force-pushed the vsphere-nodeipfinder-podman branch from ce18b1f to 16be8fc Compare June 17, 2020 13:21
@vrutkovs
Member Author

/test e2e-vsphere

@vrutkovs
Member Author

Cleaned up the commits and added systemctl daemon-reload so that cri-o picks up the new settings if the node IP changes between the first boot and the post-pivot boot.
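
The idea of rewriting the setting and reloading systemd only when the IP has changed can be sketched as follows. This is an illustration, not the unit shipped in this PR. The helper name `refresh_dropin`, the drop-in layout, and the `CONTAINER_STREAM_ADDRESS` variable are assumptions, and the `run` parameter exists only so the command runner can be swapped out.

```python
import subprocess
from pathlib import Path


def refresh_dropin(dropin_path, node_ip, run=subprocess.run):
    """Write a systemd drop-in for the node IP and reload systemd if it changed.

    Returns True when the file was (re)written and daemon-reload was
    requested, False when the recorded IP was already current.
    """
    path = Path(dropin_path)
    # Assumed drop-in content; the variable name is not confirmed by the thread.
    wanted = f'[Service]\nEnvironment="CONTAINER_STREAM_ADDRESS={node_ip}"\n'

    current = path.read_text() if path.exists() else None
    if current == wanted:
        return False

    path.parent.mkdir(parents=True, exist_ok=True)
    path.write_text(wanted)
    # Without a reload, systemd keeps using the unit settings it loaded earlier.
    run(["systemctl", "daemon-reload"], check=True)
    return True
```

Running it on first boot and again after the pivot reboot would rewrite the file only if the VM came back with a different address, which is the case described in this thread.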

This ensures the base OS does not require Python on the host.
@vrutkovs
Member Author

/test e2e-vsphere

@openshift-ci-robot
Contributor

@vrutkovs: The following tests failed, say /retest to rerun all failed tests:

| Test name | Commit | Details | Rerun command |
| --- | --- | --- | --- |
| ci/prow/e2e-vsphere | 74c7e36 | link | /test e2e-vsphere |
| ci/prow/e2e-metal-ipi | 74c7e36 | link | /test e2e-metal-ipi |
| ci/prow/e2e-aws-scaleup-rhel7 | 74c7e36 | link | /test e2e-aws-scaleup-rhel7 |

Full PR test history. Your PR dashboard. Please help us cut down on flakes by linking to an open issue when you hit one in your PR.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. I understand the commands that are listed here.

@jcpowermac
Contributor

/lgtm
Thanks @vrutkovs !

@openshift-ci-robot openshift-ci-robot added the lgtm Indicates that a PR is ready to be merged. label Jun 17, 2020
@cgwalters
Member

/approve

But I would really like to see us rethink this stuff at some point after 4.6, because since openshift/enhancements#291 we have much more powerful abilities, particularly for vSphere, to control exactly what the node is doing from the first boot.

@openshift-ci-robot
Contributor

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: cgwalters, jcpowermac, vrutkovs

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@openshift-ci-robot openshift-ci-robot added the approved Indicates a PR has been approved by an approver from all required OWNERS files. label Jun 17, 2020
@openshift-merge-robot openshift-merge-robot merged commit da05f8b into openshift:master Jun 18, 2020
@vrutkovs vrutkovs deleted the vsphere-nodeipfinder-podman branch September 16, 2020 22:29
5 participants