-
Notifications
You must be signed in to change notification settings - Fork 392
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
oVirt/RHV disabled nodeip-configuration.service #2529
Comments
Could you PTAL since I think you are more familiar? |
also for ovirt /assign @rgolangh |
/assign @Gal-Zaidman |
Seems like #2567 is related |
+1 Looks like it should fix this. |
Just for the record: Enabling the I will report back and close the issue if it works without manual intervention. |
Closing issue reopen if required. |
Description
Our RHV IPI installed clusters (4.6.21, 4.6.24) are showing the
nodeip-configuration.service
as disabled (enabled: false
) in00-master
and00-worker
machine config resources. We are currently investigating a major cluster stability problem since our upgrade from 4.6.18 to 4.6.22 in one cluster. This lead us to another issue (https://bugzilla.redhat.com/show_bug.cgi?id=1940939) that discribes a problem with this service, but is explicitly mentioning that the service is enabled (but not working properly). The --node-ip parameter of kubelet is empty, resulting in a flip-flop behavior on nodes binding 2 ips (like the ingress or api IPs).As far as I understand the code at hand the service should be enabled, according to following source-code parts, because ingress and API IPs are set in the referenced resource (controllerconfig/machine-config-controller):
machine-config-operator/templates/common/on-prem/units/nodeip-configuration.service.yaml
Line 2 in 9c6c2bf
machine-config-operator/pkg/controller/template/render.go
Line 442 in 9c6c2bf
machine-config-operator/pkg/operator/render.go
Line 236 in 9c6c2bf
A current workaround is to start the nodeip-configuration service manually, calling systemctl daemon-reload and restarting kubelet. After that the --node-ip argument is showing the right ip and the cluster starts to stabalize.
Steps to reproduce the issue:
nodeip-configuration.service
is disabled (enabled: false)Describe the results you received:
Describe the results you expected:
Additional information you deem important (e.g. issue happens only occasionally):
We installed a new OCP 4.6.21 Cluster yesterday, which shows the same misconfiguration, but runs stable for now. Also the unstable cluster was updated to 4.6.24 today showing no change in behavior.
Output of
oc adm release info --commits | grep machine-config-operator
:Additional environment details (platform, options, etc.):
RHV 4.4
The text was updated successfully, but these errors were encountered: