[BUG]: CSI Driver - issue with creation volume from 1 of the worker nodes #1057
Comments
@deshab: Thank you for submitting this issue! The issue is currently awaiting triage. Please make sure you have given us as much context as possible. If the maintainers determine this is a relevant issue, they will remove the needs-triage label and respond appropriately. We want your feedback! If you have any questions or suggestions regarding our contributing process/workflow, please reach out to us at container.storage.modules@dell.com.
Any update on this issue?
@deshab: The team will look into this issue shortly.
@deshab: As it is reporting all zeros, it seems like an SDC issue. Please refer to the KB article here: https://www.dell.com/support/kbdoc/000213824
I'm a Dell employee and I get this message when accessing the KB: "This article is permission based. Find another article."
@deshab Can you share the details on the version of PowerFlex and the CSI driver version?
Hi @deshab, can you also share how many arrays were configured in the secret and the SDC version you installed?
PowerFlex - 3.6-700.013
scli --query_all_sdc | grep xxx
@deshab can you provide @suryagupta4 with the details on how your secrets were configured. Did you use one PowerFlex array or more than one as part of the secret creation?
@deshab: Requesting your inputs
@shanmydell are you referring to K8s secrets or the PowerFlex server side itself? Could you please confirm on this?
@suryagupta4: Please look into the inputs provided above
@suryagupta4 are you referring to K8s secrets or the PowerFlex server side itself? Could you please confirm this?
@deshab the secret yaml from which you created the secret
link: 18734
Missing link for 18734. Everything looks good here. Here is the output: MDM: "198.19.60.28,198.19.56.28"
@deshab are you getting the same output on both the worker nodes?
Here is the output: sudo /opt/emc/scaleio/sdc/bin/drv_cfg --query_mdm
@deshab can you share both the node pod logs for the driver container.
Working Node: 89-14
sudo /opt/emc/scaleio/sdc/bin/drv_cfg --query_mdm
sudo /opt/emc/scaleio/sdc/bin/drv_cfg --query_version
kubectl logs vxflexos-node-zdmp8 -n kob-vxflexos -c driver
Bad Node: 89-15
sudo /opt/emc/scaleio/sdc/bin/drv_cfg --query_mdm
sudo /opt/emc/scaleio/sdc/bin/drv_cfg --query_version
kubectl logs vxflexos-node-pk92r -n kob-vxflexos -c driver
@deshab it looks like the driver was installed a long time ago, and the part of the logs I want to see has since rolled over. Can you please clean up the pod and PVCs, reinstall the driver, and share the logs again? Please include output from these commands:
link: 19485
@suryagupta4 logs are not available for these commands
➜ k describe node wrk-89-14 | grep Labels: -A 10 -B 10
@deshab since you didn't provide the output of
@bharathsreekanth any inputs here?
Had a customer call on this. The SDC was reporting the correct system ID, but because the driver node pod had not been redeployed, the topology key was still reporting all zeros. Re-spinning the node pod added the correct topology key to the CSI node.
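For anyone hitting the same symptom, the step described in the comment above can be sketched roughly as follows. This is a minimal sketch, not an official procedure: `respin_node_pod` and `show_topology_keys` are hypothetical helper names, and it assumes the driver node pod is managed by a controller (typically a DaemonSet) that recreates it after deletion.

```shell
# Hypothetical helper: delete the driver node pod so that its controller
# recreates it and the driver registers its topology key again.
respin_node_pod() {
  pod="$1"
  namespace="$2"
  kubectl delete pod "$pod" -n "$namespace"
}

# Hypothetical helper: print the topology keys recorded on the CSINode object
# for the given node, to confirm the all-zero key is no longer reported.
show_topology_keys() {
  node="$1"
  kubectl get csinode "$node" -o jsonpath='{.spec.drivers[*].topologyKeys}'
}
```

With the names quoted earlier in this thread, that would be `respin_node_pod vxflexos-node-pk92r kob-vxflexos`, followed by `show_topology_keys <node-name>` once the new pod is running.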
Bug Description
Volume creation fails on one of the worker nodes; we are unable to create volumes on this node. The environment is still in this state if you want to troubleshoot the issue. Please let us know.
Logs
Error log:
kob-elastic-system 31m Warning ProvisioningFailed persistentvolumeclaim/elasticsearch-data-es-kob-es-hot-8 failed to provision volume with StorageClass "vxflexos-xfs": error generating accessibility requirements: topology map[csi-vxflexos.dellemc.com/0000000000000000:csi-vxflexos.dellemc.com] from selected node "wrk-10-x-x-x" is not in requisite: [map[csi-vxflexos.dellemc.com/187e850d57b03e0f:csi-vxflexos.dellemc.com]]
More info:
I saw an extra field added to the node labels. In what scenario is this added? csi-vxflexos.dellemc.com/0000000000000000=csi-vxflexos.dellemc.com
BAD Node:
beta.kubernetes.io/arch=amd64,beta.kubernetes.io/os=linux,csi-vxflexos.dellemc.com/0000000000000000=csi-vxflexos.dellemc.com,csi-vxflexos.dellemc.com/187e850d57b03e0f=csi-vxflexos.dellemc.com,kubernetes.io/arch=amd64,kubernetes.io/hostname=wrk-10-x-x-x,kubernetes.io/os=linux,route-reflector=,topology.kubernetes.io/zone=AZ2
Good Node:
beta.kubernetes.io/arch=amd64,beta.kubernetes.io/os=linux,csi-vxflexos.dellemc.com/187e850d57b03e0f=csi-vxflexos.dellemc.com,kubernetes.io/arch=amd64,kubernetes.io/hostname=wrk-10-x-x-x,kubernetes.io/os=linux,route-reflector=,topology.kubernetes.io/zone=AZ3
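The difference between the two label sets above is the extra `csi-vxflexos.dellemc.com/0000000000000000` entry on the bad node. A minimal sketch for spotting it in a comma-separated label string is shown here; `zero_topology_keys` is a hypothetical helper name, and it assumes the labels are passed in exactly the format pasted above.

```shell
# Hypothetical helper: print every csi-vxflexos topology label whose
# system ID consists only of zeros. Prints nothing for a healthy node.
zero_topology_keys() {
  labels="$1"
  # Split the comma-separated labels onto separate lines, then keep only
  # the driver's labels whose ID part is all zeros.
  printf '%s\n' "$labels" | tr ',' '\n' \
    | grep -E '^csi-vxflexos\.dellemc\.com/0+=' || true
}
```

Passing the BAD Node label string prints the single all-zero entry, while the Good Node string prints nothing.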
Screenshots
No response
Additional Environment Information
No response
Steps to Reproduce
Unknown; the cluster is in a STABLE state.
Expected Behavior
Users should be able to create volumes on all nodes that are part of the k8s cluster. The user
CSM Driver(s)
CSI Driver PowerFlex
Installation Type
No response
Container Storage Modules Enabled
No response
Container Orchestrator
Kubernetes 1.26.6
Operating System
Ubuntu