I want to adjust the time it takes for a Ceph OSD to be marked down. During failover, the OSD pod goes into the Pending state while it waits for the server to come back online, but "ceph status" still reports all OSDs as UP and IN. During this time "ceph osd status" does not load, and new PVCs cannot be created until Ceph marks the OSD as down. After 30 minutes or more, Ceph finally marks it down. The cluster recovers when the node rejoins, but a 30-minute outage is not ideal. Which Ceph config option adjusts this timeout?
Replies: 1 comment 1 reply
See the OSD config topic. Perhaps you need to modify osd_fast_fail_on_connection_refused.
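As a rough sketch of how that option could be inspected and changed with the ceph CLI: this assumes the commands run somewhere with admin access to the cluster (for a Rook deployment, the toolbox pod is an assumption here, not something stated above). The extra options and the values shown are illustrative, not recommendations, and whether any of them explains the 30-minute delay in this cluster is unverified.

```shell
# Assumption: run from a shell with an admin keyring, e.g. a toolbox pod.

# Read the current values before changing anything (read-only).
ceph config get osd osd_fast_fail_on_connection_refused
ceph config get osd osd_heartbeat_grace
ceph config get mon mon_osd_report_timeout

# Let peers report an OSD as failed immediately when a connection to it
# is refused. This only helps while the host still answers with
# "connection refused"; a host that is fully unreachable is detected
# through heartbeats instead.
ceph config set osd osd_fast_fail_on_connection_refused true

# Illustrative value: seconds without a heartbeat before peers report an
# OSD as down. Set at global scope because both monitors and OSDs read it.
ceph config set global osd_heartbeat_grace 20
```

After a change, "ceph config dump" lists the overrides stored in the monitors, which is a quick way to confirm that the setting was applied.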