-
Notifications
You must be signed in to change notification settings - Fork 50
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
snapshot stucks and PVC turns readOnlny #66
Comments
Sorry for the late response. If you still see this issue: could you include the kernel log from the node that is stopping progress. See the lines:
So hopefully node30 holds some clues. By the way, which version of piraeus + drbd are you using? |
Hi @WanzenBug! Uploaded the full kernel-log from that timeframe. we use the latest version of piraeus with drbd version 9.0.27-1 |
Thanks! Looks like a process is keeping the device open, either Is there perhaps some monitoring agent running on node30 that uses these commands? If you are running some LVM commands directly on the host (like
Otherwise LVM can deadlock (as seen in the |
We are using Zabbix and monitoring the mounts for free space, this could be the reason for locking. |
We didnt had much time to test it in different ways but needed a stable solution and solved it in an other way. |
Hi,
we have a MYSQL DB running with a linstore/piraeus PVC and we try to create snapshot for the mysql-data-pvc, but it does not work stable, sometimes it works sometimes not.
Often we have the issue that the "READYTOUSE" (on snapshot) does not turn to "true" and after some time the PVC thats in use by MYSQL turns readonly and the MYSQL-Statefullset turns inactive.
It takes about 20-30mins until the PVC can be used again by the MYSQL-Pod. When we scale down and up, it takes about 20-30 mins until the PVC can be mounted.
We want to use snapshots/cloning on our MYSQL-Service to create backups/dumps of the DB, but this issue happens too often that we cant really use the snapshot/clone features.
We have a script running to create a snapshot and it sends first a "LOCK TABLES FOR BACKUP" to the MYSQL-DB that there are no write operations when a snapshot is taken.
Our clusters are running on rancher
the nodes OS is Ubuntu 18.04, we use LVM as storagepool
we tried it on K8S v1.18.14, after we had this issue, we updated our K8S cluster to v1.19.7 but we have still the same issue.
In the dmesg outputs of a node we could see this when we tried to scale down and up the MYSQL-Statefullset:
The text was updated successfully, but these errors were encountered: