New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Second replication begins if first replication is not finished #8
Comments
This is a bug. WIll fix. |
So I guess this was a pull job? I cannot reproduce the issue.
What's still an open issue: where did all the dangling ssh sessions come from? |
I will try to replicate it this weekend, and will report back on my findings. I didn’t spend the time to investigate and record my findings last time... I just remember seeing numerous lines from “sudo pgrep -lf zrepl” |
Were you able to replicate the described behavior? |
OK, I was able to observer the issue on a testing system. I saw lots of defunct processes, most likely ssh processes that timed out but were not |
So I think I fixed the issue in 6b5bd0a --- it just landed in zrepl master. |
Hi Christian, |
During first replication of many Gigabytes of data, I initially had the interval of the pull job set as 10m, and the first replication would not be finished by the time the second one was called to start. I checked the status many hours later and could see numerous ssh sessions running which led me to believe multiple replication jobs were now running at once (which I dont think should ever happen). I expected that if another replication job was called to start before the previous had finished, the new job would just be cancelled entirely.
I did not look into the state of my replicated data, or if the replications were proceeding ok. It was purely the fact that multiple zrepl ssh sessions were running that led me to believe this was the behaviour.
The text was updated successfully, but these errors were encountered: