New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Snapshot is not valid. There is no checksum file. #6377
Comments
It looks like that persist is called twice, which causes this problem. We see It might happen because the SnapshotDirector is calling |
6428: [BACKPORT 0.26] fix(broker): fix race condition in persisting snapshot r=Zelldon a=MiguelPires ## Description Backports #6383. No changes were made to the PR. ## Related issues closes #6377 ## Definition of Done _Not all items need to be done depending on the issue and the pull request._ Code changes: * [ ] The changes are backwards compatibility with previous versions * [ ] If it fixes a bug then PRs are created to [backport](https://github.com/zeebe-io/zeebe/compare/stable/0.24...develop?expand=1&template=backport_template.md&title=[Backport%200.24]) the fix to the last two minor versions. You can trigger a backport by assigning labels (e.g. `backport stable/0.25`) to the PR, in case that fails you need to create backports manually. Testing: * [ ] There are unit/integration tests that verify all acceptance criterias of the issue * [ ] New tests are written to ensure backwards compatibility with further versions * [ ] The behavior is tested manually * [ ] The change has been verified by a QA run * [ ] The impact of the changes is verified by a benchmark Documentation: * [ ] The documentation is updated (e.g. BPMN reference, configuration, examples, get-started guides, etc.) * [ ] New content is added to the [release announcement](https://drive.google.com/drive/u/0/folders/1DTIeswnEEq-NggJ25rm2BsDjcCQpDape) Co-authored-by: Miguel Pires <miguel.pires@camunda.com>
6429: [BACKPORT 0.25] fix(broker): fix race condition in persisting snapshot r=Zelldon a=MiguelPires ## Description Backports #6383. No changes were made to the PR. ## Related issues closes #6377 ## Definition of Done _Not all items need to be done depending on the issue and the pull request._ Code changes: * [ ] The changes are backwards compatibility with previous versions * [ ] If it fixes a bug then PRs are created to [backport](https://github.com/zeebe-io/zeebe/compare/stable/0.24...develop?expand=1&template=backport_template.md&title=[Backport%200.24]) the fix to the last two minor versions. You can trigger a backport by assigning labels (e.g. `backport stable/0.25`) to the PR, in case that fails you need to create backports manually. Testing: * [ ] There are unit/integration tests that verify all acceptance criterias of the issue * [ ] New tests are written to ensure backwards compatibility with further versions * [ ] The behavior is tested manually * [ ] The change has been verified by a QA run * [ ] The impact of the changes is verified by a benchmark Documentation: * [ ] The documentation is updated (e.g. BPMN reference, configuration, examples, get-started guides, etc.) * [ ] New content is added to the [release announcement](https://drive.google.com/drive/u/0/folders/1DTIeswnEEq-NggJ25rm2BsDjcCQpDape) Co-authored-by: Miguel Pires <miguel.pires@camunda.com>
Describe the bug
Snapshot can't be persisted because the checksum file is missing.
It seems like we have a race condition between taking and persisting the snapshot.
There are no other errors which indicate problem on writing the snapshot.
Error group https://console.cloud.google.com/errors/CIzr1vKOuue-DA?service=zeebe&time=P7D&project=camunda-cloud-240911&authuser=1
To Reproduce
Not sure seems to be a race condition.
Expected behavior
No race condition and that I can persist my snapshot.
Log/Stacktrace
If possible add the full stacktrace or Zeebe log which contains the issue.
Full Stacktrace
Environment:
The text was updated successfully, but these errors were encountered: