LogStorageAppender should use the LeaderRole from correct term #6346

deepthidevaki · 2021-02-16T07:56:23Z

When LogStorageAppender appends a new record, it fetches the current LeaderRole object on every append. Because role transitions in raft and in Zeebe happen asynchronously, the LogStorageAppender may still be running after the raft role has stepped down to follower. The raft role can then become leader again at a newer term. As a result, after a step-down followed by becoming leader again, a LogStorageAppender that belongs to the previous term may append through a LeaderRole object from the newer term.
https://github.com/zeebe-io/zeebe/blob/e76d9256080e4e3331cfdd786210780e9b00ea35/logstreams/src/main/java/io/zeebe/logstreams/storage/atomix/AtomixLogStorage.java#L49

If the appender used the old LeaderRole object instead, it would not be able to append, because that role is already closed. It would be better to get the LeaderRole object once, immediately after the role transition to leader. That would prevent inconsistencies such as those observed in #6316 (comment).
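
To make the difference concrete, here is a minimal, self-contained sketch of the two approaches. It is not the Zeebe or Atomix code: `LeaderRole`, `RaftPartition`, `LookupAppender` and `PinnedAppender` are hypothetical stand-ins invented for illustration, and the asynchronous transitions are simulated sequentially.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Optional;
import java.util.function.Supplier;

public class AppenderTermSketch {

  /** Hypothetical stand-in for a raft leader role; not the real Atomix/Zeebe class. */
  static final class LeaderRole {
    final long term;
    final List<String> appended = new ArrayList<>();
    private boolean closed;

    LeaderRole(long term) {
      this.term = term;
    }

    void close() {
      closed = true;
    }

    void append(String entry) {
      if (closed) {
        throw new IllegalStateException("leader role for term " + term + " is closed");
      }
      appended.add(entry);
    }
  }

  /** Hypothetical stand-in for the raft side, whose role can change under the appender. */
  static final class RaftPartition {
    private LeaderRole current;

    LeaderRole becomeLeader(long term) {
      current = new LeaderRole(term);
      return current;
    }

    void stepDown() {
      if (current != null) {
        current.close();
      }
      current = null;
    }

    Optional<LeaderRole> currentLeader() {
      return Optional.ofNullable(current);
    }
  }

  /** Resolves the role on every append, so it can pick up a role from a newer term. */
  static final class LookupAppender {
    private final Supplier<Optional<LeaderRole>> roleSupplier;

    LookupAppender(Supplier<Optional<LeaderRole>> roleSupplier) {
      this.roleSupplier = roleSupplier;
    }

    void append(String entry) {
      roleSupplier.get()
          .orElseThrow(() -> new IllegalStateException("not leader"))
          .append(entry);
    }
  }

  /** Captures the role once at the leader transition and never resolves it again. */
  static final class PinnedAppender {
    private final LeaderRole role;

    PinnedAppender(LeaderRole role) {
      this.role = role;
    }

    void append(String entry) {
      role.append(entry);
    }
  }

  public static void main(String[] args) {
    RaftPartition raft = new RaftPartition();
    LeaderRole term1 = raft.becomeLeader(1);
    LookupAppender lookup = new LookupAppender(raft::currentLeader);
    PinnedAppender pinned = new PinnedAppender(term1);

    // Raft steps down and wins again before the term-1 appenders are closed.
    raft.stepDown();
    LeaderRole term2 = raft.becomeLeader(2);

    // The term-1 lookup appender silently writes through the term-2 role.
    lookup.append("record from term-1 appender");
    System.out.println("term 2 received: " + term2.appended);

    // The pinned term-1 appender fails instead, because its role is closed.
    try {
      pinned.append("record from term-1 appender");
    } catch (IllegalStateException e) {
      System.out.println("pinned appender rejected: " + e.getMessage());
    }
  }
}
```

In this sketch the lookup-based appender from term 1 ends up writing into the term-2 role, while the pinned appender fails on the closed term-1 role, which is the behaviour the issue asks for.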

npepinpe · 2021-02-16T08:19:05Z

Generally I would like to separate the appender part from the leader role itself - if it helps here then we should do that first.

Zelldon · 2021-02-17T12:33:45Z

@deepthidevaki do you have time tomorrow to discuss this?

deepthidevaki · 2021-02-17T14:12:32Z

Yes.

6427: [Backport 0.25] Use always new appender r=Zelldon a=Zelldon

## Description

Backports #6392

> Previously, if the leader close transition took too long and the broker became leader again, it could happen that the new appender was used by the old log storage appender. This is now prevented by not using a supplier. It requests the appender once on leader transition and always uses the same object.
>
> @deepthidevaki you mentioned that you want to prevent stuff based on different term. I think it is already something checked in the ZeebePartition regarding that. If this is not enough for you we can create a follow up issue.

There were some changes necessary since here the journal changes are not available, which means we need to use the ZeebeIndexMapping etc.

## Related issues

closes #6346

## Definition of Done

_Not all items need to be done depending on the issue and the pull request._

Code changes:
* [ ] The changes are backwards compatible with previous versions
* [ ] If it fixes a bug then PRs are created to [backport](https://github.com/zeebe-io/zeebe/compare/stable/0.24...develop?expand=1&template=backport_template.md&title=[Backport%200.24]) the fix to the last two minor versions. You can trigger a backport by assigning labels (e.g. `backport stable/0.25`) to the PR, in case that fails you need to create backports manually.

Testing:
* [ ] There are unit/integration tests that verify all acceptance criteria of the issue
* [ ] New tests are written to ensure backwards compatibility with further versions
* [ ] The behavior is tested manually
* [ ] The change has been verified by a QA run
* [ ] The impact of the changes is verified by a benchmark

Documentation:
* [ ] The documentation is updated (e.g. BPMN reference, configuration, examples, get-started guides, etc.)
* [ ] New content is added to the [release announcement](https://drive.google.com/drive/u/0/folders/1DTIeswnEEq-NggJ25rm2BsDjcCQpDape)

Co-authored-by: Christopher Zell <zelldon91@googlemail.com>

6426: [Backport 0.26] Use always new appender r=Zelldon a=Zelldon

## Description

Backports #6392

> Previously, if the leader close transition took too long and the broker became leader again, it could happen that the new appender was used by the old log storage appender. This is now prevented by not using a supplier. It requests the appender once on leader transition and always uses the same object.
>
> @deepthidevaki you mentioned that you want to prevent stuff based on different term. I think it is already something checked in the ZeebePartition regarding that. If this is not enough for you we can create a follow up issue.

There were some changes necessary since here the journal changes are not available, which means we need to use the ZeebeIndexMapping etc.

## Related issues

closes #6346

## Definition of Done

_Not all items need to be done depending on the issue and the pull request._

Code changes:
* [ ] The changes are backwards compatible with previous versions
* [ ] If it fixes a bug then PRs are created to [backport](https://github.com/zeebe-io/zeebe/compare/stable/0.24...develop?expand=1&template=backport_template.md&title=[Backport%200.24]) the fix to the last two minor versions. You can trigger a backport by assigning labels (e.g. `backport stable/0.25`) to the PR, in case that fails you need to create backports manually.

Testing:
* [ ] There are unit/integration tests that verify all acceptance criteria of the issue
* [ ] New tests are written to ensure backwards compatibility with further versions
* [ ] The behavior is tested manually
* [ ] The change has been verified by a QA run
* [ ] The impact of the changes is verified by a benchmark

Documentation:
* [ ] The documentation is updated (e.g. BPMN reference, configuration, examples, get-started guides, etc.)
* [ ] New content is added to the [release announcement](https://drive.google.com/drive/u/0/folders/1DTIeswnEEq-NggJ25rm2BsDjcCQpDape)

Co-authored-by: Christopher Zell <zelldon91@googlemail.com>

deepthidevaki added the kind/bug (Categorizes an issue or PR as a bug) label Feb 16, 2021

github-actions bot added the Status: Needs Triage label Feb 16, 2021

deepthidevaki added severity/low (Marks a bug as having little to no noticeable impact for the user), Impact: Data, scope/broker (Marks an issue or PR to appear in the broker section of the changelog) and removed Status: Needs Triage labels Feb 16, 2021

npepinpe added severity/high (Marks a bug as having a noticeable impact on the user with no known workaround), Status: Ready and removed severity/low (Marks a bug as having little to no noticeable impact for the user), Status: Needs Priority labels Feb 16, 2021

Zelldon self-assigned this Feb 17, 2021

Zelldon mentioned this issue Feb 19, 2021

ZeebePartition doesn't handle exceptions properly #6391

Closed

Zelldon added Status: In Progress and removed Status: Ready labels Feb 19, 2021

Zelldon mentioned this issue Feb 19, 2021

Use always new appender #6392

Merged

9 tasks

Zelldon added Status: Needs Review and removed Status: In Progress labels Feb 19, 2021

ghost closed this as completed in 5871e47 Feb 23, 2021

ghost closed this as completed in #6392 Feb 23, 2021

This was referenced Feb 23, 2021

[Backport 0.26] Use always new appender #6426

Merged

[Backport 0.25] Use always new appender #6427

Merged

npepinpe added the Release: 1.0.0-alpha2 label Mar 1, 2021

npepinpe added the Release: 0.26.2 label Mar 30, 2021

npepinpe added the Release: 1.0.0 label May 10, 2021

This issue was closed.
