Partition stays in force configuration after force failover #17334
Labels
component/zeebe
Related to the Zeebe component/team
kind/bug
Categorizes an issue or PR as a bug
severity/mid
Marks a bug as having a noticeable impact but with a known workaround
version:8.5.1
Marks an issue as being completely or in parts released in 8.5.1
version:8.6.0-alpha1
Label that represents issues released on version 8.6.0-alpha1
In a failed e2e multi-region failover test, failback took a long time because a partition did not come out of force configuration for almost 1 hour. The operation eventually succeeded, but the test failed because of the timeout. It is not expected to take this long: coming out of force configuration should complete within seconds after the force configuration has succeeded.
Example of such a run: https://camunda.slack.com/archives/C013MEVQ4M9/p1711523599459309
We can see the following error repeated for almost 1 hour. Eventually the operation succeeds (probably after a leader change).
Related to #16126