Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

KAFKA-14275; KRaft Controllers should crash after failing to apply any metadata record (#12709) #12733

Closed

Conversation

niket-goel
Copy link
Contributor

@niket-goel niket-goel commented Oct 12, 2022

Make all faults in metadata processing on standby controllers be fatal. This is the same behavior-wise as the active controller. This prevents a standby controller from eventually becoming active with incomplete state.

Reviewers: Colin Patrick McCabe cmccabe@apache.org, Jason Gustafson jason@confluent.io
(cherry picked from commit 98a3dcb)

Conflicts were resolved in the following files around availability of Builders for some test classes:
metadata/src/test/java/org/apache/kafka/controller/QuorumControllerTest.java
metadata/src/test/java/org/apache/kafka/controller/QuorumControllerTestEnv.java

…y metadata record (apache#12709)

Make all faults in metadata processing on standby controllers be fatal. This is the same behavior-wise as the active controller. This prevents a standby controller from eventually becoming active with incomplete state.

Reviewers: Colin Patrick McCabe <cmccabe@apache.org>, Jason Gustafson <jason@confluent.io>
(cherry picked from commit 98a3dcb)
@niket-goel
Copy link
Contributor Author

Interesting test failures on this. All are in related code, but not if sure if there are flakes or actual failures. Will run some local tests to verify.

Build / JDK 8 and Scala 2.12 / testDescribeQuorumRequestToBrokers() – kafka.server.KRaftClusterTest 
(need to understand - failing assertion around return value)

Build / JDK 17 and Scala 2.13 / testIncrementalAlterConfigs() – kafka.server.KRaftClusterTest 
(Could be infrastructural - failed to startup due to heap space)

Build / JDK 17 and Scala 2.13 / testConfigurationOperations() – org.apache.kafka.controller.QuorumControllerTest
(Could be a flake - renouncing from a higher epoch than expected)

@niket-goel
Copy link
Contributor Author

All failing tests seem to have failed due to general flakiness.

@niket-goel
Copy link
Contributor Author

PS submitted #12733 to try to address one of the Flaky tests above.

Copy link
Contributor

@hachikuji hachikuji left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM. Looks like there are some conflicts to resolve.

@hachikuji
Copy link
Contributor

Closing since it looks like @cmccabe already picked this.

@hachikuji hachikuji closed this Oct 13, 2022
@niket-goel niket-goel deleted the kafka-14275-3.3-cherry-pick branch October 13, 2022 21:03
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
2 participants