Prevent isolated/decommissioned nodes from handling Kafka API requests #7829

Conversation

@VadimPlh VadimPlh commented Dec 19, 2022

Ideas

The main ideas of this PR:

  • Add a new state for a Redpanda node, isolated, and define how the Redpanda process should "discover" that it is isolated
  • Create a way to tell clients "Do not communicate with me; send your requests to another node"

We need to signal to clients that the node can no longer accept requests. The simplest way to do this is through the Kafka protocol itself: clients send metadata requests to learn about the cluster, and we do not want isolated nodes to return stale information, so we can signal isolation through the metadata response.
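As a minimal sketch, the metadata-response signalling could look like this. The `broker` and `metadata_response` types here are hypothetical simplifications; the real Redpanda code under src/v/kafka differs in detail.

```cpp
#include <algorithm>
#include <cstdint>
#include <vector>

// Hypothetical, simplified model of a Kafka metadata response; only the
// fields relevant to the isolation-signalling scheme are shown.
struct broker {
    int32_t node_id;
};

struct metadata_response {
    std::vector<broker> brokers;
    int32_t controller_id;
};

// If the serving node believes it is isolated, strip it from the broker
// list and report "no known controller" (-1, per the Kafka protocol) so
// clients retry their metadata request against another broker.
metadata_response
adjust_for_isolation(metadata_response r, int32_t self, bool isolated) {
    if (!isolated) {
        return r;
    }
    r.brokers.erase(
      std::remove_if(
        r.brokers.begin(),
        r.brokers.end(),
        [self](const broker& b) { return b.node_id == self; }),
      r.brokers.end());
    r.controller_id = -1;
    return r;
}
```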

For now we have several kinds of liveness signals between nodes:

  • Health_monitor
  • Node_status_table
  • Append_entries requests from the controller leader

To begin with, a node can conclude that it is isolated when all of these go quiet: no Health_monitor communication with other nodes, no append_entries requests from the controller leader, and no incoming Raft heartbeats. Once the node decides it is isolated, it signals clients to reconnect to another node.
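The combined check might be sketched like this. The type and field names are assumptions made for illustration, not the actual implementation:

```cpp
#include <chrono>

// Illustrative sketch of the detection rule: a node treats itself as
// isolated only when every liveness signal is stale at once.
using isolation_clock = std::chrono::steady_clock;

struct isolation_inputs {
    isolation_clock::time_point last_health_report;    // health_monitor traffic
    isolation_clock::time_point last_status_heartbeat; // node_status_table pings
    bool controller_leader_known; // append_entries seen from controller leader
};

bool is_isolated(
  const isolation_inputs& in,
  isolation_clock::time_point now,
  isolation_clock::duration timeout) {
    if (in.controller_leader_known) {
        // Hearing from the controller leader proves connectivity.
        return false;
    }
    return (now - in.last_health_report) > timeout
           && (now - in.last_status_heartbeat) > timeout;
}
```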

RFC
Ticket

Backports Required

  • none - not a bug fix
  • none - issue does not exist in previous branches
  • none - papercut/not impactful enough to backport
  • v22.3.x
  • v22.2.x
  • v22.1.x

UX Changes

  • none

Release Notes

Features

  • Redpanda nodes now automatically decline Kafka API requests when they detect that they are isolated from the cluster
  • New cluster configuration properties control node isolation detection: node_isolation_raft_timeout and node_isolation_heartbeat_timeout. Both default to 3 seconds.
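For example, assuming the standard `rpk cluster config set` workflow, the new properties could be tuned like so. The values below are hypothetical, and the expected units are an assumption; check the properties' documentation (e.g. via `rpk cluster config get`) before applying.

```shell
# Hypothetical example: raising both isolation-detection timeouts.
# Property names come from this PR's release notes; verify units first.
rpk cluster config set node_isolation_raft_timeout 5000
rpk cluster config set node_isolation_heartbeat_timeout 5000
```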

@VadimPlh VadimPlh force-pushed the issue-271-part1-change-metadata-responce branch 5 times, most recently from f6cc7c7 to 243796a on December 21, 2022 13:13
@VadimPlh VadimPlh force-pushed the issue-271-part1-change-metadata-responce branch from 243796a to 5c0ad4e on December 21, 2022 14:12
@VadimPlh VadimPlh force-pushed the issue-271-part1-change-metadata-responce branch 3 times, most recently from db3485a to 3a6d3a1 on January 3, 2023 17:56
@VadimPlh VadimPlh marked this pull request as ready for review January 3, 2023 17:56
@VadimPlh VadimPlh force-pushed the issue-271-part1-change-metadata-responce branch 2 times, most recently from 0ee43de to 6ae1bc4 on January 4, 2023 14:31
@VadimPlh VadimPlh force-pushed the issue-271-part1-change-metadata-responce branch 10 times, most recently from b498256 to 6232d0d on January 17, 2023 18:42
@VadimPlh VadimPlh requested a review from jcsp January 17, 2023 18:47
@VadimPlh VadimPlh force-pushed the issue-271-part1-change-metadata-responce branch from 6232d0d to 60e933f on January 18, 2023 18:09
@VadimPlh
Contributor Author

Please resolve conflicts

resolved

mmaslankaprv previously approved these changes Jan 23, 2023
@mmaslankaprv
Member

/ci-repeat 5

@redpanda-data redpanda-data deleted a comment from mmaslankaprv Jan 23, 2023
@redpanda-data redpanda-data deleted a comment from mmaslankaprv Jan 23, 2023
@andrewhsu
Member

@mmaslankaprv @VadimPlh i have a feeling tests/rptest/tests/tests/rptest/tests/isolated_decommissioned_node_test.py should instead be tests/rptest/tests/isolated_decommissioned_node_test.py

@mmaslankaprv
Member

mmaslankaprv commented Jan 23, 2023

@mmaslankaprv @VadimPlh i have a feeling tests/rptest/tests/tests/rptest/tests/isolated_decommissioned_node_test.py should instead be tests/rptest/tests/isolated_decommissioned_node_test.py

@andrewhsu thank you, sorry for this

@mmaslankaprv
Member

/ci-repeat 10 skip-units dt-repeat=32 tests/rptest/tests/isolated_decommissioned_node_test.py

@andrewhsu
Member

andrewhsu commented Jan 23, 2023

sorry, i noticed another typo in the repeat command (based on the syntax defined in #940), so i kicked off a draft PR #8368, but it has the same git commit, so it replaced this PR's test jobs.

i'll close the draft PR #8368 and kick off another repeat command as intended in the comment above, but with the typo fixed.

@andrewhsu
Member

/ci-repeat 10
dt-repeat=32
skip-unit
tests/isolated_decommissioned_node_test.py

@andrewhsu
Member

i've created another draft PR #8369 that runs the ci-repeat command with the full ducktape test path in case specifying the short path does not work: #8369 (comment)

@andrewhsu
Member

andrewhsu commented Jan 23, 2023

@VadimPlh @mmaslankaprv fyi the buildkite job went green (from draft PR #8369 (comment)):
https://buildkite.com/redpanda/redpanda/builds/21696

that used this PR's codebase (plus a dummy commit)

If a node knows about the controller leader,
it means the node is not isolated.

New setting: node_isolation_heartbeat_timeout.
How long after the last heartbeat request
a node will wait before considering itself isolated.

Metadata_cache now contains a bool flag signalling
whether the node is isolated. The flag is
updated by a new sharded service and cached inside
the metadata cache.

Return the list of all nodes in the cluster.

A new service communicates with health_monitor and
node_status_table to determine whether the node is isolated.
For now we have 3 different signals about node
communication:
* Health_monitor
* node status table
* raft0 has a leader

If none of them have been updated for a long time,
the node may be isolated.

If a node is isolated or decommissioned, it cannot handle Kafka requests
from clients, so in this case we need to signal clients to communicate
with another broker. For this we exclude the isolated node from the
brokers list and return -1 for controller_id; after that the client will
send a metadata request to another broker and communicate with it.

Also, we cannot report an isolated node as the leader for a partition.
To prevent clients from getting stuck, we add a fake leader to force
the client to connect to another broker.
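The fake-leader idea from the commit message above could be sketched as follows. The `partition_md` type and the policy of substituting the first available live broker are illustrative assumptions, not the actual Redpanda code:

```cpp
#include <cstdint>
#include <vector>

// Minimal stand-in for per-partition metadata: only the leader field.
struct partition_md {
    int32_t leader_id;
};

// For partitions whose reported leader is the isolated node itself,
// advertise some other live broker as leader so clients reconnect
// instead of stalling on an unreachable node.
void mask_isolated_leader(
  std::vector<partition_md>& partitions,
  int32_t self,
  const std::vector<int32_t>& other_brokers) {
    if (other_brokers.empty()) {
        return; // nobody else to point clients at
    }
    for (auto& p : partitions) {
        if (p.leader_id == self) {
            // Any live broker works: the client's request there is
            // answered with fresh metadata naming the real leader.
            p.leader_id = other_brokers.front();
        }
    }
}
```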