tx_migration: avoid ping pong of requests between brokers #15953

Merged
merged 2 commits into redpanda-data:dev from tx_migration_fix_alloc on Jan 6, 2024

Conversation

bharathv
Contributor

@bharathv bharathv commented Jan 4, 2024

When the leader table is stale (for example at startup or during failures), the
current code can result in a ping-pong of requests between two brokers
in a tight loop.

Example

  • tx_migration_replicate is dispatched from node 1 to node 2 (because
    node 1 thinks node 2 is the leader).
  • node 2 dispatches the request back to node 1 (because it thinks node 1
    is the leader).

Until leadership stabilizes, this results in a huge pile-up of requests,
which manifested as an oversized allocation.

I don't think a router is the right choice in the handler, as the handler
is supposed to process the request locally. If the handler returns an error,
it is propagated to the source router, which dispatches to the correct
leader. This also breaks the ping-pong loop, because the source router
sleeps for a retry backoff between attempts.
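
For illustration, here is a minimal sketch of that control flow. The names (`process_locally`, `route_with_backoff`, `errc`) are hypothetical stand-ins, not the actual Redpanda classes; it only shows the shape of the fix: the handler processes locally and reports `not_leader` instead of re-dispatching, while the originating router owns the retries and the backoff sleeps.

```cpp
// Sketch only; illustrative names, not the real tx_migration code.
#include <chrono>
#include <functional>
#include <thread>

enum class errc { success, not_leader };

// Handler side: never re-dispatches. A broker with a stale leader table can
// no longer bounce the request straight back to the sender.
errc process_locally(bool am_leader) {
    if (!am_leader) {
        return errc::not_leader; // propagate; the source router decides what to do
    }
    // ... apply tx_migration_replicate on the local coordinator partition ...
    return errc::success;
}

// Source router side: the only place that retries, and it sleeps between
// attempts, so even if two brokers disagree about leadership the loop is
// rate limited.
errc route_with_backoff(std::function<errc()> send_to_presumed_leader,
                        int max_retries = 5) {
    using namespace std::chrono_literals;
    auto backoff = 100ms;
    for (int i = 0; i < max_retries; ++i) {
        auto ec = send_to_presumed_leader();
        if (ec != errc::not_leader) {
            return ec;
        }
        std::this_thread::sleep_for(backoff); // real code uses an async sleep
        backoff *= 2;                         // refresh leader hint, then retry
    }
    return errc::not_leader;
}
```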

Fixes: #15901

Backports Required

  • none - not a bug fix
  • none - this is a backport
  • none - issue does not exist in previous branches
  • none - papercut/not impactful enough to backport
  • v23.3.x
  • v23.2.x
  • v23.1.x

Release Notes

  • none

@vbotbuildovich
Collaborator

new failures in https://buildkite.com/redpanda/redpanda/builds/43450#018cd6dd-f8b9-42e9-8821-73b33cbb5da4:

"rptest.tests.internal_topic_protection_test.InternalTopicProtectionLargeClusterTest.test_schemas_topic"

@bharathv
Contributor Author

bharathv commented Jan 5, 2024

/ci-repeat 1
dt-repeat=20
skip-units
skip-redpanda-build
tests/rptest/tests/tx_coordinator_migration_test.py::TxCoordinatorMigrationTest.test_migrating_tx_manager_coordinator

@bharathv
Contributor Author

bharathv commented Jan 5, 2024

The failure is a known issue: #15944

@piyushredpanda piyushredpanda merged commit 8174bc1 into redpanda-data:dev Jan 6, 2024
18 of 20 checks passed
@vbotbuildovich
Collaborator

/backport v23.3.x

@bharathv bharathv deleted the tx_migration_fix_alloc branch January 8, 2024 16:13
nvartolomei added a commit to nvartolomei/redpanda that referenced this pull request Apr 16, 2024
A tight forward-to-leader loop was discovered in a test where
metadata about the leader is out of date:
redpanda-data#17873.

Instead, we remove the forwarding from the request handler and do it
only once on the original invoker. In
`id_allocator_frontend::allocate_id` we call
`allocate_router::process_or_dispatch`, which does the redirect and
retries if the target node returns an error or does not respond. It also
has backoff built in.

This fix is very similar to the one described in
redpanda-data#15953.

Fixes redpanda-data#17873
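
For reference, a minimal sketch of the process_or_dispatch shape the commit message describes; the struct and member names here are assumptions for illustration, not the actual allocate_router API. The frontend calls it once, it either processes the request on the local leader or forwards it a single time, and any retry with backoff stays on this original invoker rather than in the remote handler.

```cpp
// Illustrative only; real allocate_router / id_allocator_frontend signatures differ.
enum class errc { success, not_leader, timeout };

struct router_sketch {
    bool leader_is_local() const { return local_leader; }

    // Processes on this node when it is the leader; otherwise forwards the
    // request exactly once. The remote handler never forwards again, so an
    // error (not_leader/timeout) comes straight back to this invoker, which
    // is the only place that retries (with backoff).
    errc process_or_dispatch() {
        if (leader_is_local()) {
            return process_locally();
        }
        return dispatch_to_leader();
    }

    errc process_locally() { return errc::success; }        // stand-in body
    errc dispatch_to_leader() { return errc::not_leader; }  // stand-in body

    bool local_leader = false;
};
```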
vbotbuildovich pushed a commit to vbotbuildovich/redpanda that referenced this pull request Apr 16, 2024
(cherry picked from commit f388566)
Successfully merging this pull request may close these issues.

Oversized allocation: 282624 bytes in rpc::transport::make_response_handler