
Processing requests in order in Kafka layer #1496

Merged: 4 commits merged on Jun 15, 2021

Conversation

@mmaslankaprv (Member) commented May 28, 2021

Cover letter

Changed the way we process requests in Redpanda's Kafka protocol implementation. Previously we processed requests in the background, which allowed multiple requests to be handled at a time but could cause issues, since requests were not guaranteed to be processed in the same order as they were received. Kafka requests are now handled in the foreground, so we normally process one request at a time per connection, in exactly the same order as the requests were received.

Produce and offset commit requests are treated differently, as they leverage two-phase processing. The request dispatch is processed in the foreground; after this first phase, the request processing order is guaranteed. The second phase is then processed in the background. This approach allows us to process multiple produce and offset commit requests at a time without compromising ordering.
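Below is a minimal sketch of that pattern, with illustrative names (request_stages, process, pending) rather than the actual Redpanda types; it is an assumption-laden outline, not the implementation. Each request yields two futures: the connection waits only on the dispatch future before handling the next request, while the completion future is tracked in a gate and finishes in the background.

#include <seastar/core/future.hh>
#include <seastar/core/gate.hh>

namespace ss = seastar;

// Two futures per request: `dispatched` resolves once the request's position
// in the processing order is fixed, `completed` once the request is fully done.
struct request_stages {
    ss::future<> dispatched;
    ss::future<> completed;
};

// Handle a single request on a connection: block further requests only until
// the dispatch phase resolves, then let the completion phase run concurrently.
ss::future<> process(request_stages stages, ss::gate& pending) {
    auto dispatched = std::move(stages.dispatched);
    auto completed = std::move(stages.completed);
    return dispatched.then(
      [&pending, completed = std::move(completed)]() mutable {
          // Track the background completion in a gate so the connection can
          // wait for all in-flight requests when it shuts down (error handling
          // of the background future is omitted for brevity).
          (void)ss::with_gate(pending, [completed = std::move(completed)]() mutable {
              return std::move(completed);
          });
      });
}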

Fixes: N/A

Release notes

  • Full compatibility with Kafka with regard to request ordering.

@mmaslankaprv force-pushed the queue-depth-one branch 3 times, most recently from 69fc92f to c120c50, on June 1, 2021 07:01
@mmaslankaprv changed the title from "Queue depth one" to "Processing requests in order in Kafka layer" on Jun 9, 2021
@mmaslankaprv marked this pull request as ready for review on June 9, 2021 08:23
@mmaslankaprv requested a review from a team as a code owner on June 9, 2021 08:23
@mmaslankaprv requested reviews from Lazin, dotnwat, rystsov and BenPope and removed the request for a team on June 9, 2021 08:23
@dotnwat (Member) left a comment:

This is awesome and I think it's really close to going in. The only really important things in my feedback are: (1) the setting of the exception that crosses cores (see the produce.cc feedback); I'm not sure it's an issue but it's worth looking at closely. (2) It seems like in all of the cases where the cross-core promise is set, that cross-core traffic could be done in the background (with appropriate tracking via gates), but (2) is probably a future optimization. (3) Can we add a comment to group_commit / produce about how the cross-core promise works (potentially removing the foreign pointer) and stating what the rules are for interacting with that pointer/promise?

Resolved review threads: src/v/kafka/server/requests.cc (2), src/v/kafka/server/connection_context.cc (2), src/v/kafka/server/handlers/produce.cc (3)
Inline comment on the following snippet from src/v/kafka/server/handlers/produce.cc:

auto dispatch = ss::make_foreign<std::unique_ptr<ss::promise<>>>(
  std::make_unique<ss::promise<>>());
Member:

I'm not sure a foreign pointer is necessary here, since you explicitly handle the cross-core signaling yourself; maybe consider using a normal unique_ptr? In either case, I do think there should be a short comment here about how the cross-core promise signaling works.

Member (Author):

I previously used std::unique_ptr; the problem is that the lambda is handled on a foreign core, so deletion of the lambda captures happens on the remote core, and that caused segmentation faults.

Member:

It sounds like the segmentation fault is a race condition? Normal memory can be freed on a remote core without a foreign pointer, which is only an optimization.

Member (Author):

It is indeed a race condition; using std::unique_ptr across shards isn't safe here. The logic inside the std::unique_ptr is triggered twice, and that may lead to use-after-free situations.

Member:

Can you explain? AFAICT the foreign pointer should be an optimization, unnecessary for correctness. Thus, it sounds like there is a bug.

Member:

Summary from an out-of-band conversation:

Hypothesis: the unique pointer is actually destroyed on the destination core (not the source core where the promise is created/set). In this case the promise destructor detects the issue and has a problem. The foreign pointer masks that, doing double duty as both an optimization and a correctness mechanism. I think that in this case it is less of an optimization, since we are adding additional round-trips.

In submit_to(source_core, ...), when we set the promise value, it seems like we could at that moment also take care of resetting the unique_ptr and destroying the promise on the correct core (basically the same thing the foreign pointer would be doing).
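For reference, here is a hedged sketch of the cross-core promise pattern being discussed (simplified, with illustrative names; not the actual produce.cc code). The promise is created and owned on the source shard, the dispatch work runs on the target shard, and the promise is resolved, and its foreign_ptr wrapper destroyed, back on the source shard via submit_to, so the foreign_ptr only has to guard the paths where the wrapper dies on the remote shard.

#include <seastar/core/future.hh>
#include <seastar/core/sharded.hh>
#include <seastar/core/smp.hh>
#include <memory>

namespace ss = seastar;

// Returns a future that resolves once the work has been dispatched on the
// target shard; the promise behind it is owned by, and resolved on, the
// source shard.
ss::future<> dispatch_on(ss::shard_id target) {
    auto source_shard = ss::this_shard_id();
    auto dispatch = ss::make_foreign(std::make_unique<ss::promise<>>());
    auto dispatched = dispatch->get_future();

    (void)ss::smp::submit_to(
      target, [source_shard, dispatch = std::move(dispatch)]() mutable {
          // ... per-shard dispatch work would happen here ...
          // Hop back to the owner shard to set the value; the foreign_ptr is
          // also destroyed there, so no extra cross-core destruction is needed.
          return ss::smp::submit_to(
            source_shard, [dispatch = std::move(dispatch)]() mutable {
                dispatch->set_value();
            });
      });

    return dispatched;
}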

Member:

I tried this and I didn't have any issues, but I'm not really sure how to reproduce the issue you were seeing when the foreign pointer wasn't being used.

Also, after res.dispatched completes, can we also background the promise sets (not just in the error cases)?

diff --git a/src/v/kafka/server/handlers/produce.cc b/src/v/kafka/server/handlers/produce.cc
index 8e0495caa..34aa9879a 100644
--- a/src/v/kafka/server/handlers/produce.cc
+++ b/src/v/kafka/server/handlers/produce.cc
@@ -237,7 +237,7 @@ static partition_produce_stages produce_topic_partition(
     auto reader = reader_from_lcore_batch(std::move(batch));
     auto start = std::chrono::steady_clock::now();
 
-    auto dispatch = ss::make_foreign<std::unique_ptr<ss::promise<>>>(
+    auto dispatch = std::unique_ptr<ss::promise<>>(
       std::make_unique<ss::promise<>>());
     auto dispatch_f = dispatch->get_future();
     auto f
@@ -259,6 +259,7 @@ static partition_produce_stages produce_topic_partition(
                     (void)ss::smp::submit_to(
                       source_shard, [dispatch = std::move(dispatch)]() mutable {
                           dispatch->set_value();
+                          dispatch.reset();
                       });
                     return ss::make_ready_future<produce_response::partition>(
                       produce_response::partition{
@@ -270,6 +271,7 @@ static partition_produce_stages produce_topic_partition(
                     (void)ss::smp::submit_to(
                       source_shard, [dispatch = std::move(dispatch)]() mutable {
                           dispatch->set_value();
+                          dispatch.reset();
                       });
                     return ss::make_ready_future<produce_response::partition>(
                       produce_response::partition{
@@ -288,18 +290,22 @@ static partition_produce_stages produce_topic_partition(
                   .then_wrapped([source_shard, dispatch = std::move(dispatch)](
                                   ss::future<> f) mutable {
                       if (f.failed()) {
-                          return ss::smp::submit_to(
+                          (void)ss::smp::submit_to(
                             source_shard,
                             [dispatch = std::move(dispatch),
                              e = f.get_exception()]() mutable {
                                 dispatch->set_exception(e);
+                                dispatch.reset();
                             });
+                          return ss::now();
                       }
-                      return ss::smp::submit_to(
+                      (void)ss::smp::submit_to(
                         source_shard,
                         [dispatch = std::move(dispatch)]() mutable {
                             dispatch->set_value();
+                            dispatch.reset();
                         });
+                      return ss::now();
                   })
                   .then([f = std::move(stages.produced)]() mutable {
                       return std::move(f);

Member (Author):

I was seeing an error after this code was deployed to a cluster and had executed for a while. I am going to check this.

Member:

I guess it is reasonable to use the foreign pointer as an RAII tool for cases like this, where the promise being moved around will panic if destroyed on the other core. But it was good to get to the bottom of the reasoning behind things being fragile.

Looking again, I think the right solution is to use the foreign pointer (for the RAII protection on exception paths), but to do the explicit reset on the source core to avoid the extra cross-core round-trip. That extra round-trip should be avoided in the foreign pointer destructor if it finds that the pointer has already been reset.

Resolved review thread: src/v/kafka/server/handlers/produce.cc
Commits

Introduced a type aggregating the futures representing the two stages of Kafka request processing. This way a request handler can decide which part of the processing should be executed in the foreground (blocking other requests from being handled) and which can be executed in the background, asynchronously to the processing of other requests.

Signed-off-by: Michal Maslanka <michal@vectorized.io>
Split the handling of a Kafka request into two stages. The dispatch stage is executed in the foreground while the second stage is executed in the background. This way we can leverage the fact that the request processing order is established before processing has completely finished, and handle multiple requests at a time without compromising correct ordering.

Signed-off-by: Michal Maslanka <michal@vectorized.io>
Implemented two-stage handling of the produce request. The two phases of produce request processing are reflected in the two phases of `cluster::partition::replicate`; this way Redpanda can handle multiple requests per connection while still not changing the request processing order.

Signed-off-by: Michal Maslanka <michal@vectorized.io>
The offsets commit handler uses raft to replicate offset commit requests. It leverages raft's two-stage replicate processing to handle multiple in-flight offset commit requests and prevent contention.

Signed-off-by: Michal Maslanka <michal@vectorized.io>
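Both the produce and offset commit commits rely on the same shape: the underlying replication layer exposes its own two phases, and the handler forwards the first phase as its dispatch stage while the second phase feeds the response. A rough sketch under assumed names (replicate_stages, request_enqueued, and replicate_finished are illustrative here, not necessarily the real raft/partition API):

#include <seastar/core/future.hh>

namespace ss = seastar;

// Assumed shape of the replication layer's result (illustrative names).
struct replicate_stages {
    ss::future<> request_enqueued;       // resolves once the entry's order is fixed
    ss::future<bool> replicate_finished; // resolves once replication completes
};

// The handler surfaces the same two stages to the connection layer: the first
// future is awaited in the foreground, the second becomes the response.
struct handler_stages {
    ss::future<> dispatched;
    ss::future<bool> response;
};

handler_stages make_handler_stages(replicate_stages stages) {
    return handler_stages{
      std::move(stages.request_enqueued),
      std::move(stages.replicate_finished).then([](bool success) {
          // In the real handlers this is where replication outcomes are
          // translated into protocol-level error codes.
          return success;
      })};
}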