
fix: concurrency bug when doing high volume pull query over java client #10075

Closed
lucasbru wants to merge 3 commits

Conversation

@lucasbru lucasbru (Member) commented Oct 2, 2023

Description

Fix for a race inside vert.x’s `RecordParser`. Two threads can call `RecordParserImpl.handleParsing` concurrently: one enters from upstream via `RecordParser.handle`, and the other enters from downstream via `ReadStream.resume`. Once our `PullQueryWriteStream` is half empty, we call the drain handler, which calls `RecordParser.resume`, which synchronously starts parsing new records from the buffer. `RecordParser.handleParsing` is not thread-safe.

The root cause is that the monitor inside `PullQueryWriteStream` is insufficient to protect against races inside the `RecordParser`. This fix uses a separate monitor (the object monitor of the HTTP response object) to prevent races inside the `RecordParser`. To implement the fix, we define a shim wrapper around `RecordParser` that acquires the monitor on both `handle` and `resume`.
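For illustration, a minimal sketch of the shim idea (class and lock names are illustrative, not the actual code in this PR; in the PR the lock is the HTTP response object's monitor):

import io.vertx.core.Handler;
import io.vertx.core.buffer.Buffer;
import io.vertx.core.parsetools.RecordParser;

// Sketch only: serialize the two paths into RecordParserImpl.handleParsing on a
// shared lock, so upstream delivery (handle) and downstream resume cannot run
// the parser concurrently.
final class SynchronizedRecordParserShim implements Handler<Buffer> {

  private final RecordParser delegate;
  private final Object lock;

  SynchronizedRecordParserShim(final RecordParser delegate, final Object lock) {
    this.delegate = delegate;
    this.lock = lock;
  }

  @Override
  public void handle(final Buffer buffer) {
    synchronized (lock) {
      delegate.handle(buffer); // upstream path into handleParsing
    }
  }

  public void resume() {
    synchronized (lock) {
      delegate.resume(); // downstream path into handleParsing
    }
  }
}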

In the future, we may want to fix this by changing the way the drain handler is called.

Testing done

No behavior changes in existing integration tests. Tested locally.

Reviewer checklist

  • Ensure docs are updated if necessary. (eg. if a user visible feature is being added or changed).
  • Ensure relevant issues are linked (description should include text like "Fixes #")
  • Do these changes have compatibility implications for rollback? If so, ensure that the ksql command version is bumped.

@lucasbru lucasbru requested a review from a team as a code owner October 2, 2023 12:18
@cla-assistant cla-assistant bot commented Oct 2, 2023

CLA assistant check
Thank you for your submission! We really appreciate it. Like many open source projects, we ask that you sign our Contributor License Agreement before we can accept your contribution.
You have signed the CLA already but the status is still pending? Let us recheck it.


@cadonna cadonna (Member) left a comment

@lucasbru Thanks for the fix!

Here is my feedback!

} finally {
  monitor.leave();
}
if (isDone() || size() <= queueCapacity / 2) {
Member

Why did you remove the monitor guard?

Member Author

First, it's not clear that we need it: the drain handler doesn't modify the state of the WriteStream; it only modifies the state of the RecordParser, and the RecordParser now has a separate monitor.

More importantly, the monitor now can cause a deadlock:

  • RecordParser.handle acquires the new monitor that I introduced and subsequently calls PullQueryWriteStream.handle, which then attempts to acquire monitor.
  • PullQueryWriteStream.pollRow acquires monitor and calls RecordParser.resume, which attempts to acquire the monitor that I introduced.

So we remove the monitor acquisition here.
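The inverted lock ordering can be illustrated with a generic sketch (names are hypothetical: writeStreamLock stands for the PullQueryWriteStream monitor, parserLock for the new monitor around the RecordParser shim):

// Generic illustration of the lock-order inversion described above.
public final class LockOrderInversionDemo {

  private static final Object writeStreamLock = new Object();
  private static final Object parserLock = new Object();

  public static void main(final String[] args) {
    // Path 1: RecordParser.handle -> PullQueryWriteStream.handle
    final Thread handlePath = new Thread(() -> {
      synchronized (parserLock) {          // taken in the shim's handle()
        sleepQuietly();
        synchronized (writeStreamLock) {   // taken inside PullQueryWriteStream.handle
          System.out.println("handle path finished");
        }
      }
    });

    // Path 2: PullQueryWriteStream.pollRow -> RecordParser.resume
    final Thread resumePath = new Thread(() -> {
      synchronized (writeStreamLock) {     // taken in pollRow
        sleepQuietly();
        synchronized (parserLock) {        // taken in the shim's resume()
          System.out.println("resume path finished");
        }
      }
    });

    handlePath.start();
    resumePath.start();                    // with both paths running, the threads deadlock
  }

  private static void sleepQuietly() {
    try {
      Thread.sleep(100);
    } catch (final InterruptedException e) {
      Thread.currentThread().interrupt();
    }
  }
}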

Member

I believe the main original thought was that you might want to call the drain handler while the condition of it being half full was guaranteed to be true. It's true that if you are not holding a lock, the condition might no longer hold when you act on it, but the queue will never drop data, so worst case you go over the soft limit. Looking through some other examples, I don't think they hold any locks either.

Member

Below in the definition of drainHandler, can you do something like this:

@Override
public PullQueryWriteStream drainHandler(final Handler<Void> handler) {
  Context context = Vertx.currentContext();
  drainHandler.add(v -> {
    context.runOnContext(handler);
  });
  return this;
}

This might be worth trying before adding the synchronization. I believe it would also work: the existing synchronization in this class already protects the internal state from write calls and polls on different threads, and the callbacks would then always happen on the same Vertx thread.

Member Author

@cadonna and I tried running it on the context of the read stream, but failed to fix it this way. The problem was that the pull queries would not make progress, and I couldn't determine what was causing it. Today I gave this another look, and I was able to figure it out: we are calling the drain handler way too often, so we are clogging the netty event loop completely with drain handler calls. The pull queries make progress, but the netty event loops are essentially at 100% CPU just going through drain handler calls. I think the correct fix is to call the drain handler exactly once; the pipe implementation will re-register its drain handler when the WriteStream reaches capacity. @AlanConfluent could you have a look at the alternative fix here: #10077
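As a hypothetical sketch of the "notify each drain handler exactly once" idea (illustrative only, not necessarily what #10077 implements; all names here are invented):

import io.vertx.core.Handler;
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.CopyOnWriteArrayList;

// Hypothetical: each registered drain handler is notified at most once and then
// dropped, so the event loop is not flooded with repeated callbacks. The vert.x
// pipe re-registers its drain handler the next time the write queue reports full.
final class FireOnceDrainHandlers {

  private final List<Handler<Void>> drainHandlers = new CopyOnWriteArrayList<>();

  void register(final Handler<Void> handler) {
    drainHandlers.add(handler);
  }

  void notifyDrained() {
    final List<Handler<Void>> toNotify = new ArrayList<>(drainHandlers);
    drainHandlers.clear();
    toNotify.forEach(h -> h.handle(null));
  }
}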

Comment on lines +211 to 213
if (isDone() || size() <= queueCapacity / 2) {
  drainHandler.forEach(h -> h.handle(null));
}
Member

I think the drain handler is called too often in the original code as well. As far as I understand, the drain handler should be called to resume the source that is piped into the write stream (i.e. the http response) after it has been paused. Just because the size of the queue is less than half its capacity does not mean the http response has been paused; it could also be that the response receives content more slowly.
During our investigation, I tried to call the drain handler only when the http response had been paused before, and that alone reduced the error rate significantly, though it did not solve the issue.
What do you think of also reducing the calls to the drain handler?
Probably, we should not do it in this PR since for testing your fix we want to hit the issue often. Maybe let's consider it an optimization that we might or might not add afterwards.

@lucasbru lucasbru (Member Author) Oct 4, 2023

Yes, the whole way the drain handler is used is quite mysterious to me: it's called too often, it's protected by a monitor whose purpose is not clear, and it's called from a different thread (other WriteStreams call it from write, so basically from the same context).

I think you are right that it should be called less often; however, this PR is meant to be the minimum change necessary. It's a good idea to try that in a separate PR.

@cadonna cadonna (Member) commented Oct 4, 2023

There are some build errors.

@AlanConfluent AlanConfluent (Member) commented

Fix for a race inside vert.x’s RecordParser. Two threads are hitting RecordParserImpl.handleParsing, one is entering from upstream via RecordParser.handle, and one thread is coming from the downstream via ReadStream.resume. Once our PullQueryWriteStream is half empty, we call the drain handler, which calls RecordParser.resume, which synchronously starts parsing new records from the buffer. RecordParser.handleParsing is not thread-safe.

This is an interesting insight. It occurs to me that many of these pieces like RecordParser assume they are being called from the same vertx thread as both the things producing to it and consuming from it. If you look at the callback within the http response handler, they do exactly this: https://github.com/eclipse-vertx/vert.x/blob/master/src/main/java/io/vertx/core/http/impl/Http1xServerResponse.java#L542

I think we've broken that model here, as you've pointed out. There's the Vertx thread, which is writing to the RecordParser, and then the pull query thread, which is pulling from the queue (and invoking the drain callback and calling resume). One solution would be to just invoke the handler using the same Vertx thread that registers the drain handler. That might be simpler than adding synchronization.

@Override
public RecordParser resume() {
  synchronized (source) {
Member

This ensures it's synchronized for anything sharing the same response object, but doesn't prevent concurrent calls with handle. Are you trying to protect the state in the RecordParser delegate between calls to resume and handle from different threads? If so, I would think they should both use delegate (or even source) as the lock, not different ones.

Member

Also, I'm slightly worried about other internal code paths in RecordParser that could access its state concurrently with these synchronized calls. It seems like you would have to audit the delegate implementation to know which methods require additional synchronization and potentially add it here.

}

@Override
public RecordParser exceptionHandler(final Handler<Throwable> handler) {
Member

For all of these "fluent" calls that return a RecordParser, wouldn't you want to return this rather than the underlying RecordParser? Otherwise, callers might circumvent your locking.
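A hypothetical fragment illustrating this suggestion (not the PR's actual code): a fluent method on the wrapper returns the wrapper itself, so follow-up calls keep going through the synchronized shim instead of the raw parser.

import io.vertx.core.Handler;
import io.vertx.core.parsetools.RecordParser;

// Hypothetical sketch: return the wrapper ("this") from fluent methods so callers
// cannot accidentally bypass the shim's locking on later calls.
final class FluentShimFragment {

  private final RecordParser delegate;

  FluentShimFragment(final RecordParser delegate) {
    this.delegate = delegate;
  }

  public FluentShimFragment exceptionHandler(final Handler<Throwable> handler) {
    delegate.exceptionHandler(handler);
    return this; // returning delegate.exceptionHandler(handler) would leak the raw parser
  }
}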

@lucasbru lucasbru closed this Oct 5, 2023