
Deduplicate Heavy CCR Repository CS Requests #91398

Merged

Conversation

original-brownbear
Member

We run the same request back to back for each put-follower call during the restore. Also, concurrent put-follower calls will all run the same full CS request concurrently.
In older versions, prior to #87235, the concurrency was limited by the size of the snapshot pool. With that fix though, these requests run at almost arbitrary concurrency when many put-follow requests are executed concurrently.
-> fixed by using the existing deduplicator to only run a single remote CS request at a time for each CCR repository.
Also, this removes the needless forking in the put-follower action, which is no longer necessary now that the CCR repository is non-blocking (we do the same for normal restores, which can safely be started from a transport thread). This should fix some bad-UX situations where the snapshot threads are busy on master, causing put-follower requests to not go through in time.
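As a loose before/after illustration of the forking removal mentioned here (the restoreService call and variable names are assumptions, not the exact production code; the removed fork shows up in the diff hunks further down):

```java
// Before: the put-follower action forked to the SNAPSHOT pool just to kick off the restore,
// so busy snapshot threads on the master could delay put-follower requests.
threadPool.executor(ThreadPool.Names.SNAPSHOT).execute(new AbstractRunnable() {
    @Override
    protected void doRun() {
        restoreService.restoreSnapshot(restoreRequest, restoreListener);
    }

    @Override
    public void onFailure(Exception e) {
        restoreListener.onFailure(e);
    }
});

// After: with the CCR repository non-blocking, the restore is started directly on the
// transport thread, just like normal restores.
restoreService.restoreSnapshot(restoreRequest, restoreListener);
```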

@original-brownbear original-brownbear added >bug :Distributed/CCR Issues around the Cross Cluster State Replication features v8.5.1 v8.6.0 v7.17.8 labels Nov 8, 2022
@elasticsearchmachine
Collaborator

Pinging @elastic/es-distributed (Team:Distributed)

@elasticsearchmachine elasticsearchmachine added the Team:Distributed Meta label for distributed team label Nov 8, 2022
@elasticsearchmachine
Collaborator

Hi @original-brownbear, I've created a changelog YAML for you.

/**
* Dummy request key for deduplicating all remote cluster state requests via {@link #getRemoteStateDeduplicator}.
*/
private static final Object RESULT_KEY = new Object();
Member Author

This is a little awkward but I think it's good enough, and I didn't want to build a whole new thing for the deduplication here when the existing deduplicator works just fine for what we need this way ...
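For readers following along, here is a rough, self-contained sketch (plain Java, not the actual Elasticsearch deduplicator) of what keyed deduplication with a single constant dummy key buys: every caller that arrives while a request for that key is in flight simply shares that request's response, so at most one remote cluster state request runs at a time per repository.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;
import java.util.function.BiConsumer;
import java.util.function.Consumer;

// Illustrative only; error handling omitted for brevity. Listeners registered under the same key
// while a request is in flight all receive the response of that single in-flight request.
final class KeyedDeduplicator<K, R> {

    private final Map<K, List<Consumer<R>>> inFlight = new HashMap<>();

    public void executeOnce(K key, Consumer<R> listener, BiConsumer<K, Consumer<R>> callback) {
        synchronized (inFlight) {
            final List<Consumer<R>> waiters = inFlight.get(key);
            if (waiters != null) {
                waiters.add(listener); // piggy-back on the request already running for this key
                return;
            }
            inFlight.put(key, new ArrayList<>(List.of(listener)));
        }
        callback.accept(key, response -> {
            final List<Consumer<R>> waiters;
            synchronized (inFlight) {
                waiters = inFlight.remove(key);
            }
            waiters.forEach(w -> w.accept(response));
        });
    }
}
```

Note that with a constant key, a caller that registers just after the request was sent can receive a response computed before it asked, which is the staleness concern raised below.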

DaveCTurner
DaveCTurner previously approved these changes Nov 8, 2022
Contributor

@DaveCTurner DaveCTurner left a comment

LGTM

Contributor

@henningandersen henningandersen left a comment

I think this could introduce a flaw (an edge case, for sure), but I also think the cure is easy enough that we should do it.

// We only allow a single remote cluster state request at a time. The callbacks to the cluster state responses run on the
// transport thread and can safely be assumed to be fast enough that this never leads to seeing substantially outdated
// remote states, even when a hot loop is calling this method.
getRemoteStateDeduplicator.executeOnce(
Contributor

@henningandersen henningandersen Nov 8, 2022

I think the use of the deduplicator risks outdated info (which I see mentioned in the comment, but I'm not sure I follow the hot-ness argument). I think of it mainly as: if the remote cluster is slow to respond, we risk someone having an application that:

  1. Creates an index on the remote cluster (leader).
  2. Invokes put-follow on the local cluster (follower).

The put-follow request could then fail due to seeing an outdated cluster state (in case other concurrent put-follow requests cause this)?

I think refactoring CapacityResponseCache will do what you want here. It seems like a utility we want to have anyway - one that does only a single calculation at a time and collapses queued requests into one, which is what CapacityResponseCache does.

Member Author

The put-follow request could then fail due to seeing an outdated cluster state (in case other concurrent put-follow requests cause this)?

Hmm maybe ... you're right here actually, I think. Let me try refactoring that thing :)

Member Author

Hmm, CapacityResponseCache turned out to be quite different from what we need here since it deals with a heavy but synchronous action.

I implemented a simple solution instead, similar to what we have for deduplicating repository data in the blob store repository, which we could extract and reuse for e.g. stats as well, like we discussed in the past. Let me know if this is ok with you.
I did some quick benchmarking with this solution and it's also far superior in performance to the previous one since it deduplicates a lot more requests (with the first call causing all subsequent ones to queue up, it works out quite nicely).
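A minimal, self-contained sketch of the queuing behaviour described here (plain Java with illustrative names, not the actual class that landed): the first caller triggers the request, callers arriving while it is in flight are parked, and when it completes a single fresh request is started for everyone who queued up, so nobody can observe a result computed before they asked.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.function.Consumer;

// Illustrative only; error handling omitted for brevity.
final class QueuedDeduplicator<T> {

    /** The expensive operation; it reports its result to the supplied callback when done. */
    private final Consumer<Consumer<T>> executeRequest;

    private final Object mutex = new Object();
    private List<Consumer<T>> waiting; // non-null while a request is in flight

    QueuedDeduplicator(Consumer<Consumer<T>> executeRequest) {
        this.executeRequest = executeRequest;
    }

    public void execute(Consumer<T> callback) {
        synchronized (mutex) {
            if (waiting != null) {
                waiting.add(callback); // a request is running: park this caller for the next round
                return;
            }
            waiting = new ArrayList<>(); // mark a request as in flight
        }
        doExecute(List.of(callback));
    }

    private void doExecute(List<Consumer<T>> callbacks) {
        executeRequest.accept(result -> {
            callbacks.forEach(c -> c.accept(result));
            final List<Consumer<T>> next;
            synchronized (mutex) {
                if (waiting.isEmpty()) {
                    waiting = null; // nothing queued up while we ran, go idle
                    return;
                }
                next = waiting; // serve everyone who queued up with one fresh request
                waiting = new ArrayList<>();
            }
            doExecute(next);
        });
    }
}
```

Compared to the constant-key deduplication sketched earlier, the difference is that late arrivals wait for the next request instead of sharing the in-flight one, which addresses the staleness concern while still collapsing N concurrent callers into at most two remote requests.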

@@ -192,56 +191,44 @@ private void createFollowerIndex(
threadPool.getThreadContext().getHeaders(),
clusterService.state()
);
threadPool.executor(ThreadPool.Names.SNAPSHOT).execute(new AbstractRunnable() {
Contributor

Not sure I follow why this is important to this PR?

Member Author

@original-brownbear original-brownbear Nov 8, 2022

I kinda liked cleaning this up here since, in a sense, it's part of the necessary follow-up fixes for the async behaviour to work neatly, but I can pull it out into a separate PR if you want?

EDIT: never mind, if I do the other refactoring this gets messy, so I'm moving it out :)

@DaveCTurner DaveCTurner dismissed their stale review November 8, 2022 13:40

Ah Henning is right, we need to use a cluster state requested after receiving the put-follow request.

Contributor

@henningandersen henningandersen left a comment

This direction looks good.

Can we add a test verifying that concurrent CcrRepository.getRepositoryData calls only execute one call at a time on the leader and also do the batching (something like: once we have fired all the concurrent calls, we expect only one more invocation on the leader)?

response.getNodes().getMaxNodeVersion(),
SnapshotState.SUCCESS
);
}), false));
Contributor

Should we now add assert false to the exception block below? It seems like no exceptions should occur anymore, since getRemoteState handles its own exceptions. If one does occur, we may have double-invoked the listener.
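Something along these lines, assuming hypothetical names rather than the actual code in the PR:

```java
try {
    getRemoteState(listener); // getRemoteState now handles its own exceptions via the listener
} catch (Exception e) {
    // should be unreachable; the assertion makes tests fail loudly if the listener
    // could have been invoked twice
    assert false : e;
    listener.onFailure(e);
}
```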

Member Author

++ added

@original-brownbear
Member Author

original-brownbear commented Nov 10, 2022

Can we add a test verifying that concurrent CcrRepository.getRepositoryData calls only execute one call at a time on the leader and also do the batching (something like: once we have fired all the concurrent calls, we expect only one more invocation on the leader)?

Uff, I tried something with the MockTransport and it's not entirely trivial to get this right timing-wise. We don't have a nice way of checking things from the start to the end of a request via a listener in org.elasticsearch.test.transport.MockTransportService#addRequestHandlingBehavior. I could build out the infrastructure for this but it'll take me quite some time.

EDIT: I guess we could add some unit test of sorts where we call the repo directly ... still quite a bit of work, and this seems like it should be fixed sooner rather than later since it breaks larger users of CCR?

I wanted to go for the same approach in other code; maybe it makes more sense to wait for that and just unit-test the new deduplicator?

@original-brownbear
Member Author

Jenkins run elasticsearch-ci/part-2

@henningandersen
Contributor

it's not entirely trivial to get this right timing-wise. We don't have a nice way of checking things from the start to the end of a request via a listener in

Could we make a less ambitious test that holds a latch at the beginning of the test, which we wait for in the leader's request handling behavior for cluster/state, start X>2 getRepositoryData calls, and then check that all requests are responded to (bare minimum) and, if we can, that there are only 2 requests in the leader cluster (from the follower)?

@original-brownbear
Member Author

Could we make a less ambitious test that holds a latch at the beginning of the test, which we wait for in the leader's request handling behavior for cluster/state, start X>2 getRepositoryData calls, and then check that all requests are responded to (bare minimum) and, if we can, that there are only 2 requests in the leader cluster (from the follower)?

This is exactly what I tried. It's not quite as trivial as it seems. The follower will send various requests to the leader (state for just a single index, for example), so I can't simply block a transport thread via a latch because that will be unstable and could lead to other requests getting blocked.
So you'll ideally need some non-blocking delay here (I had to build that before for another test, I believe), and then, to really only get two requests for X put-follow actions, you will have to wait for exactly when all those requests have gone out and had their responses. None of that is rocket science, but it will probably take me a couple of hours to get really stable.
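For reference, a rough sketch of the kind of counting test being discussed, with the caveats above in mind (helper names and listener shapes are assumptions, and in practice the per-index state requests mentioned above would need to be filtered out of the count):

```java
// Count cluster state requests arriving on the leader while still letting them through.
final AtomicInteger remoteStateRequests = new AtomicInteger();
final MockTransportService leaderTransport = getLeaderMockTransportService(); // assumed helper
leaderTransport.addRequestHandlingBehavior(ClusterStateAction.NAME, (handler, request, channel, task) -> {
    remoteStateRequests.incrementAndGet();
    handler.messageReceived(request, channel, task);
});

// Fire a bunch of concurrent getRepositoryData calls against the follower's CCR repository.
final int concurrentCalls = between(3, 10);
final CountDownLatch done = new CountDownLatch(concurrentCalls);
for (int i = 0; i < concurrentCalls; i++) {
    ccrRepository.getRepositoryData(ActionListener.wrap(repositoryData -> done.countDown(), e -> done.countDown()));
}

// Bare minimum: everyone gets an answer; ideally: at most two remote requests were sent.
assertTrue(done.await(30, TimeUnit.SECONDS));
assertThat(remoteStateRequests.get(), lessThanOrEqualTo(2));
```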

@tmgordeeva tmgordeeva added v8.5.2 and removed v8.5.1 labels Nov 15, 2022
@kingherc kingherc added v8.7.0 and removed v8.6.0 labels Nov 16, 2022
Contributor

@henningandersen henningandersen left a comment

LGTM.

@original-brownbear
Member Author

Thanks Henning! Reviewed before I even had the chance to ping ❤️ :)

@original-brownbear original-brownbear added the auto-backport-and-merge Automatically create backport pull requests and merge when ready label Nov 20, 2022
@original-brownbear original-brownbear merged commit d1c5ca2 into elastic:main Nov 20, 2022
@original-brownbear original-brownbear deleted the fix-ccr-duplicate-cs-requests branch November 20, 2022 18:12
@elasticsearchmachine
Collaborator

💔 Backport failed

Branch  Result
7.17    Commit could not be cherrypicked due to conflicts
8.5     Commit could not be cherrypicked due to conflicts

You can use sqren/backport to manually backport by running backport --upstream elastic/elasticsearch --pr 91398

@bpintea bpintea added v8.5.3 and removed v8.5.2 labels Nov 22, 2022
original-brownbear added a commit to original-brownbear/elasticsearch that referenced this pull request Apr 19, 2023
elasticsearchmachine pushed a commit that referenced this pull request Apr 19, 2023
Labels
auto-backport-and-merge Automatically create backport pull requests and merge when ready >bug :Distributed/CCR Issues around the Cross Cluster State Replication features Team:Distributed Meta label for distributed team v7.17.10 v8.7.0
7 participants