GEODE-10056: Improve gateway-receiver load balance #7378

jvarenina · 2022-02-17T09:11:28Z

The problem is that servers send incorrect gateway-receiver connection
load to locators within CacheServerLoadMessage. Additionally, locators
do not refresh gateway-receivers load with the load received in
CacheServerLoadMessage. The only time locator increments
gateway-receiver load is after it receives
ClientConnectionRequest{group=__recv_group...} and returns selected
server in ClientConnectionResponse message. The client sends
a ClientConnectionRequest to the one locator from list received
in RemoteLocatorJoinResponse (initial list of locators) or
LocatorListRequest (periodically updated list of locators).
The received list is always sorted by the host address and port.
The client will send ClientConnectionRequest following the
sorted list of locators (from first to last) until a successful outcome.
That means that the same locator (first one in the list)
will handle all connection requests in normal conditions, and other
locators will not update their gateway-receivers connection load.

The solution is to track the gateway-receiver acceptor connection count
correctly and, based on it, accurately calculate the load
when sending CacheServerLoadMessage. Additionally, each locator will
read the load received from CacheServerLoadMessage and update the
gateway-receivers connection load in group __recv__group accordingly

For all changes:

Is there a JIRA ticket associated with this PR? Is it referenced in the commit message?
Has your PR been rebased against the latest commit within the target branch (typically develop)?
Is your initial contribution a single, squashed commit?
Does gradlew build run cleanly?
Have you written or updated unit tests to verify your changes?
If adding new dependencies to the code, are these dependencies licensed in a way that is compatible for inclusion under ASF 2.0?

onichols-pivotal

comment deleted

jvarenina · 2022-02-23T07:35:24Z

Hi reviewers,

With this solution, each server will now send CacheServerLoadMessage containing the correct connection load of the gateway-receiver to all locators in the cluster. This action will happen every 5 seconds as configured with the load-poll-interval parameter. Additionally, the coordinator locator will increase the load each time it provides the server location to the remote gateway-sender in ClientConnectionRequest/ClientConnectionResponse. Locator only maintains load temporarily until CacheServerLoadMessage is received. This behavior makes sense as the server tracks connection load more accurately than the locator. Locator only increases connection load based on the received connection requests while server adjusts the connection load each time connection is established and disconnected.

ClientConnectionRequest messages are usually sent to the locator in bursts when the gateway-sender is establishing connections due to traffic. This behavior results in the locator's connection load being way ahead of the server connection load because servers did not establish those connections yet. Suppose during these bursts CacheServerLoadMessage message come to locator carrying low load value for one of the gateway-receivers. In that case, that receiver will be picked more frequently (will have the lowest load), resulting in unbalanced gateway-sender connections. In order for this to have a big impact on load-balancing of sender connections the gateway-receivers must be started with some small delay, so that CacheServerLoadMessages are sent with some delay that is enough to cause imbalance. If CacheServerLoadMessages were sent at the similar time then this would not be a problem as all messages would have similar load and would update locator at similar time.

I would be really grateful if you could share your opinion on this matter?

The problem is that servers send incorrect gateway-receiver connection load to locators within CacheServerLoadMessage. Additionally, locators do not refresh gateway-receivers load with the load received in CacheServerLoadMessage. The only time locator increments gateway-receiver load is after it receives ClientConnectionRequest{group=__recv_group...} and returns selected server in ClientConnectionResponse message. This is done only by coordinator, so that means that other locators will have load with initial values, since it is never updated. The solution is to correctly track gateway-receiver acceptor connection count and then based on it correctly calculate the load when sending CacheServerLoadMessage. Additionally each locator will read the load received from CacheServerLoadMessage and update load for gateway-receiver location id in group __recv__group accordingly.

DonalEvans

I'm only a codeowner for one of the files changed in this PR, so I can't give much feedback on the overall approach, but there are a few general clean-up changes that would be good to make.

geode-core/src/main/java/org/apache/geode/distributed/internal/LocatorLoadSnapshot.java

geode-core/src/main/java/org/apache/geode/distributed/internal/ServerLocator.java

...e-core/src/test/java/org/apache/geode/distributed/internal/LocatorLoadSnapshotJUnitTest.java

...e/internal/cache/wan/parallel/ParallelGatewaySenderConnectionLoadBalanceDistributedTest.java

boglesby · 2022-03-16T00:53:29Z

I'm not sure how to resolve the race condition you mention, but I see similar behavior with client/server connections.

If a burst of connections is requested and none of those are made before the next load is received from the server, then the locator's load for that server gets reset back to zero.

A burst of connections (10 in this case) causes the load to go from 0.0 to 0.012499998:

[warn 2022/03/15 14:38:37.905 PDT locator <locator request thread 1> tid=0x24] XXX LocatorLoadSnapshot.getServerForConnection potentialServers={192.168.1.5:51249@192.168.1.5(server1:30200)<v1>:41001=LoadHolder[0.0, 192.168.1.5:51249, loadPollInterval=5000, 0.00125]}

[warn 2022/03/15 14:38:37.906 PDT locator <locator request thread 1> tid=0x24] XXX LocatorLoadSnapshot.getServerForConnection selectedServer=192.168.1.5:51249; loadBeforeUpdate=0.0

[warn 2022/03/15 14:38:37.907 PDT locator <locator request thread 1> tid=0x24] XXX LoadHolder.incConnections location=192.168.1.5:51249; load=0.00125

[warn 2022/03/15 14:38:37.907 PDT locator <locator request thread 1> tid=0x24] XXX LocatorLoadSnapshot.getServerForConnection selectedServer=192.168.1.5:51249; loadAfterUpdate=0.00125

...

[warn 2022/03/15 14:38:38.005 PDT locator <locator request thread 1> tid=0x24] XXX LocatorLoadSnapshot.getServerForConnection potentialServers={192.168.1.5:51249@192.168.1.5(server1:30200)<v1>:41001=LoadHolder[0.011249999, 192.168.1.5:51249, loadPollInterval=5000, 0.00125]}

[warn 2022/03/15 14:38:38.005 PDT locator <locator request thread 1> tid=0x24] XXX LocatorLoadSnapshot.getServerForConnection selectedServer=192.168.1.5:51249; loadBeforeUpdate=0.011249999

[warn 2022/03/15 14:38:38.005 PDT locator <locator request thread 1> tid=0x24] XXX LoadHolder.incConnections location=192.168.1.5:51249; load=0.012499998

[warn 2022/03/15 14:38:38.005 PDT locator <locator request thread 1> tid=0x24] XXX LocatorLoadSnapshot.getServerForConnection selectedServer=192.168.1.5:51249; loadAfterUpdate=0.012499998

If none of those connections are made before the next load is sent by that server, its load goes from 0.012499998 to 0.0:

[warn 2022/03/15 14:39:25.140 PDT locator <P2P message reader for 192.168.1.5(server1:30200)<v1>:41001 unshared ordered sender uid=5 dom #1 local port=55139 remote port=51286> tid=0x56] XXX LocatorLoadSnapshot.updateLoad about to update connectionLoadMap location=192.168.1.5:51249; load=0.0; loadPerConnection=0.00125

[warn 2022/03/15 14:39:25.140 PDT locator <P2P message reader for 192.168.1.5(server1:30200)<v1>:41001 unshared ordered sender uid=5 dom #1 local port=55139 remote port=51286> tid=0x56] XXX LocatorLoadSnapshot.updateMap location=192.168.1.5:51249; loadBeforeUpdate=0.012499998

[warn 2022/03/15 14:39:25.141 PDT locator <P2P message reader for 192.168.1.5(server1:30200)<v1>:41001 unshared ordered sender uid=5 dom #1 local port=55139 remote port=51286> tid=0x56] XXX LocatorLoadSnapshot.updateMap location=192.168.1.5:51249; loadAfterUpdate=0.0

[warn 2022/03/15 14:39:25.141 PDT locator <P2P message reader for 192.168.1.5(server1:30200)<v1>:41001 unshared ordered sender uid=5 dom #1 local port=55139 remote port=51286> tid=0x56] XXX LocatorLoadSnapshot.updateLoad done update connectionLoadMap location=192.168.1.5:51249

The load for the next request starts is 0.0 again:

[warn 2022/03/15 14:39:33.475 PDT locator <locator request thread 2> tid=0x54] XXX LocatorLoadSnapshot.getServerForConnection potentialServers={192.168.1.5:51249@192.168.1.5(server1:30200)<v1>:41001=LoadHolder[0.0, 192.168.1.5:51249, loadPollInterval=5000, 0.00125]}

[warn 2022/03/15 14:39:33.475 PDT locator <locator request thread 2> tid=0x54] XXX LocatorLoadSnapshot.getServerForConnection selectedServer=192.168.1.5:51249; loadBeforeUpdate=0.0

[warn 2022/03/15 14:39:33.475 PDT locator <locator request thread 2> tid=0x54] XXX LoadHolder.incConnections location=192.168.1.5:51249; load=0.00125

[warn 2022/03/15 14:39:33.475 PDT locator <locator request thread 2> tid=0x54] XXX LocatorLoadSnapshot.getServerForConnection selectedServer=192.168.1.5:51249; loadAfterUpdate=0.00125

...

One thing to note is that the load is only sent load-poll-interval (default=5 seconds) if it has changed. If it hasn't changed then it only gets sent every update frequency (which is 10 * 5 seconds by default).

There is a boolean to control that frequency too:

private static final int FORCE_LOAD_UPDATE_FREQUENCY = getInteger(
  GeodeGlossary.GEMFIRE_PREFIX + "BridgeServer.FORCE_LOAD_UPDATE_FREQUENCY", 10);

The load-poll-interva is configurable, but currently only for the cache server not the gateway receiver. It probably wouldn't be too hard to add this support to gateway receiver.

Also, there is a gfsh load-balance gateway-sender command that could help alleviate this condition.

I'm still reviewing the PR.

boglesby · 2022-03-17T18:14:35Z

I ran a few tests with some extra logging on these changes. They look good.

The receiver exchanges profiles with the locator:

[warn 2022/03/16 14:16:12.440 PDT locator-ln <Pooled High Priority Message Processor 2> tid=0x50] XXX LocatorLoadSnapshot.updateConnectionLoadMap location=192.168.1.5:5370; load=0.0

[warn 2022/03/16 14:16:12.441 PDT locator-ln <Pooled High Priority Message Processor 2> tid=0x50] XXX LocatorLoadSnapshot.updateConnectionLoadMap current load for location=192.168.1.5:5370; group=__recv__group; inputLoad=0.0; currentLoad=0.0

[warn 2022/03/16 14:16:12.441 PDT locator-ln <Pooled High Priority Message Processor 2> tid=0x50] XXX LocatorLoadSnapshot.updateConnectionLoadMap updated load for location=192.168.1.5:5370; group=__recv__group; inputLoad=0.0; newLoad=0.0

The connectionLoadMap shows 2 groups, namely the null group (default) and the __recv__group group (gateway receiver), each with load=0.0:

[warn 2022/03/16 14:16:13.777 PDT locator-ln <Thread-14> tid=0x43] XXX LocatorLoadSnapshot.logConnectionLoadMap
The connectionLoadMap contains the following 2 entries:
	group=null
		location=192.168.1.5:56224; load=0.0
	group=__recv__group
		location=192.168.1.5:5370; load=0.0

Sender connects to the receiver:

With the default of 5 dispatcher threads, 5 connections are made to the receiver. The load goes from 0.0 to 0.0062499996:

[warn 2022/03/16 14:16:53.836 PDT locator-ln <locator request thread 2> tid=0x47] XXX LocatorLoadSnapshot.getServerForConnection group=__recv__group; server=192.168.1.5:5370; loadBeforeUpdate=0.0

[warn 2022/03/16 14:16:53.836 PDT locator-ln <locator request thread 2> tid=0x47] XXX LocatorLoadSnapshot.getServerForConnection group=__recv__group; server=192.168.1.5:5370; loadAfterUpdate=0.00125


[warn 2022/03/16 14:16:53.836 PDT locator-ln <locator request thread 6> tid=0x5c] XXX LocatorLoadSnapshot.getServerForConnection group=__recv__group; server=192.168.1.5:5370; loadBeforeUpdate=0.00125

[warn 2022/03/16 14:16:53.836 PDT locator-ln <locator request thread 6> tid=0x5c] XXX LocatorLoadSnapshot.getServerForConnection group=__recv__group; server=192.168.1.5:5370; loadAfterUpdate=0.0025


[warn 2022/03/16 14:16:53.837 PDT locator-ln <locator request thread 5> tid=0x5b] XXX LocatorLoadSnapshot.getServerForConnection group=__recv__group; server=192.168.1.5:5370; loadBeforeUpdate=0.0025

[warn 2022/03/16 14:16:53.837 PDT locator-ln <locator request thread 5> tid=0x5b] XXX LocatorLoadSnapshot.getServerForConnection group=__recv__group; server=192.168.1.5:5370; loadAfterUpdate=0.00375


[warn 2022/03/16 14:16:53.837 PDT locator-ln <locator request thread 4> tid=0x5a] XXX LocatorLoadSnapshot.getServerForConnection group=__recv__group; server=192.168.1.5:5370; loadBeforeUpdate=0.00375

[warn 2022/03/16 14:16:53.837 PDT locator-ln <locator request thread 4> tid=0x5a] XXX LocatorLoadSnapshot.getServerForConnection group=__recv__group; server=192.168.1.5:5370; loadAfterUpdate=0.005


[warn 2022/03/16 14:16:53.838 PDT locator-ln <locator request thread 3> tid=0x59] XXX LocatorLoadSnapshot.getServerForConnection group=__recv__group; server=192.168.1.5:5370; loadBeforeUpdate=0.005

[warn 2022/03/16 14:16:53.838 PDT locator-ln <locator request thread 3> tid=0x59] XXX LocatorLoadSnapshot.getServerForConnection group=__recv__group; server=192.168.1.5:5370; loadAfterUpdate=0.0062499996

The connectionLoadMap shows the same 2 groups but now the __recv__group group load is 0.0062499996 for the gateway receiver:

[warn 2022/03/16 14:16:55.831 PDT locator-ln <Thread-14> tid=0x43] XXX LocatorLoadSnapshot.logConnectionLoadMap
The connectionLoadMap contains the following 2 entries:
	group=null
		location=192.168.1.5:56224; load=0.0
	group=__recv__group
		location=192.168.1.5:5370; load=0.0062499996

Update the load:

Periodically, the server sends an updated load to the locator.

[warn 2022/03/16 14:16:57.464 PDT locator-ln <P2P message reader for 192.168.1.5(ln-1:75228)<v1>:41002 unshared ordered sender uid=5 dom #1 local port=45635 remote port=56270> tid=0x5e] XXX LocatorLoadSnapshot.updateConnectionLoadMap current load for location=192.168.1.5:5370; group=__recv__group; inputLoad=0.00625; currentLoad=0.0062499996

[warn 2022/03/16 14:16:57.464 PDT locator-ln <P2P message reader for 192.168.1.5(ln-1:75228)<v1>:41002 unshared ordered sender uid=5 dom #1 local port=45635 remote port=56270> tid=0x5e] XXX LocatorLoadSnapshot.updateConnectionLoadMap updated load for location=192.168.1.5:5370; group=__recv__group; inputLoad=0.00625; newLoad=0.00625

[warn 2022/03/16 14:16:57.832 PDT locator-ln <Thread-14> tid=0x43] XXX LocatorLoadSnapshot.logConnectionLoadMap
The connectionLoadMap contains the following 2 entries:
	group=null
		location=192.168.1.5:56224; load=0.0
	group=__recv__group
		location=192.168.1.5:5370; load=0.00625

Update the load after ping connection has been made:

After another connection is made, the load is updated again.

[warn 2022/03/16 14:17:02.466 PDT locator-ln <P2P message reader for 192.168.1.5(ln-1:75228)<v1>:41002 unshared ordered sender uid=5 dom #1 local port=45635 remote port=56270> tid=0x5e] XXX LocatorLoadSnapshot.updateConnectionLoadMap current load for location=192.168.1.5:5370; group=__recv__group; inputLoad=0.0075; currentLoad=0.00625

[warn 2022/03/16 14:17:02.466 PDT locator-ln <P2P message reader for 192.168.1.5(ln-1:75228)<v1>:41002 unshared ordered sender uid=5 dom #1 local port=45635 remote port=56270> tid=0x5e] XXX LocatorLoadSnapshot.updateConnectionLoadMap updated load for location=192.168.1.5:5370; group=__recv__group; inputLoad=0.0075; newLoad=0.0075

[warn 2022/03/16 14:17:03.841 PDT locator-ln <Thread-14> tid=0x43] XXX LocatorLoadSnapshot.logConnectionLoadMap
The connectionLoadMap contains the following 2 entries:
	group=null
		location=192.168.1.5:56224; load=0.0
	group=__recv__group
		location=192.168.1.5:5370; load=0.0075

Connect another sender:

Another sender with 5 dispatcher threads connects, and the load is updated again.

[warn 2022/03/16 14:29:44.794 PDT locator-ln <Thread-14> tid=0x43] XXX LocatorLoadSnapshot.logConnectionLoadMap
The connectionLoadMap contains the following 2 entries:
	group=null
		location=192.168.1.5:56600; load=0.0
	group=__recv__group
		location=192.168.1.5:5190; load=0.015

Disconnect one sender:

When a sender disconnects, the load is updated again.

[warn 2022/03/16 14:30:38.843 PDT locator-ln <Thread-14> tid=0x43] XXX LocatorLoadSnapshot.logConnectionLoadMap
The connectionLoadMap contains the following 2 entries:
	group=null
		location=192.168.1.5:56600; load=0.0
	group=__recv__group
		location=192.168.1.5:5190; load=0.0075

Start another receiver:

When another receiver is started, an entry for it is added to the connectionLoadMap with load=0.0.

[warn 2022/03/16 14:35:07.535 PDT locator-ln <Thread-14> tid=0x43] XXX LocatorLoadSnapshot.logConnectionLoadMap
The connectionLoadMap contains the following 2 entries:
	group=null
		location=192.168.1.5:56940; load=0.0
		location=192.168.1.5:56833; load=0.0
	group=__recv__group
		location=192.168.1.5:5055; load=0.015
		location=192.168.1.5:5256; load=0.0

Two receivers and two senders:

When two receivers are started and two senders are connected, the load is updated (and balanced). In this case, the extra connections are pingers - one from each sender to each receiver.

[warn 2022/03/16 14:44:32.269 PDT locator-ln <Thread-14> tid=0x43] XXX LocatorLoadSnapshot.logConnectionLoadMap
The connectionLoadMap contains the following 2 entries:
	group=null
		location=192.168.1.5:57530; load=0.0
		location=192.168.1.5:57553; load=0.0
	group=__recv__group
		location=192.168.1.5:5349; load=0.00875
		location=192.168.1.5:5025; load=0.00875

Load balance senders:

This feature does not seem to be working properly. These changes seem to make it work better. I have another bunch of analysis on this that I will either post separately or file a JIRA on.

jvarenina · 2022-03-18T10:22:56Z

Hi @boglesby,

I also assumed that the same race condition is possible for the client connections, but I haven't tried to reproduce it. Thanks for pointing this out and lots of other valuable information. Also, thank you for the extensive testing you have done.

If we decide to go with this solution, I agree that we should make the load-poll-interval parameter configurable for gateway receivers. Changing it to the lower value would slightly mitigate race condition effects.

The load-balance gateways command is working on server this way:

pauses gateway-sender
destroys all connections and then rely upon the mechanism used during connection creation (ClientConnectionRequest/Response) to do the better load balancing
resume gateway-sender

This command will result again in the burst of connection requests that could hit an issue caused by a race condition.

Maybe instead of sending load information periodically from the servers, the locator could scrape it (perhaps using CacheServerMXBean) from the servers and apply it simultaneously for all receivers in the locator. The locator could get load when it receives a connection request, and the current connection load is stale (e.g., older than 200 ms), as we don't expect many connections from gateway-senders. This way, the locator would at least have an up-to-date connection load taken at a similar time on all servers. This solution should even catch the change in connection load when the load-balance command destroys all connections.

Maybe, an algorithm that could work this way:

Connection request received, check if a connection load is stale (older than new parameter load-update-frequency=200ms)
- if yes, then try to get connection load from all servers asynchronously
  - if received load from all servers, then apply it in the locator
  - if any get fails, then check profiles again and immediately retry for all servers
- Use immediately the current load
If the connection request is not received, then just periodically get load, e.g., every 5 seconds (load-poll-interval)

Not sure if this makes any sense as I don't know how fast locator can scrape the load. I can create a prototype if you see that this could maybe work?

boglesby · 2022-03-24T18:13:26Z

Thats a pretty cool idea. I'm not sure whether the CacheServerMXBean has that behavior, but I guess it could be added. In any event, I think this change is good. I'm approving this change, but you need to address the ParallelGatewaySenderConnectionLoadBalanceDistributedTest failure.

jvarenina · 2022-04-21T12:00:08Z

Hi @Bill , @echobravopapa , @kamilla1201 and @pivotal-jbarrett , this PR requires reviews from your side to merge it. Could you please review it?

jake-at-work · 2022-04-21T15:07:30Z

geode-core/src/main/java/org/apache/geode/distributed/internal/LocatorLoadSnapshot.java


+  @TestOnly
+  public synchronized Map<ServerLocationAndMemberId, ServerLoad> getGatewayReceiverLoadMap() {


Does this need to be public for the test or would package private be sufficient.

jake-at-work · 2022-04-21T15:10:26Z

...e-core/src/test/java/org/apache/geode/distributed/internal/LocatorLoadSnapshotJUnitTest.java

@@ -631,28 +633,93 @@ public void testFindBestServersCalledWithNegativeCount() {
  }

  @Test


Upgrade to JUnit 5.

Hi @pivotal-jbarrett , thanks for the review. Upgrading this test to JUnit 5 would be tricky because it uses Rule annotation, which is replaced by ExtendWith annotation. It would be necessary to implement the new interfaces (AfterEachCallback and BeforeEachCallback) to GfshCommandRule and ClusterStartupRule classes. I think that this should be a part of a separate ticket since this is not just a minor adjustment. What do you think?

Sorry, just realized that you referenced LocatorLoadSnapshotJUnitTest.java and not a ParallelGatewaySenderConnectionLoadBalanceDistributedTest.java). I will make this change and sorry again for misunderstanding your comment.

jake-at-work · 2022-04-21T15:10:49Z

...e-core/src/test/java/org/apache/geode/distributed/internal/LocatorLoadSnapshotJUnitTest.java

@@ -14,6 +14,7 @@
 */
 package org.apache.geode.distributed.internal;

+import static org.assertj.core.api.Assertions.assertThat;
 import static org.junit.Assert.assertEquals;


Convert to AssertJ.

jake-at-work · 2022-04-21T15:12:14Z

I think I would like @upthewaterspout to take a look at this too for good measure.

jake-at-work

Please use collections based AssertJ methods.

...e-core/src/test/java/org/apache/geode/distributed/internal/LocatorLoadSnapshotJUnitTest.java

The test case testMultiUser failed because Wan service is available in geode-core distributed tests, and therefore test now throws: org.apache.geode.internal.cache.wan.GatewaySenderConfigurationException : Locators must be configured before starting gateway-sender. instead of: java.lang.IllegalStateException: WAN service is not available.

jvarenina · 2022-05-11T15:24:56Z

This PR has been hanging for a long time now, and we should decide whether to close it or merge it.

I think this PR adds value to Apache geode if we at least "synchronize" sending of CacheServerLoadMessage on all servers. Current 5 seconds possible difference is just too much. I think this could be done with the following simple not so smart algorithm:

@@ -159,13 +162,33 @@ public class LoadMonitor implements ConnectionListener {
       }
     }
 
+    /**
+     * This function calculates next interval absolute time that is same on all servers in
+     * the cluster if following conditions are fulfilled:
+     * - same pollInterval value is used
+     * - time is synchronized on servers
+     *
+     * @return absolute time of next interval
+     */
+    private long getNextIntervalSynchronizedAbsoluteTime(final long currentTime,
+        final long pollInterval) {
+      return (currentTime - (currentTime % pollInterval)) + pollInterval;
+    }
+
     @Override
     public void run() {
       while (alive) {
         try {
           synchronized (signal) {
-            long end = System.currentTimeMillis() + pollInterval;
-            long remaining = pollInterval;
+            long currentTime = System.currentTimeMillis();
+            long end, remaining;
+            if (isGatewayReceiver) {
+              end = getNextIntervalSynchronizedAbsoluteTime(currentTime, pollInterval);
+              remaining = end - currentTime;
+            } else {
+              end = currentTime + pollInterval;
+              remaining = pollInterval;
+            }
             while (alive && remaining > 0) {
               signal.wait(remaining);
               remaining = end - System.currentTimeMillis();

@boglesby what do you think about this?

This commit synchronizes the getting and sending of gateway-receiver load (CacheServerLoadMessage) on all servers.

Changes are implemented, and there is no reply for several months.

* GEODE-10056: Improve gateway-receiver load balance The problem is that servers send incorrect gateway-receiver connection load to locators within CacheServerLoadMessage. Additionally, locators do not refresh gateway-receivers load with the load received in CacheServerLoadMessage. The only time locator increments gateway-receiver load is after it receives ClientConnectionRequest{group=__recv_group...} and returns selected server in ClientConnectionResponse message. This is done only by coordinator, so that means that other locators will have load with initial values, since it is never updated. The solution is to correctly track gateway-receiver acceptor connection count and then based on it correctly calculate the load when sending CacheServerLoadMessage. Additionally each locator will read the load received from CacheServerLoadMessage and update load for gateway-receiver location id in group __recv__group accordingly. * Updates after the review * Fix for the flaky test cases * Updates after review * Empty commit to trigger test * Updates after review * Fix failed distributed test The test case testMultiUser failed because Wan service is available in geode-core distributed tests, and therefore test now throws: org.apache.geode.internal.cache.wan.GatewaySenderConfigurationException : Locators must be configured before starting gateway-sender. instead of: java.lang.IllegalStateException: WAN service is not available. * Synchronize handling of receiver load This commit synchronizes the getting and sending of gateway-receiver load (CacheServerLoadMessage) on all servers.

jvarenina force-pushed the feature/GEODE-10056 branch 4 times, most recently from 2fabf45 to 0bdee7d Compare February 22, 2022 08:32

jvarenina changed the title ~~GEODE-10056: Work in progress~~ GEODE-10056: Improving gatway-reciever load handling Feb 22, 2022

jvarenina force-pushed the feature/GEODE-10056 branch from 0bdee7d to 272cfef Compare February 22, 2022 09:09

jvarenina changed the title ~~GEODE-10056: Improving gatway-reciever load handling~~ GEODE-10056: Improving gateway-reciever load handling Feb 22, 2022

jvarenina changed the title ~~GEODE-10056: Improving gateway-reciever load handling~~ GEODE-10056: Improving gateway-reciever load balancing Feb 22, 2022

jvarenina force-pushed the feature/GEODE-10056 branch 4 times, most recently from d68095e to ee8d485 Compare February 22, 2022 15:39

jvarenina marked this pull request as ready for review February 23, 2022 07:26

jvarenina requested review from gesterzhou, boglesby, nabarunnag, DonalEvans, jchen21, kirklund, Bill, echobravopapa and kamilla1201 as code owners February 23, 2022 07:26

onichols-pivotal reviewed Feb 23, 2022

View reviewed changes

onichols-pivotal requested a review from albertogpz February 23, 2022 07:35

jvarenina changed the title ~~GEODE-10056: Improving gateway-reciever load balancing~~ GEODE-10056: Improve gateway-receiver load balance Feb 23, 2022

DonalEvans requested changes Mar 4, 2022

View reviewed changes

Updates after the review

cea93c6

jvarenina force-pushed the feature/GEODE-10056 branch from ee8d485 to cea93c6 Compare March 8, 2022 08:39

boglesby approved these changes Mar 24, 2022

View reviewed changes

kirklund approved these changes Apr 7, 2022

View reviewed changes

Fix for the flaky test cases

74eb2d4

jvarenina requested a review from jake-at-work as a code owner April 11, 2022 15:06

jake-at-work requested changes Apr 21, 2022

View reviewed changes

jake-at-work requested a review from upthewaterspout April 21, 2022 15:12

jvarenina requested a review from rhoughton-pivot as a code owner April 22, 2022 14:05

jvarenina requested a review from jake-at-work April 22, 2022 14:05

jvarenina force-pushed the feature/GEODE-10056 branch from da3a03d to 30038f4 Compare April 25, 2022 07:45

Updates after review

6477c6b

jvarenina force-pushed the feature/GEODE-10056 branch from 30038f4 to 6477c6b Compare April 25, 2022 11:45

Empty commit to trigger test

e4100c2

jake-at-work previously requested changes Apr 25, 2022

View reviewed changes

Updates after review

4ce8c28

jvarenina requested a review from jake-at-work May 5, 2022 08:00

jvarenina requested review from jdeppe-pivotal and jinmeiliao as code owners May 11, 2022 10:27

Synchronize handling of receiver load

8d709a8

This commit synchronizes the getting and sending of gateway-receiver load (CacheServerLoadMessage) on all servers.

mivanac merged commit e627e60 into apache:develop Sep 8, 2022

jvarenina deleted the feature/GEODE-10056 branch September 13, 2022 08:58

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

GEODE-10056: Improve gateway-receiver load balance #7378

GEODE-10056: Improve gateway-receiver load balance #7378

jvarenina commented Feb 17, 2022 •

edited

onichols-pivotal left a comment •

edited

jvarenina commented Feb 23, 2022 •

edited

DonalEvans left a comment

boglesby commented Mar 16, 2022

boglesby commented Mar 17, 2022

jvarenina commented Mar 18, 2022 •

edited

boglesby commented Mar 24, 2022

jvarenina commented Apr 21, 2022

jake-at-work Apr 21, 2022

jake-at-work Apr 21, 2022

jvarenina Apr 22, 2022 •

edited

jvarenina Apr 22, 2022

jake-at-work Apr 21, 2022

jake-at-work commented Apr 21, 2022

jake-at-work left a comment

jvarenina commented May 11, 2022 •

edited


		@TestOnly
		public synchronized Map<ServerLocationAndMemberId, ServerLoad> getGatewayReceiverLoadMap() {

		@@ -631,28 +633,93 @@ public void testFindBestServersCalledWithNegativeCount() {
		}

		@Test

GEODE-10056: Improve gateway-receiver load balance #7378

GEODE-10056: Improve gateway-receiver load balance #7378

Conversation

jvarenina commented Feb 17, 2022 • edited

For all changes:

onichols-pivotal left a comment • edited

Choose a reason for hiding this comment

jvarenina commented Feb 23, 2022 • edited

DonalEvans left a comment

Choose a reason for hiding this comment

boglesby commented Mar 16, 2022

boglesby commented Mar 17, 2022

The receiver exchanges profiles with the locator:

Sender connects to the receiver:

Update the load:

Update the load after ping connection has been made:

Connect another sender:

Disconnect one sender:

Start another receiver:

Two receivers and two senders:

Load balance senders:

jvarenina commented Mar 18, 2022 • edited

boglesby commented Mar 24, 2022

jvarenina commented Apr 21, 2022

jake-at-work Apr 21, 2022

Choose a reason for hiding this comment

jake-at-work Apr 21, 2022

Choose a reason for hiding this comment

jvarenina Apr 22, 2022 • edited

Choose a reason for hiding this comment

jvarenina Apr 22, 2022

Choose a reason for hiding this comment

jake-at-work Apr 21, 2022

Choose a reason for hiding this comment

jake-at-work commented Apr 21, 2022

jake-at-work left a comment

Choose a reason for hiding this comment

jvarenina commented May 11, 2022 • edited

jvarenina commented Feb 17, 2022 •

edited

onichols-pivotal left a comment •

edited

jvarenina commented Feb 23, 2022 •

edited

jvarenina commented Mar 18, 2022 •

edited

jvarenina Apr 22, 2022 •

edited

jvarenina commented May 11, 2022 •

edited