Defer initialization of JGroups after logging is set up by Quarkus #29131

ahus1 · 2024-04-29T07:46:39Z

Closes keycloak#29129 Signed-off-by: Alexander Schwartz <aschwart@redhat.com>

vmuzikar

@ahus1 Thank you for the PR! The change makes sense to me but let's wait for @mabartos as he has more context about the original change.

mabartos · 2024-05-13T11:30:50Z

@ahus1 Thanks for the investigation around it, good job!

It is unfortunate that these 'revert' changes are required, as the original changes removed the idle CPU gap shown here (don't focus on the red circle, but rather on the CPU utilization during ~2.3s-4.2s):

Higher CPU usage was expected, because spreading these tasks across different threads allows more parallel execution than in the figure above.

However, as you mentioned, the issue is more critical with the trace logs, presumably written into the DelayedHandler.

I'm just wondering whether we could avoid the changes in this PR.
What about setting the log level for the logger in the org.jgroups.stack.Protocol class before its initialization and avoiding using the trace level?

By the time the build step is executed, we have already initialized the whole configuration, so if the trace level for JGroups is required, we can allow it.

To summarize, we could:

1. Check whether the configuration requires the trace level for JGroups; otherwise, change the level to info or similar.
2. Set the level for the org.jgroups.stack.Protocol logger via its config option?
3. Start the new cache thread.
4. Do not move the build step into the RUNTIME_INIT phase :)

@ahus1 I haven't tried it, but it might be feasible, right?
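
The first three steps listed above could look roughly like the following. This is only a minimal sketch under stated assumptions, not code from the PR: it uses java.util.logging as a stand-in for the logging backend Keycloak actually runs on, and the class name, method names and the `log.level.org.jgroups` key are hypothetical. Only the `org.jgroups.stack.Protocol` logger name comes from the discussion.

```java
import java.util.Map;
import java.util.logging.Level;
import java.util.logging.Logger;

// Hypothetical illustration; java.util.logging stands in for the real backend.
final class JGroupsLogLevelSketch {
    // Logger category named in the comment above.
    static final String PROTOCOL_LOGGER = "org.jgroups.stack.Protocol";
    // Hypothetical configuration key, used only for this illustration.
    static final String LEVEL_KEY = "log.level.org.jgroups";

    // Keep a strong reference; java.util.logging holds loggers weakly.
    private static final Logger PROTOCOL_LOG = Logger.getLogger(PROTOCOL_LOGGER);

    // Step 1: honor trace only when the configuration explicitly asks for it.
    // FINEST is used here as the java.util.logging stand-in for trace.
    static Level resolveLevel(Map<String, String> config) {
        String configured = config.getOrDefault(LEVEL_KEY, "info");
        return "trace".equalsIgnoreCase(configured) ? Level.FINEST : Level.INFO;
    }

    // Steps 2 and 3: pin the logger level first, then start the cache thread.
    static Thread startCacheInit(Map<String, String> config, Runnable cacheInit) {
        PROTOCOL_LOG.setLevel(resolveLevel(config));
        Thread thread = new Thread(cacheInit, "cache-init");
        thread.start();
        return thread;
    }
}
```

The ordering is the point of the sketch: the level is fixed before the thread that would initialize the cache (and with it JGroups) is started, so nothing on that thread observes the default level.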

ahus1 · 2024-05-14T09:19:06Z

@mabartos - the pause you see, is that maybe the pause where the discovery of ISPN takes place? It only applies when the first node is started.

My usual comment about optimizations applies here: If it makes things more fragile and complex, it is a trade-off with maintainability, and I'd rather not do it. Even if we fix it for the logging of the protocol, we might miss other locations, as the JGroups library is also "optimizing".

@pruivo - is there a way using the regular Java APIs to get hold of the instance of org.jgroups.protocols.TP to set the log level? If that is possible, one would also need to verify what kind of log framework JGroups detects here, as the Slf4j implementation doesn't allow for setting the log level and will throw an exception :-(
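
The concern about a backend that refuses level changes can be illustrated with a guarded call. This is a sketch only: `LevelSettable` is a hypothetical stand-in interface, not a JGroups or SLF4J type, and the assumption that the refusal surfaces as an unchecked exception would need to be verified against the log implementation JGroups actually detects.

```java
// Hypothetical illustration of tolerating a backend that rejects level changes.
final class GuardedLevelChangeSketch {
    // Hypothetical stand-in for a logging backend; not a JGroups or SLF4J type.
    interface LevelSettable {
        void setLevel(String level);
    }

    // Tries to change the level and reports whether the backend accepted it.
    static boolean trySetLevel(LevelSettable log, String level) {
        try {
            log.setLevel(level);
            return true;
        } catch (RuntimeException e) {
            // Assumed failure mode: the backend rejects runtime level changes.
            return false;
        }
    }
}
```

A caller could use the boolean result to decide whether a fallback is needed. The sketch also shows the fragility mentioned above: every place that adjusts a level would need the same guard.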

pruivo · 2024-05-14T09:33:36Z

@ahus1 after the CacheManager is instantiated:

var t = GlobalComponentRegistry.componentOf(cacheManager, Transport.class);
((JGroupsTransport)t).getChannel().getProtocolStack().getTransport().isTrace(false);

mabartos · 2024-05-14T09:55:48Z

> @mabartos - the pause you see, is that maybe the pause where the discovery of ISPN takes place? It only applies when the first node is started.

Yep, AFAIK, during that phase the node probes the cluster for an existing coordinator. If there isn't one, the node becomes the coordinator itself once the discovery timeout expires. So it should apply only to the first node, as you've mentioned.

Perhaps it'd be really better to stick with these changes you've provided, as the CPU gap for the first node is not a big deal. It'll probably be better in terms of maintainability, as you've mentioned.

mabartos

Based on my previous comment, I'm ok with these changes.

Defer initialization of JGroups after logging is set up by Quarkus

7f21882

Closes keycloak#29129 Signed-off-by: Alexander Schwartz <aschwart@redhat.com>

ahus1 self-assigned this Apr 29, 2024

vmuzikar approved these changes Apr 29, 2024

vmuzikar requested a review from mabartos April 29, 2024 08:33

keycloak-github-bot bot added the flaky-test label Apr 29, 2024

ahus1 marked this pull request as ready for review April 29, 2024 09:02

ahus1 requested review from a team as code owners April 29, 2024 09:02

keycloak-github-bot bot added team/cloud-native labels Apr 29, 2024

mabartos approved these changes May 14, 2024

vmuzikar merged commit 701e49e into keycloak:main May 14, 2024
65 checks passed
