[AMQ-9394] Tech Preview: Virtual Thread support #1172

mattrpav · 2024-03-05T20:54:21Z

Tasks:

Throw exception if virtualThreadTaskRunner is enabled and JDK 21 (or higher) is not available
Breakout the Virtual Thread factory init so it only logs on JDK 17
Add webpage with instructions and implementation progress status (https://activemq.apache.org/virtual-threads)
Update 'yield()' usage to have proper syntax
Add a VirtualThreadTaskRunner
Add activemq-client-jdk21-test module
Add activemq-client-jdk21 to the assembly
Add @experimental annotation to communicate PREVIEW status
Add multi-consumer, multi-queue test results

VirtualThreadTaskRunnerBrokerTest results:

[INFO] -------------------------------------------------------
[INFO]  T E S T S
[INFO] -------------------------------------------------------
[INFO] Running org.apache.activemq.broker.VirtualThreadTaskRunnerBrokerTest
[INFO] Tests run: 127, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 15.01 s -- in org.apache.activemq.broker.VirtualThreadTaskRunnerBrokerTest
[INFO] 
[INFO] Results:
[INFO] 
[INFO] Tests run: 127, Failures: 0, Errors: 0, Skipped: 0

[DRAFT] Virtual Thread Phase 1 roadmap:

(This PR) Provide hooks to enable Virtual Thread executor to get clock started on non-production environment testing and allow for profiling.
Refactor all Thread Local references (there are only a few, non-impacting)
Refactor Queue/Topic usage of synchronized -> reentrantLock
Replace object monitor mutexes with Condition (not a lot of these either)

Once the above is done, that should provide the basic improvements for Virtual Thread scaling for lots of queues and topics. Once that is done, we can start the heavier refactoring to reduce blocking altogether.

[DRAFT] Virtual Thread Phase 2 roadmap:

Add fast-return on queue iterate() to return when queue size is zero (no work needed)
Refactor other areas that use a separate thread to just use a task or run in the queue iterate() -- expiration checking, producer flow control, etc
Refactor client-side thread handling to do Future -> CompletableFutures. (I think the client-side is especially primed to take advantage of modern JDK threading features.)
Update ConnectionFactory to support injecting a globally shared ExecutorService. So for high-density deployments you could have lots of connection factories using a shared Virtual Thread pool across REST service, other JMS integrations, etc.

cshannon

So I think I have to -1 merging this (into main at least) for now for a few reasons.

I thought the main point was to experiment with future support (as it says tech preview) in a branch...I think it's way too early to be merging anything to do with Virtual threads into the main branch, even if it's off by default. It would be nice to see some testing done for things like performance, etc first and see what other kinds of consequences we will run into. We have no clue how things are going to go as of now with it.
It makes no sense to stick with the same TaskRunner interface. The TaskRunner stuff that's in the broker we should be looking to just get rid of it anyways. It adds a lot of complexity and was added 20+ years ago before all the modern concurrent features for running tasks came about. We should be looking to get rid of that entirely and migrating to the built in Executor support already in Java and using things like Futures and CompletableFutues. So I think that should be step 1.
This piggy backs onto number 2, but the VirtualThreadTaskRunner in this PR is insanely complex and I think it can go away entirely if we refactor and get rid of TaskRunner stuff.

Ultimately, I think this is fine to keep in a branch for testing and experiment for now but it's likely going to need significant changes as i pointed out above, especially because I think the TaskRunner needs to go away. A better spot for this might be to create a branch for it in ActiveMQ repo to share so it can be tested.

cshannon · 2024-03-22T14:16:09Z

I should also add the caveat that while I said in my last comment I think it makes no sense to stick with the TaskRunner interface that there is still a possibility we may ultimately need to. We have a lot of custom code and impl there so we need to really dive into it to see if we can get rid of it or not or if we need to keep parts of it. There's also the possibility that we could refactor it but it's so much work it isn't worth it, but I hope not.

My initial assumption is we should be able to refactor things with modern concurrent features of java and get rid of some of that but obviously TBD. So, I don't think you should delete any of this code as maybe we end up having to use the new virtual thread task runner you created, but since that's TBD and also a tech preview that is being tested, for now I think it's best in another branch for until we figure out the plan.

I think we should take a look into the current implementations and see if they can be removed or improve/simplified as part of this. I can take a look soon.

mattrpav · 2024-03-22T15:16:07Z

@cshannon thanks for taking the time to review and providing the thoughtful feedback.

I agree, the threading model can be improved and leveraging more modern Java constructs would improve performance and maintainability. Overall, I think ActiveMQ is a good candidate for Virtual Threads, because the key locks are already ReentrantLocks and we have NIO on the network layer. There are a few scatter synchronized methods/blocks in queue and topic, but those can be readily refactored.

From my research of other OSS projects' (Tomcat, etc), they took a similar approach as I have proposed here -- getting Virtual Thread executor support as a configurable piece, and then use that to identify hot spots through profiling and end-user testing. From that perspective, I do feel strongly that we should work to get a configurable approach into the hands of end-users so we can get runtime hours in non-production environments. Unfortunately, I think the days of power users testing from a branch are behind us, so if we could work to some sort of compromise where its in the dist, but not guaranteed to not change I think we get the benefit of end-user testing which is really critical for this type of change.

My intent with 'Tech Preview' is to communicate that Virtual Thread support is available for testing, but not guaranteed to remain unchanged. I added the webpage to communicate that as well.

cshannon · 2024-03-22T15:43:31Z

My initial comments of not to include it were because I thought you meant with Tech Preview as it's just for testing, etc and not really intended for users yet. If your intent is for users to try it out and then I think it would be ok to merge if we mark it as experimental/beta in the code itself besides just the documentation.

As I already stated, I want to look at refactoring all of the TaskRunner usage, so it's possible the final result of this looks completely different and I just didn't want to merge something in that was a preview/testing feature that might break users.

Maybe we could add something similar to @beta annotation that Guava has to mark it as a preview feature and subject to breaking changes or removal as we don't know how it will go and I don't want to add it in if we can't get rid of it later.

My guess is by the time we hit AMQ 7.0 it would certainly be considered stable but could be earlier if we did work on the threading.

I suppose the other result of this change is it requires JDK 21 to build going forward for releases which I'm not sure how I feel about. The modules are optional but obviously we need to build them if we plan to release them.

mattrpav · 2024-03-22T16:09:51Z

My initial comments of not to include it were because I thought you meant with Tech Preview as it's just for testing, etc and not really intended for users yet. If your intent is for users to try it out and then I think it would be ok to merge if we mark it as experimental/beta in the code itself besides just the documentation.

Sounds good, I've added additional tasks here and will publish additional testing results.

Maybe we could add something similar to @beta annotation that Guava has to mark it as a preview feature and subject to breaking changes or removal as we don't know how it will go and I don't want to add it in if we can't get rid of it later.

A bit hacky, but we could use @deprecated with text that it is really beta info. Thoughts?

My guess is by the time we hit AMQ 7.0 it would certainly be considered stable but could be earlier if we did work on the threading.

Yep!

cshannon · 2024-03-22T16:40:07Z

I would not use @deprecated as that means something completely different. We should probably just create our own annotation and call it @Experimental or @Beta etc

cshannon · 2024-08-29T11:49:09Z

ARTEMIS-4937 reminded me to ask this question as I forgot, have you tried benchmarking this all or trying to verify we won't run into performance issues with pinning? I know your PR mentions refactoring but it hasn't been done and synchronized blocks are used all over the place. So I assume this will suffer from that for now, but it of course depends if it's a real issue but something we should communicate if it is.

Looks like at least the issue is on it's way to being solved (hopefully for the next LTS release): https://mail.openjdk.org/pipermail/loom-dev/2024-May/006632.html

cshannon · 2024-08-29T12:06:56Z

If testing shows there are issues with performance/pinning (and I'm sure there are) we may want to wait to release this as experimental or not, it would defeat the purpose of releasing it. Or at the very least have a big warning etc.

I personally don't think we should go around and just start messing with replacing a bunch of synchronized blocks just because of this. While we do need to fix the locking and synchronization in the broker, that is a major effort and needs to be well planned and is non-trival and not something that would likely happen in a 6.x release as it would be nice to modernize things for 7.x. And because there's a fix coming in the JDK (hopefully with the 25 LTS release) we could just wait for that.

Other projects like Caffeine are waiting for the fix as well.

mattrpav · 2024-08-29T12:21:24Z

@cshannon agree. Whether it is called “tech preview” or “experimental” is not a big deal to me.

This first pass is about getting some modules so unit/itests and profiling work can begin. Having a module allows other contributors start looking at it as well. This is sufficiently complex to get going, there is no user “accidentally” turns this on in production.

I will post test tool commands and results before merging any of this.

As far as refactoring goes, I agree. I don’t think any wholesale search+replace of synchronized is a safe approach for stability. I was thinking about modernization of the code base, and there are plenty of extension points we could leverage to test new code paths that are more VT-friendly using new impls vs refactor-in-place.

TransportConnector
PersistenceAdapter
QueueFactory (new idea to return different queue impls based on policy entry)
.. etc

cshannon · 2024-08-29T12:49:06Z

The problem here is the stated goals you just listed argue for NOT releasing this. Releases are for delivering to end users, but the goal here is not that. The goals are for profiling, testing, unit tests and for other contributors to get involved.

The more I think about this the more I think it's just a really bad idea to deliver something that is completely untested and not even close to ready and likely has actual problems to end users.

Instead i think a better approach is what we did on Accumulo when we had our long running "elasticity" branch. We can create a branch for this work in repo that way others can contribute and we can work on the feature until it's closer to being ready.

We should create a new branch (call it something like virtual-threads or whatever you want) and use that as the basis of this work.
We can keep it up to date by periodically merging main into the branch (merge only, no rebases as we don't want to break history)
Developers can open up Pull Requests against this new branch for virtual thread changes so that we can work on pieces at a time and merge them in as they are done.
When it's ready, we merge this branch back into main.

cshannon · 2024-08-29T13:14:41Z

So I will say that maintaining another branch can be a bit of a pain so I think the other option I would be ok with if we want to release this now would be to just improve the documentation a lot.

The feature is of course marked as a tech preview and experimental but it's not super clear as to the drawbacks or warnings if you try and use it. This page only currently lists benefits and why you would want to use it so it's kind of almost tempting an end user to want to turn it on prematurely.

So I think maybe the following:

Update the documentation page to talk about more about the current state and how far a long the implementation is. We should make it clear it's just for testing and evaluation and basically define why it's a Tech Preview and experimental.
The page has a benefits section but we could also add a section on potential problems and warnings for now.
Update all the Javadocs for the new classes to give information on potential warnings and pitfalls. They are marked with the experimental annotation but there's not a lot of information as to why.

mattrpav self-assigned this Mar 5, 2024

mattrpav changed the title ~~[AMQ-9394] Tech Preview: Virtual Thread support~~ WIP: [AMQ-9394] Tech Preview: Virtual Thread support Mar 5, 2024

mattrpav mentioned this pull request Mar 5, 2024

WIP: [AMQ-9394] Tech Preview: Virtual Thread support #1121

Closed

6 tasks

jbonofre force-pushed the main branch from cd5c993 to 5446874 Compare March 8, 2024 14:20

mattrpav force-pushed the AMQ-9394 branch 4 times, most recently from b468134 to 121b679 Compare March 18, 2024 14:32

mattrpav changed the title ~~WIP: [AMQ-9394] Tech Preview: Virtual Thread support~~ [AMQ-9394] Tech Preview: Virtual Thread support Mar 18, 2024

mattrpav requested review from jbonofre and cshannon March 18, 2024 14:32

mattrpav marked this pull request as ready for review March 18, 2024 14:33

cshannon requested changes Mar 22, 2024

View reviewed changes

[AMQ-9394] Tech Preview: Virtual Thread support

01a6166

mattrpav force-pushed the AMQ-9394 branch from 121b679 to 7afc9cf Compare March 23, 2024 14:45

mattrpav added 2 commits March 26, 2024 09:09

[AMQ-9394] Add Experimental annotation

053787a

[AMQ-9394] Annotate Virtual Thread support with @experimental

0dd7991

mattrpav force-pushed the AMQ-9394 branch from 7afc9cf to 0dd7991 Compare March 26, 2024 14:10

mattrpav requested a review from cshannon May 31, 2024 19:19

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[AMQ-9394] Tech Preview: Virtual Thread support #1172

[AMQ-9394] Tech Preview: Virtual Thread support #1172

mattrpav commented Mar 5, 2024 •

edited

Loading

cshannon left a comment

cshannon commented Mar 22, 2024

mattrpav commented Mar 22, 2024 •

edited

Loading

cshannon commented Mar 22, 2024

mattrpav commented Mar 22, 2024

cshannon commented Mar 22, 2024

cshannon commented Aug 29, 2024

cshannon commented Aug 29, 2024

mattrpav commented Aug 29, 2024

cshannon commented Aug 29, 2024 •

edited

Loading

cshannon commented Aug 29, 2024

[AMQ-9394] Tech Preview: Virtual Thread support #1172

Are you sure you want to change the base?

[AMQ-9394] Tech Preview: Virtual Thread support #1172

Conversation

mattrpav commented Mar 5, 2024 • edited Loading

cshannon left a comment

Choose a reason for hiding this comment

cshannon commented Mar 22, 2024

mattrpav commented Mar 22, 2024 • edited Loading

cshannon commented Mar 22, 2024

mattrpav commented Mar 22, 2024

cshannon commented Mar 22, 2024

cshannon commented Aug 29, 2024

cshannon commented Aug 29, 2024

mattrpav commented Aug 29, 2024

cshannon commented Aug 29, 2024 • edited Loading

cshannon commented Aug 29, 2024

mattrpav commented Mar 5, 2024 •

edited

Loading

mattrpav commented Mar 22, 2024 •

edited

Loading

cshannon commented Aug 29, 2024 •

edited

Loading