
ARTEMIS-6009 Performance improvement when consuming large messages#6369

Merged
jbertram merged 2 commits into apache:main from AntonRoskvist:ARTEMIS-6009
Apr 21, 2026

Conversation

@AntonRoskvist
Contributor

The current solution reads the message payload using Java's default OutputStream implementation, which iterates over the given byte array and calls the single-byte write method once per byte.

This change instead passes the byte array as-is through the ActiveMQOutputStream associated with the large message, into its ActiveMQBuffer.
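A minimal sketch of the pattern described above, with hypothetical names (in the real change the stream is ActiveMQOutputStream and the sink is an ActiveMQBuffer; here a plain ByteBuffer stands in for the buffer):

```java
import java.io.OutputStream;
import java.nio.ByteBuffer;

// Sketch only: ChunkForwardingOutputStream and its ByteBuffer sink are
// stand-ins, not Artemis classes. The point is the bulk override below.
class ChunkForwardingOutputStream extends OutputStream {
    private final ByteBuffer buffer; // stand-in for the large message's ActiveMQBuffer

    ChunkForwardingOutputStream(int capacity) {
        this.buffer = ByteBuffer.allocate(capacity);
    }

    @Override
    public void write(int b) {
        buffer.put((byte) b); // single-byte fallback path, still supported
    }

    // The fix: forward the whole chunk in one call, analogous to
    // ActiveMQBuffer#writeBytes(byte[], int, int), instead of letting
    // OutputStream's default implementation call write(int) per byte.
    @Override
    public void write(byte[] b, int off, int len) {
        buffer.put(b, off, len);
    }

    byte[] toByteArray() {
        byte[] out = new byte[buffer.position()];
        buffer.flip();
        buffer.get(out);
        return out;
    }
}
```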

I'm seeing a "real world" performance improvement of about 170% for a client handling exclusively large messages.

I'm not quite sure how I go about writing a meaningful test for this, any feedback on that would be greatly appreciated.

@jbertram
Contributor

jbertram commented Apr 19, 2026

Looking at the implementation of org.apache.activemq.artemis.api.core.ActiveMQBuffer#writeBytes(byte[], int, int) versus java.io.OutputStream#write(byte[], int, int) I can see why the former would be faster (i.e. since it's writing in chunks instead of each individual byte).
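The per-byte behavior described above is easy to see in isolation. The stand-alone class below (not Artemis code) overrides only write(int); the inherited java.io.OutputStream#write(byte[], int, int) then falls back to calling it once per byte:

```java
import java.io.IOException;
import java.io.OutputStream;

// Only write(int) is overridden, so the write(byte[], int, int)
// inherited from java.io.OutputStream calls it once per byte --
// exactly the behavior this PR avoids on the large-message path.
class PerByteCountingStream extends OutputStream {
    int singleByteCalls = 0;

    @Override
    public void write(int b) throws IOException {
        singleByteCalls++;
    }
}
```

Writing an 8 KiB chunk through such a stream produces 8192 separate write(int) calls, which is why a bulk writeBytes-style override pays off for large payloads.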

It might be worth having a JMH test (see tests/performance-jmh). If you put the test into its own commit, folks can cherry-pick it, run it on branches with different implementations, and compare the results. It would be great if you could summarize the results here, if possible.

Aside from that, regression tests would probably suffice.

@AntonRoskvist
Contributor Author

Thanks @jbertram,

I've never worked with JMH previously so that might take some time to get in place...

In the meantime, I have some additional figures from what I'm seeing when testing this change:

In a "real world" scenario, i.e. broker and client running on dedicated servers, communicating over a network:
Without PR: Client can process an average of 673 msgs/s, running on 100% CPU.
With PR: Client can process an average of 2200 msgs/s, using ~50% CPU.
(current bottleneck there is defined limit on network utilization from the cloud provider)

I've also set up and run a test locally, using a standalone broker (2.53.0), default configuration, with 300k messages preloaded like this:
bin/artemis producer --message-count 60000 --text-size 600000 --destination queue://LARGE.MSG.LOAD --threads 5 --url 'tcp://localhost:61616?compressLargeMessage=true'

Messages are then consumed by a "cli consumer" using either broker release 2.53.0 or a version built on top of this PR.

Messages in this test are compressed to save on storage space.

This is the command used to consume messages:
bin/artemis consumer --message-count 60000 --destination queue://LARGE.MSG.LOAD --threads 5

Results:
Without PR: Consumer finishes in 1316 seconds, averaging 228 msgs/s
With PR: Consumer finishes in 79 seconds, averaging 3797 msgs/s

I collected flame graphs from the local tests which I have added here:
org_flamegraph.html
pr_flamegraph.html

@tabish121
Contributor

The change makes sense and maps to how this is normally handled in other areas of the broker code as well. You would normally override those built-in stream methods, since the default implementation simply calls the single-byte write method in a loop, which is quite inefficient.

@AntonRoskvist
Contributor Author

Also, when I said: "I'm not quite sure how I go about writing a meaningful test for this, any feedback on that would be greatly appreciated."

I meant that I'm not quite sure how to go about writing a regression test to validate the new behavior... at least not without relaxing access to the ActiveMQOutputStream and adding some absolute spaghetti around it. I also tried using Mockito but gave up, as I simply could not get it to work properly.

I'll keep trying, but if anyone has an idea, it's probably better than what I'm currently piecing together.

@jbertram
Contributor

@AntonRoskvist those results are compelling! Nice work.

Previously when I said, "...regression tests would probably suffice," I meant that existing tests would probably suffice for detecting regressions.

Ultimately, if you provide a way for folks to independently verify the performance improvement and the existing test-suite is green then I think that's sufficient.

@tabish121
Contributor

I ran this through CI and all tests are passing. The commit message does not contain the related JIRA which needs to be fixed.

@AntonRoskvist changed the title from "Performance improvement when consuming large messages" to "ARTEMIS-6009 Performance improvement when consuming large messages" on Apr 21, 2026
@AntonRoskvist
Contributor Author

@jbertram @tabish121 thanks!

I've been unable to put together a decent JMH test for this, so I instead added a very simple test under "soak-tests" in a secondary commit. I'm very open to excluding that unless you feel it adds some value.

If nothing else it should serve as a simple way to try this out for yourselves.

@jbertram
Contributor

@AntonRoskvist I used your test on main and I got around 39s and on your branch it was around 9s. Given the test-suite is green I'm merging this. Thanks!

@jbertram jbertram merged commit ae17380 into apache:main Apr 21, 2026
6 checks passed
@clebertsuconic
Contributor

@AntonRoskvist / @jbertram I'm replacing the soak test with a MockedTest.

The soak test was taking 3 minutes on my laptop and 2 minutes on the CI.

The MockedTest I wrote validates that write(byte[], int, int) was used.
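The verification idea can be sketched without a mocking framework: a recording stream counts which overload was invoked, so an assertion can confirm the bulk path is taken. This is a stand-in illustration, not the actual MockedTest from the replacement PR:

```java
import java.io.IOException;
import java.io.OutputStream;

// Hand-rolled stand-in for the mocked stream: records which write
// overload is called so a test can assert the bulk path was used.
// The actual replacement test uses a mocking framework for this check.
class RecordingOutputStream extends OutputStream {
    int bulkWrites = 0;
    int singleByteWrites = 0;

    @Override
    public void write(int b) throws IOException {
        singleByteWrites++;
    }

    @Override
    public void write(byte[] b, int off, int len) throws IOException {
        bulkWrites++;
    }
}
```

With the PR's change in place, writing a payload through the stream should register one bulk write and zero single-byte writes.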

@clebertsuconic
Contributor

PR sent here to replace test: #6385

