
ARTEMIS-1248 Reduce garbage while Paging #1361

Closed

Conversation

@franz1981 (Contributor)

No description provided.

@clebertsuconic (Contributor)

@franz1981 if you were using a thread local on activeMQBuffer and unsafeByteBufWrapper I would agree with you on saving GC.

however... a Page won't last long in a regular scenario. The Page is served through PageCursorProviderImpl::getPageCache(), so it will basically be read only once; and if the cache on the PageCursorProviderImpl goes beyond its limit, the whole Page object goes away.

So you won't ever read it twice: by the time you get the opportunity to reuse the field you're caching, it will probably belong to a different object.

So I would say either use a ThreadLocal or keep it the way it is, since the objects you are caching will be GCed along with the released Page when the cache is full anyway.
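
For reference, a minimal sketch of the ThreadLocal alternative suggested here (the names are hypothetical, not Artemis code): each thread owns a single long-lived wrapper and simply re-points it at the ByteBuffer of the current Page, so nothing is lost when PageCursorProviderImpl evicts a Page from its cache.

```java
import java.nio.ByteBuffer;

// Hypothetical sketch: one wrapper per thread, surviving any single Page.
final class ReusableWrapper {

   private static final ThreadLocal<ReusableWrapper> CACHE =
      ThreadLocal.withInitial(ReusableWrapper::new);

   private ByteBuffer target;

   static ReusableWrapper of(ByteBuffer buffer) {
      ReusableWrapper wrapper = CACHE.get(); // one instance per thread, ever
      wrapper.target = buffer;               // retargeting only: no allocation
      return wrapper;
   }

   int readInt(int index) {
      return target.getInt(index);
   }
}
```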

@franz1981 (Contributor, Author)

@clebertsuconic Yep, agreed on the read side: I've done it just for completeness, but the biggest (measured) gain is on the write path. I've also changed the write sequence to be purely sequential in order to avoid cache misses.
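
To illustrate what "purely sequential" could mean here, a hypothetical encoding sketch (not the actual Page code, and the interpretation is an assumption): the size is known up front, so header and body are written in one forward pass instead of writing the body first and jumping back to patch a length slot.

```java
import io.netty.buffer.ByteBuf;

// Hypothetical sketch of a purely sequential encoding: length first,
// payload second, one forward pass of the writer index.
final class SequentialEncoding {

   static void encode(ByteBuf out, byte[] body) {
      out.writeInt(body.length);  // header written before the payload...
      out.writeBytes(body);       // ...so the writer index only moves forward
      // A back-patching variant (reserve 4 bytes, write the body, then
      // out.setInt(start, body.length)) revisits memory the CPU has
      // already streamed past, which is the kind of access the change
      // above tries to avoid.
   }
}
```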

@clebertsuconic (Contributor)

@franz1981 I see.. you could still use a ThreadLocal there.. right?

also: Why don't you use a PooledBuffer on the write?

@franz1981 (Contributor, Author) commented Jun 23, 2017

@clebertsuconic There is no need to use a ThreadLocal or a pooled buffer there because:

  1. the method (write/read) is synchronized, so it is safe to reuse the same ByteBuf wrapper between different threads (note: it is only a wrapper around the ByteBuffer provided by the file factory)
  2. the file factory already provides a (thread-local) pooled direct ByteBuffer (on NIO)

In the old version most of the garbage was already avoided thanks to the file factory's thread-local ByteBuffer pooling, but after multiple Page writes I've measured lots of minor GCs due to the many short-lived ActiveMQBuffer instances. This change reduces those instances to just one per Page: not optimal, but a big improvement.
With 100-byte messages and a 10 MB Page it avoids about 104856 instances of garbage.
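
The arithmetic checks out: a 10 MB page holds roughly 10 * 1024 * 1024 / 100 ≈ 104,857 messages of 100 bytes, and previously each message allocated its own short-lived wrapper. A minimal sketch of the pattern being described, under assumed names (the real code lives in Page and the ActiveMQBuffer wrappers):

```java
import io.netty.buffer.ByteBuf;
import io.netty.buffer.Unpooled;
import java.nio.ByteBuffer;

// Sketch with hypothetical names: one wrapper per Page instead of one
// per message. 'synchronized' already serializes writers, so the single
// shared wrapper is safe across threads.
final class PageSketch {

   private ByteBuf wrapper; // lazily created once, reused for every message

   synchronized void write(ByteBuffer fileFactoryBuffer, byte[] encodedMessage) {
      if (wrapper == null) {
         // wraps the (thread-locally pooled) direct buffer handed out by
         // the file factory; the wrapper itself owns no byte[] or off-heap
         // memory (assumes the factory keeps handing back this same
         // page-sized buffer for the lifetime of the Page)
         wrapper = Unpooled.wrappedBuffer(fileFactoryBuffer);
      }
      wrapper.clear();                    // reset indices, no allocation
      wrapper.writeBytes(encodedMessage); // sequential append
   }
}
```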

@michaelandrepearce (Contributor)

I agree, this makes sense to me. @franz1981 are there before/after VM GC profiling graphs showing the drop in object creation and, likewise, the drop in garbage collections? That would be the easiest proof that this change is beneficial. (We expect it to be, btw.)

@franz1981 (Contributor, Author)

@michaelandrepearce @clebertsuconic
You're right: these are the results of a test writing 100 Pages of 10MB with messages of 100 bytes.
Just 2 notes:

  • I've used the Artemis JVM default settings: -XX:+PrintClassHistogram -XX:+UseG1GC -XX:+AggressiveOpts -XX:+UseFastAccessorMethods -Xms512M -Xmx2G.
    With a smaller heap the effect is even more pronounced.
  • I've forced a GC at the very beginning to start with a clean heap.

Old version: [screenshots: unpooled, unpooled_gc]

New version: [screenshots: pooled, pooled_gc]
wdyt?

@michaelandrepearce (Contributor)

Looks really good. The test is quite short time-wise; do you have an equivalent for, say, a 30-minute to one-hour period?

@franz1981 (Contributor, Author)

@michaelandrepearce I can run it for that extended period of time, but consider that this one is an isolated test with only page writes, hence:

  • no Page reads are performed
  • no messages are collected in the LivePageCache or in the StorageManager (for replication)
  • no networking/protocol management is polluting the heap with garbage

This is necessary to show the isolated impact of this little change: with the whole broker you risk running out of disk space if no Page reads are involved.

@michaelandrepearce (Contributor)

I think it's worthwhile (agreed, the short one shows the improvement in isolation), but in general people run brokers as long-running concerns, so the change should be judged in that light.

@clebertsuconic (Contributor)

@franz1981 Page itself is short-lived: a new one is created every time we move to a different file. I was looking for a way to have the wrapper live beyond the existence of a Page, ready for the next write on the next Page.

BTW, I was wondering whether the write operation should live on Page itself or on the StorageManager.

Also, I wasn't aware of the change you made to the FileFactory in May for the ThreadLocal buffer. I think we should use Pooled instead... it would be more appropriate I think, but that's a different discussion.

@franz1981 (Contributor, Author)

@clebertsuconic Having only one instance would be optimal, but considering that it is a simple wrapper with no byte[] or off-heap memory allocated inside it, just avoiding ~100K allocations is a good compromise IMHO.

About the changes on the NIO file factory: I've already tried (and measured) a solution with the Netty pools before pushing that version, but I found it not to be a good fit in this case, for several reasons (see the sketch below):

  1. it is slower compared to the TLAB solution I've implemented (the pool is not shared across threads)
  2. it grows the overall memory footprint and can be a cause of leaks
  3. it doesn't work nicely with "plain" ByteBuffers, forcing the use of ByteBuf (not a good idea for a normal file API)
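
For contrast, a rough sketch (assumed names, not the actual file-factory code) of the thread-local pooling being defended here: it stays on plain java.nio.ByteBuffer and never crosses threads, which is points 1 and 3 in a nutshell.

```java
import java.nio.ByteBuffer;

// Assumed-name sketch of per-thread direct-buffer reuse: no shared pool
// to contend on, no ByteBuf type forced onto the file API.
final class ThreadLocalBufferPool {

   private static final ThreadLocal<ByteBuffer> CACHED =
      ThreadLocal.withInitial(() -> ByteBuffer.allocateDirect(0));

   static ByteBuffer borrow(int requiredCapacity) {
      ByteBuffer buffer = CACHED.get();
      if (buffer.capacity() < requiredCapacity) {
         buffer = ByteBuffer.allocateDirect(requiredCapacity); // grow once, keep
         CACHED.set(buffer);
      }
      buffer.clear();                  // position = 0, limit = capacity
      buffer.limit(requiredCapacity);  // expose only what was asked for
      return buffer;                   // must stay on the calling thread
   }
}
```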

@franz1981 (Contributor, Author)

@michaelandrepearce Yes, agreed... but in that case the internal queues of the thread-pool executor services (plus the capturing Runnables), the protocol handling, and Netty will probably be the biggest sources of garbage from the GC's point of view. Anyway, I'll give it a try just to find out the current state of things and how it would impact that scenario... but I don't know if I have enough disk to run a 30-minute test :P

@clebertsuconic (Contributor)

@franz1981 did you run the whole testsuite on this?

@franz1981 (Contributor, Author)

@clebertsuconic not yet! I'll run it now if it is free 👍

@franz1981 force-pushed the pooled_activemq_buffer_on_page branch from 13bdfa5 to 6ca9e4c on June 23, 2017 at 14:28
@asfgit closed this in b640a62 on Jun 23, 2017
@clebertsuconic (Contributor)

@franz1981 I merged this by accident... will leave it be now :)

running the testsuite just to make sure it's ok

@franz1981 deleted the pooled_activemq_buffer_on_page branch on October 27, 2017 at 07:46