
[WIP] Use DefaultBlockingPool for Global Processing Pool instead of StupidPool, which can allocate an arbitrary number of buffers and cause crashes. #5345

Closed
himanshug wants to merge 3 commits

Conversation

himanshug
Contributor

@himanshug himanshug commented Feb 5, 2018

Fixes #5319

Also, the following TODO remains:

TODO: check if hardcoded timeout of 1 minute in all places is OK.

@@ -188,7 +188,7 @@ protected Integer addToFacts(
           lastBuffer.capacity() - bufferOffset >= aggsTotalSize) {
         aggBuffer = lastBuffer;
       } else {
-        ResourceHolder<ByteBuffer> bb = bufferPool.take();
+        ResourceHolder<ByteBuffer> bb = bufferPool.takeOrFailOnTimeout(60000);
Contributor

Is this competing with the processing pool? That means you could accidentally choke out processing threads while incremental indexing is going on.

Contributor Author

Yes, that would be true.
I think it is more of a limitation of the current OffheapIncrementalIndex implementation, which cannot work with a fixed amount of resources and keeps allocating more and more buffers. Also, this implementation keeps dimensions etc. on-heap, so it doesn't really serve the purpose of being off-heap; things become too slow if dimensions are pushed off-heap, due to repeated serde.
FWIW, for the above reasons, no one actually uses the current OffheapIncrementalIndex implementation, and we could possibly remove it.
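
For readers following the discussion, the contrast being drawn is between an unbounded pool that allocates a new buffer whenever it runs dry and a blocking pool that owns a fixed number of buffers and makes callers wait. Below is a simplified sketch of the bounded variant; it is not Druid's actual StupidPool or DefaultBlockingPool, and the class and method names are illustrative assumptions.

import java.nio.ByteBuffer;
import java.util.concurrent.ArrayBlockingQueue;
import java.util.concurrent.BlockingQueue;
import java.util.concurrent.TimeUnit;

// Illustrative bounded pool: total direct memory is fixed up front, and a caller that
// cannot get a buffer within the timeout fails instead of triggering a new allocation.
class BoundedBufferPool {
  private final BlockingQueue<ByteBuffer> pool;

  BoundedBufferPool(int numBuffers, int bufferSizeBytes) {
    pool = new ArrayBlockingQueue<>(numBuffers);
    for (int i = 0; i < numBuffers; i++) {
      pool.add(ByteBuffer.allocateDirect(bufferSizeBytes));
    }
  }

  ByteBuffer takeOrFailOnTimeout(long timeoutMs) throws InterruptedException {
    ByteBuffer buf = pool.poll(timeoutMs, TimeUnit.MILLISECONDS);
    if (buf == null) {
      throw new RuntimeException("Timed out after " + timeoutMs + " ms waiting for a pooled buffer");
    }
    return buf;
  }

  void giveBack(ByteBuffer buf) {
    buf.clear();
    pool.offer(buf);
  }
}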

Contributor Author

I'm considering removing OffheapIncrementalIndex in this patch.

Contributor Author

Also, it was only ever used by the GroupBy v1 implementation when explicitly configured; it was not actually possible to use OffheapIncrementalIndex for indexing.

-      //check that stupid pool gives buffers that can hold at least one row's aggregators
-      ResourceHolder<ByteBuffer> bb = bufferPool.take();
+      //check that buffer pool gives buffers that can hold at least one row's aggregators
+      ResourceHolder<ByteBuffer> bb = bufferPool.takeOrFailOnTimeout(60000);
Contributor

This can also choke the processing pool, right?

@himanshug himanshug added this to the 0.13.0 milestone Feb 9, 2018
+   *
+   * @return a resource, or throw RuntimeException on timeout.
+   */
+  default ReferenceCountingResourceHolder<T> takeOrFailOnTimeout(long timeoutMs)
Contributor

Should this throw a checked Timeout exception of some kind?
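
For context, one way such a default method could be written on the pool interface, sketched under the assumption that the pool already exposes a timeout-aware take that returns null on timeout; that primitive, the Holder type, and the unchecked exception are illustrative assumptions rather than the actual patch.

// Hypothetical sketch only; not Druid's actual BlockingPool API.
interface TimedPool<T>
{
  // Assumed primitive: blocks up to timeoutMs, returns null if no resource became available.
  Holder<T> take(long timeoutMs);

  default Holder<T> takeOrFailOnTimeout(long timeoutMs)
  {
    final Holder<T> holder = take(timeoutMs);
    if (holder == null) {
      // The question above: a checked TimeoutException would force callers to handle
      // pool exhaustion explicitly, instead of an unchecked RuntimeException like this.
      throw new RuntimeException("Timed out waiting " + timeoutMs + " ms for a pooled resource");
    }
    return holder;
  }

  interface Holder<T> extends AutoCloseable
  {
    T get();

    @Override
    void close();
  }
}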

       new OffheapBufferGenerator("intermediate processing", config.intermediateComputeSizeBytes()),
-      config.getNumThreads(),
-      config.poolCacheMaxCount()
+      config.getNumThreads()
Contributor

Does this mean poolCacheMaxCount needs to be removed from the docs?
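
For readers following along, the practical effect of the diff above is that the processing pool is pre-sized to the thread count instead of growing on demand. A trivial sketch of the resulting bound follows; the numbers are made-up stand-ins for config.getNumThreads() and config.intermediateComputeSizeBytes(), not defaults.

public class ProcessingPoolBound {
  public static void main(String[] args) {
    // Stand-ins for config.getNumThreads() and config.intermediateComputeSizeBytes().
    int numThreads = 7;
    long bufferSizeBytes = 512L * 1024 * 1024;

    // With a blocking pool of exactly numThreads buffers, this is the upper bound on
    // direct memory held by the global processing pool, rather than an open-ended amount.
    long maxDirectBytes = numThreads * bufferSizeBytes;
    System.out.println("Processing pool capped at " + maxDirectBytes + " bytes");
  }
}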

@jihoonson
Contributor

Hi guys, I don't think this should be a blocking issue for the 0.13.0 release. I'll remove the milestone.

@jihoonson jihoonson removed this from the 0.13.0 milestone Sep 17, 2018
@stale

stale bot commented Feb 28, 2019

This pull request has been marked as stale due to 60 days of inactivity. It will be closed in 1 week if no further activity occurs. If you think that’s incorrect or this pull request requires a review, please simply write any comment. If closed, you can revive the PR at any time and @mention a reviewer or discuss it on the dev@druid.apache.org list. Thank you for your contributions.

@stale stale bot added the stale label Feb 28, 2019
@stale

stale bot commented Mar 7, 2019

This pull request has been closed due to lack of activity. If you think that is incorrect, or the pull request requires review, you can revive the PR at any time.

@stale stale bot closed this Mar 7, 2019