
perf: remove union-by-size for lower memory and better performance #17

Merged · 1 commit into master · Jun 11, 2019

Conversation

william-silversmith

Related issues #15 #16

I'd like to understand why this is faster for the images I've attempted this on before merging.

@william-silversmith william-silversmith added the performance Lower memory or faster computation. label Jun 10, 2019
@william-silversmith william-silversmith self-assigned this Jun 10, 2019
william-silversmith commented Jun 10, 2019

import cc3d
import numpy as np
import time

for i in range(10):
  # np.bool is removed in modern NumPy; plain bool works in all versions
  labels = np.random.randint(0, 2, size=(512, 512, 512), dtype=bool)

  start = time.time()
  labels = cc3d.connected_components(labels)
  print(time.time() - start, "sec")

[Figure: running cc3d against random binary 512x512x512 images, 10 trials. (black) cc3d 1.2.0 (blue) without union-by-size (red) scipy]
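The scipy curve in these plots isn't accompanied by code; presumably it's `scipy.ndimage.label`. A comparable timing loop might look like the sketch below (a 64³ volume here instead of 512³, purely for illustration):

```python
import time
import numpy as np
from scipy import ndimage

# random binary volume, analogous to the cc3d benchmark above
labels = np.random.randint(0, 2, size=(64, 64, 64), dtype=bool)

start = time.time()
# ndimage.label defaults to 6-connectivity in 3D
out, num = ndimage.label(labels)
print(time.time() - start, "sec;", num, "components")
```

Note that `ndimage.label` only handles binary input, so this only mirrors the first (boolean) benchmark, not the multi-label connectomics test below.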

william-silversmith commented Jun 10, 2019

Here's a different test on 512x512x512 uint32 connectomics data. 10 iterations of each algorithm.

[Figure: (black) cc3d 1.2.0 (blue) without union-by-size (red) scipy (which treats the entire volume as a single label...)]

@william-silversmith

I wonder if this is an L1/L2 cache effect. If the size array occupies fewer cache lines, there's more room in cache for the labels themselves.

@william-silversmith

It looks like scipy uses a little more than 128 MB (input uint8 image) + 512 MB (output int32 image). This would be hard to beat without a sparse representation of the equivalence table.
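The 128 MB + 512 MB estimate follows directly from the array sizes:

```python
# Back-of-envelope check of the memory figures quoted above
voxels = 512 ** 3
input_mb = voxels * 1 / 2 ** 20   # uint8 input: 1 byte per voxel
output_mb = voxels * 4 / 2 ** 20  # int32 output: 4 bytes per voxel
print(input_mb, "MB input;", output_mb, "MB output")  # 128.0 MB input; 512.0 MB output
```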

@william-silversmith

For example, here's how scipy does on np.arange(512**3) + 1 reshaped into a 512x512x512 volume:

[Figure: scipy results on this worst-case input]
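This input gives every voxel its own nonzero label, so no two voxels ever merge. A sketch of the construction (64³ here rather than 512³, just to keep it small):

```python
import numpy as np

n = 64  # the thread uses 512; smaller here for illustration
labels = (np.arange(n ** 3, dtype=np.uint32) + 1).reshape(n, n, n)
# every voxel is distinct, so a labeling pass can merge nothing
```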

@william-silversmith

I don't have a fantastic explanation, but this seems faster on all the examples I'm throwing at it so... 🤷‍♂️

@william-silversmith william-silversmith merged commit 47c5716 into master Jun 11, 2019
@william-silversmith william-silversmith deleted the wms_drop_union_by_size branch June 11, 2019 01:49
@william-silversmith

I think I might have an explanation for why this is okay. Without union-by-rank or union-by-size, the worst-case Big-O for "find" in union-find is worse, but that worst case is repaired by path compression after each access. Since we typically process the same label many times in an image, the first access may be slow, but amortized over many accesses it is fast. In that regime, the union-by-size / union-by-rank bookkeeping simply adds constant overhead and pollutes the cache.
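The argument above can be made concrete with a minimal sketch (my own illustration, not the cc3d implementation): union-find with path compression but without union-by-size/rank. Repeated finds on the same label flatten the tree, so amortized cost stays low even though a single first access can be slow.

```python
class UnionFind:
    """Union-find with path compression only; no size/rank bookkeeping."""

    def __init__(self, n):
        self.parent = list(range(n))

    def find(self, x):
        # first pass: locate the root
        root = x
        while self.parent[root] != root:
            root = self.parent[root]
        # second pass (path compression): point every node on the path at the root
        while self.parent[x] != root:
            self.parent[x], x = root, self.parent[x]
        return root

    def union(self, x, y):
        rx, ry = self.find(x), self.find(y)
        if rx != ry:
            self.parent[ry] = rx  # no size/rank comparison or extra array

uf = UnionFind(6)
uf.union(0, 1); uf.union(1, 2); uf.union(3, 4)
print(uf.find(2) == uf.find(0))  # True: same component
```

Dropping the size array also halves the memory of the equivalence structure, which matches the cache argument earlier in the thread.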
