
Use an optimized crc32 library which is faster #527

Merged 1 commit into master on Aug 31, 2015
Conversation

@eapache (Contributor) commented Aug 31, 2015

Needs testing to ensure that this is actually faster (and is still correct), but may solve #255.

@wvanbergen (Contributor)

We may also want to check which architectures this will run on, because it uses CPU instructions that may not be available everywhere, and I don't know if it provides fallbacks.

@eapache (Contributor, author) commented Aug 31, 2015

At a quick glance it uses `+build` tags and implements the stdlib version as a fallback, so we should be OK.

@wvanbergen (Contributor)

OK, let's run some benchmarks!

@eapache (Contributor, author) commented Aug 31, 2015

My benchmark shows no difference; possibly Vagrant does not expose the necessary instructions. Since Go profiling is broken on native OS X, we may have to run something on one of our servers.

@eapache (Contributor, author) commented Aug 31, 2015

Even on a bare-metal server whose /proc/cpuinfo says it supports SSE4.2, I see the same amount of time spent in the fallback method... not sure what else I need to do to actually make the optimizations run.

@eapache (Contributor, author) commented Aug 31, 2015

Welp, found a bug in our code now when benchmarking. Another PR incoming.

@eapache (Contributor, author) commented Aug 31, 2015

Short version: this basically removes CRC32 from the profile entirely; it drops from 15-20% of the CPU time in my test to a statistically insignificant number of samples. Sold.

Also, the upstream author has submitted this to Go, so there's a very good chance Go 1.6 will have this optimization in the stdlib.

eapache added a commit that referenced this pull request Aug 31, 2015
Use an optimized crc32 library which is faster
@eapache eapache merged commit 2d4c75a into master Aug 31, 2015
@eapache eapache deleted the crc32-optimization branch August 31, 2015 16:37
@wvanbergen (Contributor)

👏

eapache added a commit that referenced this pull request Aug 31, 2015

I discovered a "send on closed channel" panic in the consumer while testing #527,
which I was finally able to track down. If a partition takes a long time to
drain to the user, the responseFeeder reclaims its ownership token from
the broker so that the broker doesn't block its other partitions. However, if
the user closes the PartitionConsumer (closing the dying channel), the
brokerConsumer unconditionally returns the ownership token to the dispatcher
even if the responseFeeder is holding it. This results in two ownership tokens
for the same partition (one in the feeder, one in the dispatcher), which leads to
all sorts of subtle brokenness. It manifested in at least two different "send on
closed channel" backtraces depending on the exact timing, and possibly more.

To fix, move the check on `child.dying` to the top of the `subscriptionConsumer`
loop, where we are guaranteed to have the ownership token. Combine that check
with the 'new subscriptions' check into an `updateSubscriptions` helper method.
The diff is huge because this lets us drop an indentation level in
`handleResponses`; I suggest reviewing with `w=1` to ignore whitespace.