Backport CASSANDRA-10340 and CASSANDRA-13038 #27

Merged 3 commits on Apr 10, 2017

@wulczer wulczer commented Apr 10, 2017

These two changes should make streaming operations faster, which would speed up bootstrapping and node moves.

mebigfatguy and others added 3 commits April 10, 2017 12:50
In TTL-heavy use cases (especially tables with a default time to live set), the
streaming histograms calculated during compaction and streaming are very inefficient.

This patch addresses that in two ways:
1) It creates a system property -Dcassandra.streaminghistogram.roundseconds=60,
and rounds all histograms to the next highest multiple of that value, and
2) Rather than maintaining a histogram of 100 bins that have to be merged
on every new value, we keep a temporary spool of 100000 bins, and merge
down to the 100 bin final histogram only when the temporary spool overflows.

Patch by Jeff Jirsa; Reviewed by Nate McCall for CASSANDRA-13038

Upstream commit: a5ce963
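
For readers unfamiliar with the approach, the two changes described in the CASSANDRA-13038 commit message can be sketched roughly as follows. This is an illustrative Python model, not the Java code from the patch: the class name `SpooledHistogram`, its method names, and the weighted-average merge of the two closest bins are assumptions made for the example. Only the defaults (60-second rounding, 100 bins, a spool of 100000 entries) come from the commit message.

```python
class SpooledHistogram:
    """Hypothetical model of a rounded, spooled streaming histogram."""

    def __init__(self, max_bins=100, max_spool=100_000, round_seconds=60):
        self.max_bins = max_bins
        self.max_spool = max_spool
        self.round_seconds = round_seconds
        self.bins = {}   # final histogram: point -> count
        self.spool = {}  # temporary buffer: rounded point -> count

    def _round_up(self, point):
        # Round up to the next multiple of round_seconds
        # (assumption: exact multiples are left unchanged).
        rem = point % self.round_seconds
        return point if rem == 0 else point + (self.round_seconds - rem)

    def update(self, point, count=1):
        # Cheap path: bump a counter in the spool, no merging.
        key = self._round_up(point)
        self.spool[key] = self.spool.get(key, 0) + count
        if len(self.spool) > self.max_spool:
            self.flush()

    def flush(self):
        # Expensive path: fold spooled points into the bounded histogram,
        # merging down whenever it exceeds max_bins.
        for point, count in sorted(self.spool.items()):
            self.bins[point] = self.bins.get(point, 0) + count
            if len(self.bins) > self.max_bins:
                self._merge_closest()
        self.spool.clear()

    def _merge_closest(self):
        # Replace the two closest adjacent bins with their weighted average.
        points = sorted(self.bins)
        i = min(range(len(points) - 1), key=lambda j: points[j + 1] - points[j])
        p1, p2 = points[i], points[i + 1]
        k1, k2 = self.bins.pop(p1), self.bins.pop(p2)
        merged = (p1 * k1 + p2 * k2) / (k1 + k2)
        self.bins[merged] = self.bins.get(merged, 0) + k1 + k2
```

In this model, rounding collapses nearby expiration times into the same key, so most updates only increment a counter in the spool. The costlier merge step runs only when the spool overflows or `flush()` is called explicitly.
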
patch by Stefan Podkowinski; reviewed by jasobrown for CASSANDRA-13040

Upstream commit: b8ed255

wulczer commented Apr 10, 2017

cc @derekroller @intjonathan

@intjonathan intjonathan merged commit 4797548 into newrelic-3.7 Apr 10, 2017
@wulczer wulczer deleted the jan/backport-10340-and-13038 branch April 10, 2017 17:57