Use binary search to pick bucket. #79
Conversation
Do you have a benchmark for this? Branch misprediction may cause this to be sub-optimal.
Yeah, would be interesting to see at what bucket counts this actually amortizes (very large, I suspect) and what the time overhead is for more normal bucket counts.
Benchmarks are in the code. The difference is within the statistical noise for very low bucket counts. So it is not measurably slower, but one day somebody will want to have 10,000 buckets, and that will be my moment of triumph...
Hmm, in a totally sequential use case like this:

```go
func BenchmarkHistogramSerial(b *testing.B) {
	b.StopTimer()
	s := NewHistogram(HistogramOpts{})
	b.StartTimer()
	for i := 0; i < b.N; i++ {
		s.Observe(<insert value here>)
	}
}
```

For the old code, I get 55ns/op for observing. For the new code, I get 88ns/op, pretty much no matter which value I observe in the bucket range.
Benchmarks are already provided in the code. I ran them and could not see any difference except noise. The reason is probably that the benchmarks in the code do a bit more (many goroutines in parallel, observing values from an array of samples rather than a constant), so more things add to the time that do not depend on the search strategy.

Extrapolating from your result above, the break-even point would be reached at about 40 buckets, which is still quite realistic. We can try that out. I also want to accommodate the case with 100 or 1000 buckets, even if I have to pay a 20ns penalty per observation in cases with few buckets. (We should try that out. Once I'm in...)

We were joking about the death spiral (latencies increase, and now even the time to Observe() those latencies increases...). But if those 20ns really matter, then we really don't want the observe time to depend on the observed value.
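For the record, a sweep over bucket counts could look roughly like this (just a sketch, assuming the usual `testing` and `fmt` imports; the bucket counts, the use of LinearBuckets, and the sub-benchmark layout are illustrative, not part of the existing benchmarks):

```go
// Sketch of a bucket-count sweep to locate the break-even point discussed
// above. The counts and the observed value are illustrative only.
func BenchmarkHistogramBucketCounts(b *testing.B) {
	for _, n := range []int{10, 40, 100, 1000} {
		b.Run(fmt.Sprintf("buckets=%d", n), func(b *testing.B) {
			h := NewHistogram(HistogramOpts{
				Buckets: LinearBuckets(0, 1, n),
			})
			for i := 0; i < b.N; i++ {
				// Observe a value that lands in a middle bucket,
				// the "fairer" case mentioned above.
				h.Observe(float64(n) / 2)
			}
		})
	}
}
```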
So there is actually a pretty "pure" microbenchmark in the code: BenchmarkHistogramObserve1. My results: 27.1 ns/op with linear search, 33.6 ns/op with binary search. If it were only up to me, I'd consider these differences irrelevant in practice and go for binary search just so observe times don't grow linearly with higher bucket counts. If you insist, I will run benchmarks with higher bucket counts (and 'fairer' conditions where most of the observations happen in the middle buckets). I'll find the break-even point and implement a switch to the most efficient search method depending on bucket count. (I just think I have more pressing things to do...)
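Very roughly, the switch I have in mind would look something like this (a minimal sketch assuming the standard sort package is imported; findBucket and the threshold of 40 are placeholders to be validated by benchmarks, not the current implementation):

```go
import "sort"

// findBucket returns the index of the first bucket whose upper bound is >= v,
// or len(upperBounds) for the implicit +Inf bucket. upperBounds must be
// sorted in increasing order. The break-even threshold is a placeholder.
func findBucket(v float64, upperBounds []float64) int {
	const breakEven = 40 // to be determined empirically
	if len(upperBounds) < breakEven {
		// Linear scan: cheap and branch-predictor friendly for few buckets.
		for i, ub := range upperBounds {
			if v <= ub {
				return i
			}
		}
		return len(upperBounds)
	}
	// Binary search: O(log n) comparisons, pays off for many buckets.
	// sort.SearchFloat64s returns the smallest index i with upperBounds[i] >= v.
	return sort.SearchFloat64s(upperBounds, v)
}
```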
Agreed, that's something we can do later. Let's just merge this for now, but keep in mind it's something that can be optimized later, if needed. 👍
I'll add a TODO for that.
I couldn't resist and ran a benchmark (BenchmarkHistogramNoLabels, which is almost exactly like Julius's benchmark above).
With the usual number of buckets, this doesn't really make a difference, but it should scale... See the added TODO for the precise numbers.