Proposal: Include histogram in JSON output #394

fxkr · 2019-04-23T05:37:51Z

Proposal

Add histogram data to the vegeta report -type json output, e.g.:

# vegeta report -type json -hist '[0,5ms,10ms,15ms]'
  ...
  "hist": [123, 125, 40, 80],
  ...

I'd be happy to write the code, but I'd like to run some design decisions by you first:

- Key name: no strong preference. hist matches the CLI.
- List (preferred) or map: a list guarantees ordering and raises no questions about the key format ("min", "max", "min-max"), at the cost of being less self-descriptive than a map.
- Values in milliseconds.
- Bin configuration:
  - (preferred) As a separate command-line argument (-type json -hist [0,5ms,10ms,15ms])
  - As a subsequent map (-type json'{"hist":["0","5ms","10ms","15ms"]}')
  - As a subsequent array (-type json'[0,5ms,10ms,15ms]'). This is similar to hist, but I don't like it because it isn't extensible at all, and I don't want to have to deal with breaking changes later.
- Either way, I'd default to omitting histogram data when no bin configuration is given.

Automatic logarithmic binning would be kinda neat (I love this in bcc), but I see it as out of scope.

Do you think that would be useful?

The main concern I have is that I am not 100% sure if recording the histogram is the best idea. Having just the three percentiles that vegeta puts in the JSON right now is too coarse, but we could just output more of those instead. I wonder if you have any thoughts on recording histograms vs (many points of the) CDF?

Background

I want to automatically benchmark a piece of software as part of a CI/CD pipeline and record the results to track how it changes over time. I want to record more than just the percentiles because the latency distribution can be strongly multimodal.

For example, depending on the concrete test, we can hit (and possibly thrash) or bypass certain caches at different layers in our software, and having the histogram lets us see that very clearly.

Workarounds

Parse report -type hist human-readable output
Use vegeta as a library

Thanks for writing vegeta!

fxkr · 2019-04-24T07:41:50Z

I went ahead and did it the way I think it's best; it was pretty easy. Let me know if you'd prefer it to be done differently.

tsenart · 2019-04-28T11:40:29Z

> Automatic logarithmic binning would be kinda neat (I love this in bcc), but I see it as out of scope.

I'd LOVE this, really!

fxkr added a commit to fxkr/vegeta that referenced this issue Apr 24, 2019

report: Add -hist option for -type=json

2de0f61

Closes tsenart#394

fxkr mentioned this issue Apr 24, 2019

Include histogram in JSON output #395

Merged

fxkr added a commit to fxkr/vegeta that referenced this issue May 28, 2019

report: Add -buckets option for -type=json

e161de3

Closes tsenart#394

fxkr added a commit to fxkr/vegeta that referenced this issue May 28, 2019

report: Add -buckets option for -type=json

e0d39d3

Closes tsenart#394

fxkr added a commit to fxkr/vegeta that referenced this issue Jun 24, 2019

report: Add -buckets option for -type=json

4e4bc0f

Closes tsenart#394

tsenart closed this as completed in 2870712 Jun 24, 2019
