Add detailed CPU benchmark #1188

marcotc · 2020-09-28T19:08:47Z

This PR adds CPU profiling using ruby-prof.

All our existing benchmarks only provide deep analysis of memory behavior. This PR introduces a detailed analysis tool to measure application timing.

By default, the results are output in Cachegrind format, and can be analyzed with tools like KCachegrind or QCachegrind.
Here's an example:

When the benchmark runs, instructions are printed on how to get to this colorful screen I posted above.
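For readers unfamiliar with ruby-prof, here is a minimal sketch of how a block of code could be profiled and written out in the callgrind format that KCachegrind and QCachegrind read. It is not the benchmark code from this PR: the profile_cpu helper and the output directory are hypothetical, and the fallback for a missing gem is an assumption that mirrors the JRuby and Ruby < 2.4 exclusions in the commits.

```ruby
require 'tmpdir'

# ruby-prof is optional here: it is not available on every runtime
# (for example JRuby), so fall back to running the block unprofiled.
begin
  require 'ruby-prof'
  RUBY_PROF_AVAILABLE = true
rescue LoadError
  RUBY_PROF_AVAILABLE = false
end

# Hypothetical helper: profiles the given block and writes
# callgrind-format files into output_dir. Returns the block's value.
def profile_cpu(output_dir)
  return yield unless RUBY_PROF_AVAILABLE

  profile = RubyProf::Profile.new
  profile.start
  value = yield
  result = profile.stop

  # CallTreePrinter emits the callgrind format.
  RubyProf::CallTreePrinter.new(result).print(path: output_dir)
  value
end

Dir.mktmpdir do |dir|
  total = profile_cpu(dir) { (1..10_000).reduce(:+) }
  puts "result: #{total}, files written: #{Dir.children(dir).size}"
end
```

The files produced in the output directory can then be opened with KCachegrind or QCachegrind.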

marcotc · 2020-09-28T19:13:41Z

spec/ddtrace/benchmark/microbenchmark_spec.rb

      after { tracer.shutdown! }

+      let(:writer) { Datadog::Writer.new(buffer_size: 1000, flush_interval: 0) }


This ensures that spans are actually being consumed quickly by the writer, instead of being mostly dropped by the buffer due to the 1000 span default limit.

Our benchmarks aim to simulate a realistic user scenario, but as quickly as possible. Having the worker flush spans more frequently helps us accomplish that goal.
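To illustrate why flushing more often matters, here is a toy simulation of a bounded buffer. It is a sketch only: BoundedBuffer and simulate are hypothetical names, and this is not how Datadog::Writer is implemented. It only shows that a full buffer drops new items unless something drains it quickly.

```ruby
# Hypothetical bounded buffer: drops new items once it is full.
class BoundedBuffer
  attr_reader :dropped, :flushed

  def initialize(max_size)
    @max_size = max_size
    @items = []
    @dropped = 0
    @flushed = 0
  end

  def push(item)
    if @items.size >= @max_size
      @dropped += 1
    else
      @items << item
    end
  end

  # Consume everything currently buffered.
  def flush
    @flushed += @items.size
    @items.clear
  end
end

# Push `spans` items, flushing after every `flush_every` pushes.
def simulate(spans:, max_size:, flush_every:)
  buffer = BoundedBuffer.new(max_size)
  spans.times do |i|
    buffer.push(i)
    buffer.flush if ((i + 1) % flush_every).zero?
  end
  buffer.flush
  { flushed: buffer.flushed, dropped: buffer.dropped }
end

slow = simulate(spans: 10_000, max_size: 1000, flush_every: 5000)
fast = simulate(spans: 10_000, max_size: 1000, flush_every: 1)
puts "infrequent flush: #{slow}"
puts "constant flush:   #{fast}"
```

With infrequent flushes most spans are dropped and never exercise the write path, so the benchmark would measure little beyond buffer rejection.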

marcotc · 2020-09-28T19:15:32Z

spec/ddtrace/benchmark/support/benchmark_helper.rb


-    # Warm up
+  def warm_up


Warm-up was previously done in an RSpec before block, which could run at any point during the setup phase. This causes issues when a benchmark's setup must run before warm-up starts.

warm_up now runs inside the test run block, which ensures that it runs after all setup is done.
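The ordering concern can be sketched with a small stand-alone harness. BenchmarkRun below is hypothetical and is not the helper in benchmark_helper.rb; it only shows warm-up being called from inside the run step, so it cannot start before setup has finished.

```ruby
require 'benchmark'

# Hypothetical harness: setup blocks are registered up front, and
# warm-up happens inside #run, after every setup block has executed.
class BenchmarkRun
  attr_reader :events

  def initialize(&subject)
    @subject = subject
    @setups = []
    @events = []
  end

  def setup(name, &block)
    @setups << [name, block]
  end

  def warm_up(iterations = 3)
    iterations.times { @subject.call }
    @events << :warm_up
  end

  def run(iterations = 10)
    @setups.each do |name, block|
      block.call
      @events << name
    end

    warm_up # guaranteed to run after all setup

    elapsed = Benchmark.realtime { iterations.times { @subject.call } }
    @events << :measure
    elapsed
  end
end

bench = BenchmarkRun.new { (1..1_000).reduce(:+) }
bench.setup(:configure_tracer) { }
bench.setup(:start_server) { }
bench.run
puts bench.events.inspect
```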

marcotc · 2020-09-28T19:17:28Z

spec/ddtrace/benchmark/support/benchmark_helper.rb

+      # Read HTTP request to allow other side to have enough
+      # buffer write room. If we don't, the client won't be
+      # able to send the full request until the buffer is cleared.
+      conn.read(1 << 31 - 1)


This happens because, in the end-to-end test, we send very large payloads, as the buffer constantly holds close to 1000 items.

The network write buffer was filling up on the worker side, so the worker would block and stop flushing until it timed out.
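The blocking behaviour can be reproduced without HTTP. The sketch below is an illustration under assumptions, not the test server from this PR: it uses a local UNIX socket pair in place of the real connection, and send_payload is a hypothetical helper. When the receiving side never reads, the sender's non-blocking write stops making progress once the socket buffer is full.

```ruby
require 'socket'

# Hypothetical helper: tries to send `payload` over a socket pair and
# returns how many bytes were accepted. With drain: true, the other
# side reads until EOF, which keeps freeing buffer space.
def send_payload(payload, drain:)
  client, server = UNIXSocket.pair
  reader = Thread.new { server.read } if drain

  sent = 0
  begin
    while sent < payload.bytesize
      sent += client.write_nonblock(payload.byteslice(sent, 65_536))
    end
  rescue IO::WaitWritable
    if drain
      # Buffer is temporarily full; wait for the reader to catch up.
      IO.select(nil, [client])
      retry
    end
    # Not draining: a blocking write would hang here, so give up.
  end

  client.close
  reader&.join
  server.close
  sent
end

payload = 'x' * (8 * 1024 * 1024)
puts "with reader:    #{send_payload(payload, drain: true)} bytes sent"
puts "without reader: #{send_payload(payload, drain: false)} bytes sent"
```

Reading the request on the server side plays the role of the draining thread here: it frees buffer space so the sender can finish its write.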

ericmustin

LGTM. Maybe we want to include a line or small section in the development docs about using it, so third-party contributors know to use tools like KCachegrind or QCachegrind to view the results, or more generally anything we want a contributor to pay attention to in the results?

marcotc · 2020-09-29T20:32:45Z

@ericmustin I added a section to our developer guide about benchmarks.
I believe these instructions should be available in our Pull Request templates, when we implement them.

Benchmark-specific instructions, like how to process the results in this PR, are printed in each benchmark run, as they are potentially different for each benchmark. It would be hard to keep our markdown file and the benchmark instructions in sync; printing them with the benchmarks makes that part easier.

* Add CPU benchmark
* Remove incompatible ruby-prof for JRuby runs
* Remove incompatible 'ruby-prof' for Ruby < 2.4 runs
* Add benchmarks to developer guide

Add CPU benchmark

310d824

marcotc added the performance label Sep 28, 2020

marcotc requested a review from a team September 28, 2020 19:08

marcotc self-assigned this Sep 28, 2020

marcotc added 2 commits September 28, 2020 15:09

Remove incompatible ruby-prof for JRuby runs

e082afa

Remove incompatible 'ruby-prof' for Ruby < 2.4 runs

dffd678

marcotc commented Sep 28, 2020


marcotc changed the title ~~Add CPU benchmark~~ Add detailed CPU benchmark Sep 28, 2020

ericmustin previously approved these changes Sep 29, 2020


Add benchmarks to developer guide

232e3e8

marcotc dismissed ericmustin’s stale review via 232e3e8 September 29, 2020 20:30

marcotc requested a review from ericmustin September 29, 2020 20:32

ericmustin approved these changes Oct 6, 2020


marcotc merged commit fbb068a into master Oct 7, 2020

marcotc added this to the 0.42.0 milestone Oct 7, 2020

ivoanjo deleted the perf/cpu-bench branch July 16, 2021 09:13
