Simple benchmarks of a few topologies #52

michaelfairley · 2018-12-11T16:13:03Z

Closes #39.

These take about a minute to run on my main development machine, which seems like a decent compromise between thoroughness and iteration speed. Their speed can be tweaked with num_lines or sample_size (to change either how much data a single benchmark handles or how many times the benchmark is run).

I focused on a few topologies that I expect to have different bottlenecks or be impacted by different types code changes. Let me know if there are any other setups that you think are worth including in here.

One thing that's not in here that I think is probably worth trying at some point: Having some benchmarks with artificial sinks/sources that don't use the network (e.g. a random line generator sink) and use those to detect changes that would otherwise be hidden behind network bottlenecks (or even just the noise of using the network).

The code in these is not super pretty. If you see any easy wins on how to clean them up at all, I would love that feedback.

These are not the purest benchmarks in the world (e.g. the harness itself takes up resources, and the network traffic between the harness and the system-under-test might be a bottleneck at times), but it seems decent enough to identify significant changes in either direction.

Instructions for using this thing:
cargo bench will run it (and report the time difference between the previous run of cargo bench).

After cargo installing critcmp, it can be used to compare non-consecutive run:

cargo bench --bench bench -- --save-baseline before
# Make changes that impact performance
cargo bench --bench bench -- --save-baseline after
critcmp before after --list

(An example of this in action in #53.)

It also seems like we're off to a very good start with our performance. pipe (100 byte lines) is doing 16.1 MB/sec, and pipe_with_huge_lines (100kb lines) is getting 288.4 MB/sec.

LucioFranco

Took a quick look mostly cuz I was curious but this is looking fantastic!

LucioFranco · 2018-12-11T16:36:47Z

benches/bench.rs

+                .and_then(|sink| {
+                    // This waits for FIN from the server so we don't start shutting it down before it's fully received the test data
+                    let socket = sink.into_inner().into_inner();
+                    socket.shutdown(std::net::Shutdown::Write).unwrap();


Curious why you decided not to use sink.close()?

There's an annoying and complicated race condition here:
If we use sink.close(), it returns immediately (without waiting for the server to acknowledge that the connection was closed), and the only guarantee we have is that the packets are at least queued up in the kernel's networking buffers. Since the very thing that happens after this is shutting the server down, there a chance that we could actually shut the server down before it's started processing its input.
Shutting down the write half of the socket sends FIN to the server letting it know the client is done sending data, and once it's read everything the client has sent, it'll respond with its own FIN (which the reads on the lines after this wait for), letting us be sure that the server has at least picked up all of the input before we start shutting it down.

Ah that makes sense, what about this https://docs.rs/tokio/0.1.13/tokio/io/trait.AsyncWrite.html#tymethod.shutdown?

Oh sweet! I've never noticed that. Cleaned up a bit with that in 39bd742

lukesteensen · 2018-12-11T17:14:03Z

🎉 This looks awesome! One thing I'd be curious to see is a comparison against something that's built statically with the stream and sink combinators. That way we can get a rough idea of what (if any) price we're paying for the dynamic topology stuff.

Added OWNERS file

Simple benchmarks of a few topologies

72e470a

michaelfairley requested a review from lukesteensen December 11, 2018 16:13

LucioFranco reviewed Dec 11, 2018

View reviewed changes

michaelfairley merged commit 112ec06 into master Dec 11, 2018

michaelfairley deleted the benchmarking branch December 11, 2018 19:25

lukesteensen mentioned this pull request Dec 11, 2018

Automated performance testing #48

Closed

syedriko referenced this pull request in syedriko/vector Jun 1, 2022

Merge pull request #52 from vimalk78/add-owners

bc0aa16

Added OWNERS file

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Simple benchmarks of a few topologies #52

Simple benchmarks of a few topologies #52

michaelfairley commented Dec 11, 2018 •

edited

LucioFranco left a comment

LucioFranco Dec 11, 2018

michaelfairley Dec 11, 2018

LucioFranco Dec 11, 2018

michaelfairley Dec 11, 2018

lukesteensen commented Dec 11, 2018

Simple benchmarks of a few topologies #52

Simple benchmarks of a few topologies #52

Conversation

michaelfairley commented Dec 11, 2018 • edited

LucioFranco left a comment

Choose a reason for hiding this comment

LucioFranco Dec 11, 2018

Choose a reason for hiding this comment

michaelfairley Dec 11, 2018

Choose a reason for hiding this comment

LucioFranco Dec 11, 2018

Choose a reason for hiding this comment

michaelfairley Dec 11, 2018

Choose a reason for hiding this comment

lukesteensen commented Dec 11, 2018

michaelfairley commented Dec 11, 2018 •

edited