Benchmarks #7

whoahbot · 2022-02-09T00:35:11Z

Overview

This PR adds the framework for a benchmarking suite. The benchmark included here is only an example of benchmarking Python vs Bytewax, and should be replaced with a more suitable one.

Running the benchmarks

Running `cargo bench` normally results in linker errors due to this issue. To run the benchmarks, use:

cargo bench --no-default-features

- Adds `execute_directly()` to `Executor` to specify a number of threads to run on.
- Adds criterion library

davidselassie

Looks like a good start! We can iterate on the entry point stuff and further hone this.

davidselassie · 2022-02-09T18:18:07Z

benches/benchmarks/wordcount_bytewax.py

+def acc(word_to_count, words):
+    for word in words:
+        if word not in word_to_count:
+            word_to_count[word] = 0


Should be able to drop this `if`, since that's what `defaultdict` does.

Good catch, thanks!
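
For reference, a minimal sketch of what the simplified accumulator might look like once the `if` is dropped. The rest of `acc` is not shown in the diff above, so the increment, the return value, and the sample input here are assumptions for illustration, not the code from this PR.

```python
from collections import defaultdict


def acc(word_to_count, words):
    # defaultdict(int) creates a missing key with value 0 on first access,
    # so no explicit "if word not in word_to_count" check is needed.
    for word in words:
        word_to_count[word] += 1  # assumed body; the diff is truncated here
    return word_to_count  # assumed return value


# Hypothetical input, only to show the behaviour.
counts = acc(defaultdict(int), ["a", "b", "a"])
```

Note that this only works when the accumulator really is a `defaultdict(int)`, as built by `lambda: defaultdict(int)` in the dataflow; with a plain `dict` the first increment would raise `KeyError`.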

davidselassie · 2022-02-09T18:19:38Z

benches/benchmarks/wordcount_bytewax.py

+exec = bytewax.Executor()
+flow = exec.Dataflow(file_input())
+flow.flat_map(tokenize)
+flow.accumulate(lambda: defaultdict(int), acc)


I'm actually curious what changing this to `reduce_epoch` does to performance: we'd be leaning more on Rust code to do the grouping and collecting, but there'd be more back and forth between Python and Rust.

Agreed! I'll let you know what I find out.
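
To make the trade-off concrete, here is a rough plain-Python sketch of the two shapes being compared. It does not use the Bytewax API: `count_by_accumulating`, `count_by_reducing`, and this `tokenize` are hypothetical stand-ins, and the grouping that Bytewax would do in Rust is simulated with an ordinary dict.

```python
import operator
from collections import defaultdict


def tokenize(line):
    # Hypothetical tokenizer; the one in the PR is not shown.
    return line.split()


def count_by_accumulating(lines):
    # Accumulate style: Python code owns the whole dict and does the
    # grouping itself, one call per batch of words.
    counts = defaultdict(int)
    for line in lines:
        for word in tokenize(line):
            counts[word] += 1
    return dict(counts)


def count_by_reducing(lines, reducer=operator.add):
    # Reduce style: emit (word, 1) pairs and combine values per key.
    # The framework would do the grouping; the Python reducer is called
    # once per pair after the first, which is the extra back and forth.
    counts = {}
    for line in lines:
        for word in tokenize(line):
            counts[word] = reducer(counts[word], 1) if word in counts else 1
    return counts


sample = ["a b a", "b c"]
same_result = count_by_accumulating(sample) == count_by_reducing(sample)
```

Both functions produce the same counts; what differs is how many times control crosses into the Python callback, which is the cost the benchmark would need to measure.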

Benchmarks

a8817a9

whoahbot requested review from blakestier and davidselassie February 9, 2022 00:35

davidselassie approved these changes Feb 9, 2022

whoahbot merged commit 4ba41b1 into main Feb 10, 2022

whoahbot deleted the benchmarks branch February 10, 2022 00:43
