
Refactor serialization of benchmarks #1131

Merged · 13 commits into main on Jan 12, 2024

Conversation

@syl20bnr (Member) commented Jan 11, 2024

See the Changes section below for the full list.

I will add an "upload" function to the persistence module in a follow-up PR.

The current naming of the saved files needs a bit of work: currently I just use benchmarks_<timestamp>.json, but we could hit a race condition and have some files overwritten. --> We updated it to bench_<name>_<timestamp>.json.

The computed values are saved in seconds; maybe we should save them in milliseconds? --> We changed it to microseconds.
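
A minimal sketch of the naming and unit conventions described above (the helper names are illustrative, not the actual persistence code):

    use std::time::{Duration, SystemTime, UNIX_EPOCH};

    /// Build the output file name following the bench_<name>_<timestamp>.json convention,
    /// using a millisecond timestamp so concurrent runs are unlikely to collide.
    fn benchmark_file_name(name: &str) -> String {
        let timestamp = SystemTime::now()
            .duration_since(UNIX_EPOCH)
            .expect("system time should be after the UNIX epoch")
            .as_millis();
        format!("bench_{name}_{timestamp}.json")
    }

    /// Store computed values as integer microseconds instead of fractional seconds.
    fn duration_to_micros(duration: Duration) -> u64 {
        duration.as_micros() as u64
    }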

You can see an example of a produced file here: https://gist.github.com/syl20bnr/b0fdbbf0c3877b9e4ddf262a9cc388ac

Checklist

  • Confirmed that run-checks all script has been executed.
  • Made sure the book is up to date with changes in this PR.

Related Issues/PRs

Changes

  • flatten the benchmark data to make it easier to save documents to a database and query them (see the sketch after this list)
  • split some information into its own fields, such as backend and device
  • add new serialized info:
    • computed values (mean, median, variance, min, max)
    • number of samples
    • operation name
    • tensor shapes, if any
  • serialize to separate files, one file per benchmark run
  • simplify the persistence module to a single save method
  • remove the remaining num_repeats in the benches' execute functions
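
To illustrate the flattened layout, here is a rough sketch of what one saved document could look like; the field names and types are assumptions based on the list above and the linked gist (and the snippet assumes serde with the derive feature), not the real schema:

    use serde::Serialize;

    /// One flattened document per benchmark run, easy to insert into a
    /// database and to query without unnesting.
    #[derive(Serialize)]
    struct BenchmarkRecord {
        backend: String,
        device: String,
        name: String,
        operation: String,
        /// Tensor shapes involved in the benchmark, if any.
        shapes: Vec<Vec<usize>>,
        num_samples: usize,
        /// Computed values, stored in microseconds.
        mean: u64,
        median: u64,
        variance: u64,
        min: u64,
        max: u64,
    }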

Testing

Tested serialization to disk with all the available benchmarks.

@syl20bnr (Member, Author)

For the .expect() messages, I am not sure what to put in them. I see a lot of people putting an explanation of the error there, and sometimes people put what is expected to work. The word "expect" suggests the latter, but that's not what I see online most of the time 🤷🏻‍♂️. What's our code guideline on this?

codecov bot commented Jan 11, 2024

Codecov Report

Attention: 80 lines in your changes are missing coverage. Please review.

Comparison is base (f43b686) 85.65% compared to head (94a214b) 85.67%.
Report is 3 commits behind head on main.

File                                         Patch %   Missing lines
backend-comparison/src/persistence/base.rs    0.00%    51 ⚠️
burn-common/src/benchmark.rs                 73.33%    28 ⚠️
burn-compute/src/tune/tune_benchmark.rs       0.00%     1 ⚠️
Additional details and impacted files
@@            Coverage Diff             @@
##             main    #1131      +/-   ##
==========================================
+ Coverage   85.65%   85.67%   +0.02%     
==========================================
  Files         513      513              
  Lines       56816    56987     +171     
==========================================
+ Hits        48665    48823     +158     
- Misses       8151     8164      +13     


@syl20bnr (Member, Author) commented Jan 11, 2024

I updated the file name to include the bench name and the first 8 characters of a random UUID, for example:

  • benchmarks_gelu_1b2bb391.json
  • benchmarks_matmul_f79409f7.json

The name is passed as an argument to the save function because each bench run within a benchmark can have a different name and operation. For instance, with gelu we have 3 batches of benches using different gelu implementations.
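
Roughly, the save entry point then looks something like the following; the signature and the BenchmarkRecord type (from the earlier sketch) are assumptions rather than the final API, and the snippet assumes the uuid and serde_json crates:

    use uuid::Uuid;

    /// Save one benchmark run to its own JSON file. `name` comes from the caller
    /// because each bench executed by a binary can have a different name and operation.
    fn save(name: &str, record: &BenchmarkRecord) -> std::io::Result<()> {
        let id = Uuid::new_v4().to_string();
        let path = format!("benchmarks_{name}_{}.json", &id[..8]);
        let json = serde_json::to_string_pretty(record)
            .expect("benchmark record should serialize to JSON");
        std::fs::write(path, json)
    }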

@louisfd (Member) commented Jan 11, 2024

For the .expect() messages, I am not sure what to put in them. I see a lot of people putting an explanation of the error there, and sometimes people put what is expected to work. The word "expect" suggests the latter, but that's not what I see online most of the time 🤷🏻‍♂️. What's our code guideline on this?

From here: If you're having trouble remembering how to phrase expect error messages, remember to focus on the word "should", as in "env variable should be set by blah" or "the given binary should be available and executable by the current user".

So I just stick to the word "should" and it works out.
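
For example, following that convention (the environment variable and path here are purely illustrative):

    use std::path::PathBuf;

    fn output_dir() -> PathBuf {
        // Phrase the message as what should hold, not as a description of the failure.
        let home = std::env::var("HOME").expect("HOME should be set for the current user");
        PathBuf::from(home).join("benchmark_results")
    }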

@louisfd (Member) left a comment


It's very good in general. The only issue I see is how often we must specify the name of the benchmark (see my comments on unary, for example). Could it not be unified in one place?

Review threads (outdated, resolved): backend-comparison/benches/unary.rs ×3
@nathanielsimard (Member) left a comment


Instead of adding the new field "operation", we could add a field "ID". "Name" is more generic, and we may have some benchmarks that aren't about single operations. Or maybe we need another structure:

  • name (Could have a default implementation core::any::type_name::<Self>().to_string())
  • shapes
  • options (num_repeat, kind, etc.)

The ID could be generated from the concatenation of all of the above.
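
A sketch of that suggestion, with illustrative names and a naive concatenation for the ID:

    /// Metadata describing a single benchmark, from which an ID can be derived.
    struct BenchmarkInfo {
        /// Could default to core::any::type_name::<Self>() on the benchmark type.
        name: String,
        /// Tensor shapes involved, if any.
        shapes: Vec<Vec<usize>>,
        /// Extra options such as kind, formatted by the benchmark itself.
        options: Option<String>,
    }

    impl BenchmarkInfo {
        /// The ID is generated from the concatenation of all the fields above.
        fn id(&self) -> String {
            format!("{}-{:?}-{:?}", self.name, self.shapes, self.options)
        }
    }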

Review threads (resolved): backend-comparison/benches/binary.rs ×2, backend-comparison/src/persistence/base.rs ×4
@syl20bnr (Member, Author)

It's very good in general. The only issue I see is how often we must specify the name of the benchmark (see my comments on unary, for example). Could it not be unified in one place?

You are right. I thought about it, but I cannot fetch the binary name at runtime; the only references are in Cargo, on the command line, and in the source file name. I'll try to find something.

@syl20bnr (Member, Author)

Ready for another round of review.
Updated gist: https://gist.github.com/syl20bnr/b0fdbbf0c3877b9e4ddf262a9cc388ac

Files on disk naming examples:

  bench_binary_1705007825641.json
  bench_from_data_1705007836909.json
  bench_gelu_reference_1705007872731.json
  bench_gelu_withcustomerf_1705007872987.json
  bench_gelu_withreferenceerf_1705007872929.json
  bench_matmul_1705007855627.json
  bench_to_data_1705007832378.json
  bench_unary_1705007818349.json

I also added num_repeats to the documents; I believe it can be useful.

@nathanielsimard (Member) left a comment


I would not save the num_repeat in the database for now. I plan on eventually removing it since it is often confused with num_samples. It was useful when we didn't have a reliable way to sync a backend without actually reading the data. Repeating an execution meant that the data reading was less impactful.

Review thread (outdated, resolved): backend-comparison/benches/custom_gelu.rs
@syl20bnr (Member, Author)

@nathanielsimard I already save the number of samples. See the gist.

I believe the repeat can still be useful; check the docstring I added for it. I think it is a good concept to have:

    /// Number of executions per sample

Say we have a very quick operation to bench; repeating it to get a meaningful sample can be useful.
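
A sketch of the distinction between samples and repeats (this is not the actual loop in burn-common, just an illustration of the concept):

    use std::time::{Duration, Instant};

    /// Collect `num_samples` durations; each sample times `num_repeats`
    /// back-to-back executions so that very fast operations still produce
    /// a measurable duration per sample.
    fn run_benchmark(mut op: impl FnMut(), num_samples: usize, num_repeats: usize) -> Vec<Duration> {
        (0..num_samples)
            .map(|_| {
                let start = Instant::now();
                for _ in 0..num_repeats {
                    op();
                }
                start.elapsed()
            })
            .collect()
    }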

@syl20bnr (Member, Author) commented Jan 12, 2024

Ideally, the notion of repeat could be integrated into the main loop, but I tried it and it was not trivial to do.

@syl20bnr force-pushed the benchmark_persistence_refactor branch from 902441b to 000af88 on January 12, 2024 at 14:54
@syl20bnr (Member, Author) commented Jan 12, 2024

We decided to remove the num_repeats completely from the benchmarks.

@nathanielsimard (Member) left a comment


🎉

@syl20bnr force-pushed the benchmark_persistence_refactor branch from 4727dce to 278984c on January 12, 2024 at 15:18
@louisfd (Member) left a comment


That's very nice; I think we have reached the beautiful code stage.
I have only two minor requests, and then it's good to go.

Review threads (outdated, resolved): burn-common/src/benchmark.rs ×2
@syl20bnr merged commit 9bd2d7b into main on Jan 12, 2024
14 of 15 checks passed
@syl20bnr deleted the benchmark_persistence_refactor branch on February 13, 2024.