Add (new) mpl benchmarks #404

Merged: 2 commits merged into ocaml-bench:main from mpl-benchmarks on Nov 1, 2022

Conversation

@Firobe (Contributor) commented Oct 21, 2022

Close #401

Add msort_ints, msort_strings, primes, tokens, raytracer from https://github.com/MPLLang/parallel-ml-bench/tree/main/ocaml/bench to the parallel benchmarks (with 1, 2, 4, 8, 16, 32 cores each) on Turing and Navajo.

@Firobe force-pushed the mpl-benchmarks branch 4 times, most recently from c8e1ca8 to c071892 on October 24, 2022
@Firobe marked this pull request as ready for review on October 24, 2022
@Firobe changed the title from "[WIP] Add (new) mpl benchmarks" to "Add (new) mpl benchmarks" on Oct 24, 2022
@Firobe force-pushed the mpl-benchmarks branch 2 times, most recently from 6a57471 to 44a9657 on October 24, 2022
@Sudha247 (Contributor) left a comment

Thanks for adding the benchmarks @Firobe!

The config files have the correct benchmarks added to them. However, the benchmarks/mpl/bench directory includes some benchmarks that are already present in Sandmark. Perhaps we can remove the ones that are already available? You can see the existing parallel benchmarks in multicore-numerical and the other directories with the multicore- prefix.

Review threads on multicore_parallel_navajo_run_config.json and multicore_parallel_run_config.json (outdated, resolved)
@Firobe (Contributor, Author) commented Oct 27, 2022

I think it should now be good to go!

@Sudha247 (Contributor) left a comment

Thanks for the updates. Looks good to me!

@shakthimaan (Contributor)

Thanks for the review! LGTM.

@shakthimaan merged commit f99a047 into ocaml-bench:main on Nov 1, 2022
@Firobe deleted the mpl-benchmarks branch on November 1, 2022
@kayceesrk (Contributor)

Do these benchmarks appear in the Sandmark nightly results? @shakthimaan

If not, can you make them appear on the nightly results?

@Sudha247 (Contributor)

We should add a macro_bench tag for them to appear in sandmark-nightly, I think.

@kayceesrk (Contributor)

Do we know that these benchmarks deserve the macro_bench tag? In particular, do they run for long enough?

@kayceesrk (Contributor)

One way to find out is to add the macro_bench tag, observe the nightly results, and tune the benchmarks appropriately.

@Sudha247 (Contributor)

All the benchmarks have a gt_100s tag, so I take it that they run for more than 100s, at least for the one-core version. In that case, we can safely tag them as macro_bench.

@kayceesrk (Contributor) commented Jan 10, 2023

In addition to the missing macro_bench tag, the benchmarks are not in the right format to be visualized. In particular, for a benchmark to be visualized:

  1. The benchmark should have a serial version with the name foo.
  2. The benchmark should have a parallel version with the name foo_multicore.
    • IMPORTANT: The parallel version running on 1 domain is not the same as the sequential version.
  3. The first argument (params or short_name) should be the number of domains; other arguments can follow. When using short_name, the format should be <num_domains>_<other_arg>, with an _ separating the number of domains from the other arguments (see the sketch after this list).

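To make the convention concrete, here is a rough sketch of what matching config entries might look like, using msort_ints from this PR as the example. The field names (name, executable, tags, runs, params, short_name), the paths, and the argument values are assumptions about the run_config.json layout, not excerpts from the actual files:

```json
{
  "benchmarks": [
    {
      "name": "msort_ints",
      "executable": "benchmarks/mpl/msort_ints.exe",
      "tags": ["macro_bench", "gt_100s"],
      "runs": [
        { "params": "100000000" }
      ]
    },
    {
      "name": "msort_ints_multicore",
      "executable": "benchmarks/mpl/msort_ints_multicore.exe",
      "tags": ["macro_bench", "gt_100s"],
      "runs": [
        { "params": "1 100000000", "short_name": "1_100000000" },
        { "params": "2 100000000", "short_name": "2_100000000" },
        { "params": "4 100000000", "short_name": "4_100000000" }
      ]
    }
  ]
}
```

The essential points are the paired names (msort_ints and msort_ints_multicore) and the short_name values that begin with the number of domains, separated from the remaining arguments by an underscore.
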
I think this is also the reason why the graph500 results don't appear in the nightly results.

It appears that this part of the codebase needs a little bit of love and attention. Someone should:

  1. Review all the current parallel benchmarks to check whether each benchmark tagged as macro_bench actually appears in the nightly run.
  2. Open an issue listing all the benchmarks tagged as macro_bench that do not appear in the nightly runs.
  3. Triage each one to identify what the problem is.
  4. Update the documentation so that contributors know what the expectations are (see the previous list).
    • Perhaps convert this into a checklist that contributors and reviewers can use.
    • Build better tooling that can catch this.

Unfortunately, the current state is less than ideal. We don't yet have the tools or process to catch issues like these before PRs such as this one are merged.

@punchagan (Contributor)

Thanks for the review of the benchmarks and the notes. I have not looked at the new mpl benchmarks in this PR, but I agree that the current state of the benchmarks should be improved.

> I think this is also the reason why the graph500 results don't appear in the nightly results.

The graph500 benchmark implementation was not scalable, so we have disabled its results from showing up in the UI until we fix the scalability.

Building a checklist for contributors and/or reviewers would be a good first step, followed by some tooling to make sure that at least the easily verifiable checklist requirements are followed.

@kayceesrk (Contributor)

> Building a checklist for contributors and/or reviewers would be a good first step, followed by some tooling to make sure that at least the easily verifiable checklist requirements are followed.

Tooling here is a bit tricky. One could possibly build this as a jq script that checks, for a given benchmark, whether a serial version exists along with appropriately named parallel versions, and so on.
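As a rough illustration of such a check (a sketch only, assuming the benchmark entries sit under a top-level benchmarks array with a name field, which may not match the real config schema exactly), one could list parallel benchmarks whose serial counterpart is missing:

```sh
# Sketch: report <foo>_multicore entries that have no matching serial <foo>.
# Assumes names live at .benchmarks[].name; adjust to the actual schema.
jq -r '
  [.benchmarks[].name] as $names
  | $names[]
  | select(endswith("_multicore"))
  | sub("_multicore$"; "") as $serial
  | select(($names | index($serial)) == null)
  | "missing serial version: \($serial)"
' multicore_parallel_run_config.json
```

A similar filter could check that each short_name begins with the number of domains.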

> The graph500 benchmark implementation was not scalable, so we have disabled its results from showing up in the UI until we fix the scalability.

Thanks for the clarification here.

@Firobe (Contributor, Author) commented Jan 10, 2023

Okay, I'm going to update the mpl benchmarks with the proper tags and naming conventions, and document in the README the naming expectations that KC listed.

@kayceesrk (Contributor)

Thanks @Firobe 👍

@Firobe (Contributor, Author) commented Jan 13, 2023

See #439

Successfully merging this pull request may close: Add MaPLe benchmarks to Sandmark (#401)