Implement parallelism using rayon #110

Cadair · 2024-02-08T21:25:53Z

This is really pushing my rust skills, but I think it works.

Without parallelism enabled:

v1.2 (FORTRAN)
    Unnamed: 0  nseeds      time
0            0       1  0.094430
1            1       2  0.088924
2            2       4  0.092906
3            3       8  0.098172
4            4      16  0.094207
5            5      32  0.097889
6            6      64  0.098379
7            7     128  0.110099
8            8     256  0.139267
9            9     512  0.188014
10          10    1024  0.296180
11          11    2048  0.509781
v2.0 (Rust)
    Unnamed: 0  nseeds      time
0            0       1  0.003156
1            1       2  0.005489
2            2       4  0.011511
3            3       8  0.024024
4            4      16  0.048538
5            5      32  0.092113
6            6      64  0.183648
7            7     128  0.364994
8            8     256  0.729221
9            9     512  1.458128
10          10    1024  2.927491
11          11    2048  5.816854
v2.0 (Rust - Parallel)
    Unnamed: 0  nseeds      time
0            0       1  0.003991
1            1       2  0.003396
2            2       4  0.004035
3            3       8  0.006338
4            4      16  0.008281
5            5      32  0.016827
6            6      64  0.038427
7            7     128  0.066799
8            8     256  0.132020
9            9     512  0.257077
10          10    1024  0.509309
11          11    2048  1.011820

This is rebased on #111 - See 73ec541 for the relevant diff

codecov-commenter · 2024-02-13T21:28:59Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 96.82%. Comparing base (ae934de) to head (78e90c3).
Report is 14 commits behind head on main.

❗ Current head 78e90c3 differs from pull request most recent head ad122fe. Consider uploading reports for the commit ad122fe to get more accurate results

Additional details and impacted files

@@           Coverage Diff           @@
##             main     #110   +/-   ##
=======================================
  Coverage   96.82%   96.82%           
=======================================
  Files           2        2           
  Lines         126      126           
=======================================
  Hits          122      122           
  Misses          4        4

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

Will-Shanks

Moving my comments here from slack per @nabobalis request 😄 Hope that helps!

Will-Shanks · 2024-02-16T23:05:42Z

src/trace.rs

-                seeds.slice(s![i, ..]),
+    let all_lines: Vec<StreamlineResult> = seeds
+        .axis_iter(Axis(0))
+        .into_par_iter()


If you don't need/want the work stealing that rayon does, its a little more leg work to implement, but batching the seeds and using scoped_threads would probably be faster since it doesn't have the communication overhead.

src/trace.rs

nabobalis · 2024-02-17T00:20:57Z

Moving my comments here from slack per @nabobalis request 😄 Hope that helps!

Thank you @Will-Shanks !

Cadair · 2024-05-01T09:31:36Z

src/trace.rs

+            (result.status, result.line)
+        }).unzip();
+
+    let extracted_lines_views: Vec<ArrayView2<f64>> = extracted_lines.iter().map(|arr| arr.view()).collect();


@Will-Shanks If you don't mind me continuing to pick your brains, I finally came back to this.

I think I understood what you said about combining the things into a single map, but I couldn't figure out a way around this extra loop to convert the arrays into views which stack needed?

After playing around with it I agree with you, I couldn't come up with anything better that the borrow checker approved of.

let extracted_lines_views: Vec<ArrayView2<f64>> = - extracted_lines.iter().map(|arr| arr.view()).collect(); - let xs = stack(Axis(0), extracted_lines_views.as_slice()).unwrap(); - return (statuses, xs); + extracted_lines.par_iter().map(|arr| arr.view()).collect(); + let xs = stack(Axis(0), &extracted_lines_views).unwrap(); + (statuses, xs) }

if view() is lightweight (which I'm guessing it probably is) the iter/par_iter change probably isn't productive

I would hope extracted_lines_views.as_slice() and &extracted_lines_views end up as the same machine code, but I think the latter should be preferred.

The par_iter seems to have a marginal positive impact for large numbers of slices, and a large negative impact for smaller number of seeds:

So I think it's probably best without. &extracted_lines_views works though.

Cadair · 2024-05-01T10:09:27Z

My latest comparative benchmark (not to be directly compared against the one in the OP as it's a different machine):

Will-Shanks · 2024-05-02T18:15:20Z

@Cadair a small suggestion I would make is to run cargo fmt before committing, I also personally try and keep cargo clippy [--fix] happy as it often has good suggestions, and use cargo outdated -R and cargo udeps to help keep track of outdated/unused dependencies.

Of course this is mostly a personal preference so feel free to take it or leave it :)

nabobalis · 2024-05-02T18:24:31Z

@Cadair a small suggestion I would make is to run cargo fmt before committing, I also personally try and keep cargo clippy [--fix] happy as it often has good suggestions, and use cargo outdated -R and cargo udeps to help keep track of outdated/unused dependencies.

Of course this is mostly a personal preference so feel free to take it or leave it :)

Are these possible to do with a pre-commit config?

Will-Shanks · 2024-05-02T18:30:51Z

@Cadair a small suggestion I would make is to run cargo fmt before committing, I also personally try and keep cargo clippy [--fix] happy as it often has good suggestions, and use cargo outdated -R and cargo udeps to help keep track of outdated/unused dependencies.
Of course this is mostly a personal preference so feel free to take it or leave it :)

Are these possible to do with a pre-commit config?

I don't have that setup, but I don't see why you couldn't. I'd probably only do that for cargo fmt as the other 3 can take a minute to run, you could also consider adding them as CI tests.

Cadair · 2024-05-02T18:58:56Z

I was planning on adding a formatter after this PR gets merged (and before we do the next release), thanks for the pointers on the other cargo commands they look really useful.

Cadair · 2024-05-03T08:31:09Z

pre-commit.ci autofix

for more information, see https://pre-commit.ci

Will-Shanks · 2024-05-03T19:57:12Z

@Cadair do you have any notes on how you're generating those graphs? Y'all have got me curious enough now I'd like to try running it, but I can't figure out what I'm doing wrong. When I try and run one of the scripts in benchmarks after running maturin develop I get a ModuleNotFoundError: No module named 'streamtracer._streamtracer_rust' error

Cadair · 2024-05-03T20:43:52Z

I am doing a plain pip install [-e] . then running benchmark.py then benchmark_plot.py to plot out the CSVs.

Cadair · 2024-05-07T09:17:03Z

I am going to merge this to unblock #137 and #135. @Will-Shanks I am very happy to continue to take suggestions as and when you have any though!

Cadair force-pushed the parallelism branch 3 times, most recently from 73ec541 to 78e90c3 Compare February 13, 2024 21:28

Cadair marked this pull request as ready for review February 13, 2024 21:28

Will-Shanks reviewed Feb 16, 2024

View reviewed changes

Implement parallelism using rayon

eab38d2

Cadair force-pushed the parallelism branch from 78e90c3 to ad122fe Compare May 1, 2024 09:30

Cadair commented May 1, 2024

View reviewed changes

Cadair force-pushed the parallelism branch from ad122fe to 50b3f4d Compare May 1, 2024 09:35

Cadair added 2 commits May 1, 2024 15:56

Some more rusty refactoring

50c1d02

Some benchmark script improvements

f7333fc

Cadair force-pushed the parallelism branch from 50b3f4d to f7333fc Compare May 1, 2024 14:56

This was referenced May 1, 2024

Release Initial Version sunpy/sunkit-magex#36

Closed

Release v2.1.0 with the parallel support #135

Closed

use & not as_slice

8b3438f

Cadair force-pushed the parallelism branch from 0c6d794 to 8b3438f Compare May 2, 2024 19:10

Even more benchmark

557fd4a

Cadair mentioned this pull request May 2, 2024

Add rust pre-commit #136

Closed

[pre-commit.ci] auto fixes from pre-commit.com hooks

30ae5df

for more information, see https://pre-commit.ci

Cadair mentioned this pull request May 4, 2024

Add a rust linter and formatter (Take 2) #137

Merged

Cadair merged commit 905c5ab into sunpy:main May 7, 2024
14 checks passed

Cadair deleted the parallelism branch May 7, 2024 09:17

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Implement parallelism using rayon #110

Implement parallelism using rayon #110

Cadair commented Feb 8, 2024 •

edited

codecov-commenter commented Feb 13, 2024 •

edited

Will-Shanks left a comment

Will-Shanks Feb 16, 2024

nabobalis commented Feb 17, 2024

Cadair May 1, 2024

Will-Shanks May 2, 2024

Cadair May 2, 2024

Cadair commented May 1, 2024

Will-Shanks commented May 2, 2024

nabobalis commented May 2, 2024

Will-Shanks commented May 2, 2024

Cadair commented May 2, 2024

Cadair commented May 3, 2024

Will-Shanks commented May 3, 2024

Cadair commented May 3, 2024

Cadair commented May 7, 2024

Implement parallelism using rayon #110

Implement parallelism using rayon #110

Conversation

Cadair commented Feb 8, 2024 • edited

codecov-commenter commented Feb 13, 2024 • edited

Codecov Report

Will-Shanks left a comment

Choose a reason for hiding this comment

Will-Shanks Feb 16, 2024

Choose a reason for hiding this comment

nabobalis commented Feb 17, 2024

Cadair May 1, 2024

Choose a reason for hiding this comment

Will-Shanks May 2, 2024

Choose a reason for hiding this comment

Cadair May 2, 2024

Choose a reason for hiding this comment

Cadair commented May 1, 2024

Will-Shanks commented May 2, 2024

nabobalis commented May 2, 2024

Will-Shanks commented May 2, 2024

Cadair commented May 2, 2024

Cadair commented May 3, 2024

Will-Shanks commented May 3, 2024

Cadair commented May 3, 2024

Cadair commented May 7, 2024

Cadair commented Feb 8, 2024 •

edited

codecov-commenter commented Feb 13, 2024 •

edited