Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Parallelise dials.rs_mapper #2238

Merged
merged 6 commits into from
Oct 31, 2022
Merged

Parallelise dials.rs_mapper #2238

merged 6 commits into from
Oct 31, 2022

Conversation

dagewa
Copy link
Member

@dagewa dagewa commented Oct 4, 2022

Perform the rs_mapper calculation in parallel. The resulting map files are identical. Testing using the thaumatin dataset on dials.data showed that with 16 parallel processes, the program completed 3.4 times faster (7s compared to 24s). With the default grid size the memory requirements increased by a similar factor, but are still modest (1.7 GB compared to 520 MB).

The PR also introduces logging to replace prints, though there is still not yet much console output.

dagewa and others added 6 commits October 3, 2022 13:35
Working towards each process having its own grid and counts arrays
I checked that the output map is identical between this branch and
main for the dials.data insulin dataset. On the branch this was
split over 5 blocks and took about 3 seconds compared with 5
seconds on main. Memory requirements on the branch were a bit less
than double that on main (810 MB compared with 450 MB).

For the thaumatin dataset, the branch took 7s compared to 24s on
main, with the job split over 16 processes. Memory required went
up to 1.7 GB from 520 MB.
@codecov
Copy link

codecov bot commented Oct 4, 2022

Codecov Report

Merging #2238 (3a3f4b8) into main (c4b0d0e) will decrease coverage by 0.53%.
The diff coverage is 80.76%.

@@            Coverage Diff             @@
##             main    #2238      +/-   ##
==========================================
- Coverage   80.99%   80.46%   -0.54%     
==========================================
  Files         588      587       -1     
  Lines       66876    66870       -6     
  Branches     9440     8888     -552     
==========================================
- Hits        54169    53806     -363     
- Misses      10614    10999     +385     
+ Partials     2093     2065      -28     

@ndevenish ndevenish merged commit 7bc2a99 into main Oct 31, 2022
@ndevenish ndevenish deleted the parallelise-rs_mapper branch October 31, 2022 11:23
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

3 participants