Conversation

@SimonHeybrock (Member) commented Jun 25, 2024

Changes

  • Bypass the same single-threading problem encountered in SANS. I think we either have to put a helper into ESSreduce or see whether the underlying problem should be fixed in Scipp (see the sketch after this list).
  • Avoid storing position for every event. This will only affect Geant4 data, so the improvement will not be seen for NeXus data. Note that this can lead to NaN positions if a voxel has no events. Is this a problem (it is not in the tests)? If so, we either need to revert or find a better solution.
  • Remove the tof and wavelength event coords that are not needed any more.
  • Fixes Check workflow performance #41 (but I guess it has to be revisited sooner or later). I did not spot any other "obvious" problems in the functions taking the most time.
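
For illustration, a minimal sketch of the content-buffer workaround mentioned in the first bullet (placeholder names, not the actual helper added in this PR), assuming the events have already been concatenated into a single bin and carry event coords for the requested edges:

import scipp as sc


def _bin_via_content_buffer(events: sc.DataArray, edges: dict) -> sc.DataArray:
    """Bin a 0-D binned data array by operating on its event buffer.

    Binning the 0-D binned object directly can hit a single-threaded code
    path, so we bin the contiguous event table instead.
    """
    if events.ndim == 0:
        content = events.value  # the event table backing the single bin
        return content.bin(**edges)
    return events.bin(**edges)

A hypothetical call would look like _bin_via_content_buffer(masked.bins.concat(), {'dspacing': dspacing_edges}).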

Baseline

Baseline for Geant4 workflow:

2.47 seconds load_geant4_csv
4.75 seconds load_geant4_csv
3.00 seconds extract_geant4_detector_data
6.11 seconds extract_geant4_detector_data
0.62 seconds normalize_by_proton_charge
4.38 seconds add_scattering_coordinates_from_positions
0.35 seconds apply_masks
0.99 seconds normalize_by_proton_charge
4.36 seconds convert_to_dspacing
5.20 seconds add_scattering_coordinates_from_positions
0.23 seconds apply_masks
2.94 seconds convert_to_dspacing
9.75 seconds focus_data_dspacing
10.12 seconds focus_data_dspacing
1.19 seconds normalize_by_vanadium_dspacing

I artificially increased the data size by modifying the data after loading:

def extract_geant4_detector_data(
    detector: RawDetector[RunType],
) -> RawDetectorData[RunType]:
    """Extract the histogram or event data from a loaded GEANT4 detector."""
    data = extract_detector_data(detector)
    # Each concatenation doubles the event count, so the seven calls below
    # inflate the data by a factor of 2**7 = 128.
    data = data.bins.concatenate(data)
    data = data.bins.concatenate(data)
    data = data.bins.concatenate(data)
    data = data.bins.concatenate(data)
    data = data.bins.concatenate(data)
    data = data.bins.concatenate(data)
    data = data.bins.concatenate(data)  # ~ 10 GByte (sample run)
    return RawDetectorData[RunType](data)

The timings reported for this function are therefore not representative of real data.

This branch

RawDetectorData[SampleRun] is now 6 GByte instead of 10 GByte:

2.64 seconds load_geant4_csv
5.61 seconds load_geant4_csv
2.01 seconds extract_geant4_detector_data
3.84 seconds extract_geant4_detector_data
0.34 seconds normalize_by_proton_charge
0.19 seconds apply_masks
0.18 seconds convert_to_dspacing
1.38 seconds focus_data_dspacing
0.47 seconds normalize_by_proton_charge
0.22 seconds apply_masks
0.20 seconds convert_to_dspacing
1.48 seconds focus_data_dspacing
0.83 seconds normalize_by_vanadium_dspacing

@SimonHeybrock SimonHeybrock requested a review from jl-wynen June 25, 2024 14:37
@SimonHeybrock SimonHeybrock marked this pull request as draft June 25, 2024 14:46
@SimonHeybrock (Member Author):

Converted to draft, since it looks like I messed up memory consumption (at least CI seems to be having trouble).

@SimonHeybrock (Member Author):

Looking at https://github.com/scipp/essdiffraction/actions/workflows/ci.yml, it seems the CI timings were just a normal outlier. Ready for review once more!

@SimonHeybrock SimonHeybrock marked this pull request as ready for review June 25, 2024 15:46
"intermediates[MaskedData[SampleRun]].bins.concat().hist(\n",
" two_theta=300, wavelength=300\n",
").plot(norm=\"log\")"
"two_theta = sc.linspace(\"two_theta\", 0.8, 2.4, 301, unit=\"rad\")\n",
Member:

Why the explicit limits? Does this really matter so much for performance?

Member Author:

Because we have some voxels with NaN positions and would get an exception otherwise.
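
For context, a minimal synthetic sketch (made-up data, not from this workflow) of why explicit limits matter here: with an integer bin count, hist has to derive the limits from the coordinate values, which (as noted above) raises an exception when the coordinate contains NaN, whereas explicit edges sidestep that.

import numpy as np
import scipp as sc

# One event carries a NaN two_theta, standing in for a voxel without events.
events = sc.DataArray(
    sc.array(dims=['event'], values=[1.0, 1.0]),
    coords={'two_theta': sc.array(dims=['event'], values=[1.2, np.nan], unit='rad')},
)

# events.hist(two_theta=300) would compute min/max of the coord, which
# contains NaN; explicit edges avoid inspecting the coord values.
two_theta = sc.linspace('two_theta', 0.8, 2.4, 301, unit='rad')
hist = events.hist(two_theta=two_theta)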

    res = da.group("sector", *elements)
else:
    res = da.group(*elements)
res.coords['position'] = res.bins.coords.pop('position').bins.mean()
Member:

Did you check whether this changes the final result?

Member Author:

I was hoping there were tests for that...?

@jl-wynen (Member) commented Jun 26, 2024:

There are no tests for the complete $I(d)$. We only test a subset of properties of the output.

The problem is that the workflow is incomplete. Most updates in the near future will break regression tests.
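
For reference, a minimal synthetic sketch (made-up data, not from the workflow or its tests) of what the line under discussion does: the per-voxel position becomes the mean of the event positions, and a voxel without events ends up with a NaN position.

import scipp as sc

# Two voxels: voxel 0 holds one event, voxel 1 holds none.
table = sc.DataArray(
    sc.array(dims=['event'], values=[1.0]),
    coords={
        'voxel': sc.array(dims=['event'], values=[0]),
        'position': sc.vectors(dims=['event'], values=[[1.0, 0.0, 0.0]], unit='m'),
    },
)
grouped = table.group(sc.array(dims=['voxel'], values=[0, 1]))
# Same pattern as in the diff above; the empty voxel gets a NaN mean position.
grouped.coords['position'] = grouped.bins.coords.pop('position').bins.mean()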

Comment on lines +17 to +20
def _drop_grouping_and_bin(
    data: sc.DataArray, *, dims_to_reduce: tuple[str, ...] | None = None, edges: dict
) -> sc.DataArray:
    all_pixels = data if dims_to_reduce == () else data.bins.concat(dims_to_reduce)
Member:

Suggested change
def _drop_grouping_and_bin(
    data: sc.DataArray, *, dims_to_reduce: tuple[str, ...] | None = None, edges: dict
) -> sc.DataArray:
    all_pixels = data if dims_to_reduce == () else data.bins.concat(dims_to_reduce)
def _drop_grouping_and_bin(
    data: sc.DataArray, *, edges: dict
) -> sc.DataArray:
    all_pixels = data.bins.concat()

because dims_to_reduce is never used.

Member Author:

I was using it for the two-theta case but then refactored. Since I wanted to consider moving this to ESSreduce, I thought I'd keep it.

# inferior performance when binning (no/bad multi-threading?).
# We operate on the content buffer for better multi-threaded performance.
if all_pixels.ndim == 0:
    content = all_pixels.bins.constituents['data']
Member:

Is it safer to use this? Because it excludes constituents that are out of range?

Suggested change
content = all_pixels.bins.constituents['data']
content = all_pixels.value

Member Author:

Hmm, good point, that looks simpler. I will try.
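
For context, a minimal synthetic sketch (made-up data, not PR code) of the difference discussed here: the constituents buffer is the full underlying event table, whereas .value of a 0-D binned object is only the slice selected by that bin's begin/end indices.

import scipp as sc

table = sc.DataArray(
    sc.array(dims=['event'], values=[1.0, 2.0, 3.0]),
    coords={'x': sc.array(dims=['event'], values=[0.1, 0.2, 0.9], unit='m')},
)
binned = table.bin(x=sc.array(dims=['x'], values=[0.0, 0.5, 1.0], unit='m'))
bin0 = binned['x', 0]  # 0-D binned data array referencing the first two events

full_buffer = bin0.bins.constituents['data']  # all three events in the buffer
in_range = bin0.value                         # only the two events of this bin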

# inferior performance when binning (no/bad multi-threading?).
# We operate on the content buffer for better multi-threaded performance.
if all_pixels.ndim == 0:
    content = all_pixels.bins.constituents['data']
Member:

It seems like this should be done by bin automatically. Is this possible?

Member Author:

See what I wrote in the PR:

Bypass the same single-threading problem encountered in SANS. I think we either have to put a helper into ESSreduce or see whether the underlying problem should be fixed in Scipp.

Member:

Ok. Can this be done in Scipp now instead of adding extra code here and potentially in other projects? If not, please open an issue!

@SimonHeybrock (Member Author) commented Jun 26, 2024:

Well, the larger issue is that we have to concat first, which is itself a workaround for other performance issues. Binning with a many-to-many mapping is a long-standing issue and has seen a couple of improvements in the past, but is apparently not fully solved (see scipp/scipp#1846).

What could probably be done in Scipp is to bypass the single-threading issue, but that also requires some thought (I am not sure there are no subtleties), and it is slightly odd to add an optimization for a solution that itself bypasses another problem.

@SimonHeybrock SimonHeybrock merged commit f4d0421 into main Jun 28, 2024
@SimonHeybrock SimonHeybrock deleted the performance branch June 28, 2024 03:24