Proton current correction #115

jokasimr · 2024-12-17T15:33:06Z

Fixes #78

Supersedes #84

SimonHeybrock · 2025-01-29T06:17:32Z

src/ess/reflectometry/conversions.py

+        fill_value=sc.scalar(float('nan'), unit=pc.unit),
+    )
+    # Useful for comparing the proton current to what is typical
+    da.coords['median_proton_current'] = sc.median(pc).data


Lower down we mask based on this. What if the beam was bad for some time, but there were few events? Shouldn't we compute based on the proton current of the events? Otherwise the median can be dragged down a lot and we end up masking the valid data?

Not sure what you mean.

> Shouldn't we compute based on the proton current of the events?

Compute the mask? Or something else?

> median can be dragged down a lot and we end up masking the valid data?

If the median is dragged down we mask less, not more. The mask is for events when the proton current is too low. Masking less can of course also be an issue.

My point was: We would like to have the median of when the beam is "good", not the median based on time intervals. Say we have a 1 hour run but only 5 minutes good beam, wouldn't the current approach end up masking everything?

In this case "good beam" is defined by what is typical.

> Say we have a 1 hour run but only 5 minutes good beam, wouldn't the current approach end up masking everything?

The current approach would end up masking nothing in that case.

The reason this is needed to begin with is that the Amor beam intensity periodically goes down briefly, and we want to mask regions with no/weak beam because otherwise we just amplify the background.
In normal operation, the beam being at 1/5 of the typical intensity or less is a good heuristic that the beam was not up.

We could also use some other heuristic here like a fixed cut-off that can be defined by the user.
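To make the two scenarios in this thread concrete, here is a minimal sketch in plain Python rather than scipp. The helper `low_current_mask`, the sample values, and the `fraction=0.2` default (standing in for the "1/5 of typical" heuristic) are illustrative assumptions, not the code in this PR.

```python
from statistics import median


def low_current_mask(proton_current, fraction=0.2):
    """Return (mask, typical): mask[i] is True where the current is below
    fraction * typical, with typical taken as the median over log entries."""
    typical = median(proton_current)
    return [pc < fraction * typical for pc in proton_current], typical


# Normal operation: the beam drops out briefly, so the median stays high.
normal = [100.0] * 55 + [5.0] * 5
mask_normal, typical_normal = low_current_mask(normal)

# Mostly bad beam: 5 good entries out of 60, so the median is the low value.
mostly_bad = [5.0] * 55 + [100.0] * 5
mask_bad, typical_bad = low_current_mask(mostly_bad)

print(typical_normal, sum(mask_normal))  # 100.0 5
print(typical_bad, sum(mask_bad))  # 5.0 0
```

In the second case nothing is masked, because the weak beam itself defines what is "typical". A fixed, user-defined cut-off would behave differently there.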

SimonHeybrock · 2025-01-29T06:20:54Z

src/ess/amor/workflow.py

+    if len(proton_current) != 0:
+        add_proton_current_coord(da, proton_current)
+        add_proton_current_mask(da)
+        correct_by_proton_current(da)


I initially thought this provider was doing an in-place modification, which would be problematic. But on closer look it seems that add_coords makes (at least) a shallow copy? The way things are written now, this is a bit hard to see though.

It does modify the weights in-place, but that's what we do everywhere, right? (Since we don't want to copy the weights.)

add_coords makes a shallow copy because that is what transform_coords returns.

> but that's what we do everywhere right

As far as I am aware we are usually (but not always) avoiding input modifications in providers?

It seems you are right, I'll change it to update inplace.
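The concern in this thread can be illustrated with a plain-Python analogy. This is not scipp: the dict stands in for a data array and the list under `"weights"` for the event weights, and the function `scaled` is a hypothetical name used only for this sketch. It shows why a shallow copy alone does not protect the input from in-place updates of the weights.

```python
import copy

# Plain-Python analogy (not scipp): a container holding shared buffers.
original = {"weights": [1.0, 2.0, 3.0], "coords": {"x": [0, 1, 2]}}

shallow = copy.copy(original)

# Adding an entry to the shallow copy does not touch the original container...
shallow["mask"] = [False, False, True]

# ...but the weights list is still shared, so an in-place update leaks back.
for i in range(len(shallow["weights"])):
    shallow["weights"][i] *= 0.5

print("mask" in original)  # False
print(original["weights"])  # [0.5, 1.0, 1.5]


def scaled(data, factor):
    """Non-mutating variant: build new weights, leave the input untouched."""
    out = copy.copy(data)
    out["weights"] = [w * factor for w in data["weights"]]
    return out


fresh = {"weights": [1.0, 2.0, 3.0]}
result = scaled(fresh, 0.5)
print(fresh["weights"], result["weights"])  # [1.0, 2.0, 3.0] [0.5, 1.0, 1.5]
```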

SimonHeybrock · 2025-02-04T04:17:40Z

src/ess/amor/workflow.py

    da = add_coords(da, graph)
-    da = add_masks(da, ylim, zlims, bdlim, wbins)
+    add_masks(da, ylim, zlims, bdlim, wbins)
+
+    # Copy before applying corrections
+    da.data = da.data.copy()
    correct_by_footprint(da)
-    return da
+
+    # For some older Amor files there are no entries in the proton current log
+    if len(proton_current) != 0:
+        add_proton_current_coord(da, proton_current)
+        add_proton_current_mask(da)


Thinking through this once more:

1. add_coords makes a shallow copy (including bin contents).

2. add_masks does an in-place modification, despite the similar name.

3. We deep copy everything, including coords that do not change (this is wasteful).

4. Apply corrections on data in-place.

Consider whether this is good. For example, as it stands now, one should not call the functions in any other order. It also looks like add_masks, despite being in a different module, is an implementation detail of this function and "risky" to use in other contexts.

Performance-wise, you might be better off changing correct_by_footprint and correct_by_proton_current to not modify the input. Then you can avoid the da.data.copy(), which also copies all event coords. Optionally, you may also consider combining all corrections before multiplying them into the data, but that may or may not make any difference, depending on the dimensions.
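A rough sketch of the "combine corrections first" idea, again in plain Python rather than scipp. The names `combine_factors` and `apply_combined`, and the treatment of each correction as a multiplicative per-event factor, are assumptions made for illustration; the actual corrections in this PR may be applied differently.

```python
def combine_factors(*factors):
    """Multiply several per-event correction factors element-wise."""
    combined = [1.0] * len(factors[0])
    for factor in factors:
        combined = [c * f for c, f in zip(combined, factor)]
    return combined


def apply_combined(weights, *factors):
    """Return new weights; the input list is not modified."""
    combined = combine_factors(*factors)
    return [w * c for w, c in zip(weights, combined)]


weights = [2.0, 4.0, 8.0]
footprint_factor = [0.5, 0.5, 0.25]  # hypothetical values
proton_current_factor = [2.0, 1.0, 0.5]  # hypothetical values

corrected = apply_combined(weights, footprint_factor, proton_current_factor)
print(corrected)  # [2.0, 2.0, 1.0]
print(weights)  # [2.0, 4.0, 8.0]
```

With this shape the weights are written once, into a new list, so no up-front copy of the input is needed. Whether it is faster in practice depends on the dimensions of the factors, as noted above.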

> add_coords makes a shallow copy (including bin contents).

Not sure what you mean here. My understanding was that transform_coords makes a shallow copy and doesn't copy bin contents.

> Then you can avoid the da.data.copy(), which also copies all event coords.

Why would da.data.copy() copy the event coords? Isn't da.data just a variable and .copy() copies that one variable?

Yes, da.data is a variable. But if da.data is a binned variable, then the event coords are part of da.data.

Good to know 👍 I'll change it

👍 You can see this for example in the HTML repr:

jokasimr force-pushed the proton-current branch from a112f15 to 5b15893 Compare January 27, 2025 13:50

jokasimr added 2 commits January 27, 2025 15:22

feat: proton current correction

f5cb9be

docs: clarify that functions modify input inplace

db56047

jokasimr force-pushed the proton-current branch from 8446ed3 to db56047 Compare January 27, 2025 14:23

jokasimr added 2 commits January 27, 2025 15:53

test: proton current is added

68b9c6f

test: correction value is expected

cc610e4

SimonHeybrock reviewed Jan 29, 2025


fix: avoid mutating input

e23a039

SimonHeybrock approved these changes Feb 4, 2025


fix: remove inplace changes

12d2125

SimonHeybrock approved these changes Feb 4, 2025


jokasimr merged commit daee728 into main Feb 4, 2025
4 checks passed

jokasimr deleted the proton-current branch February 4, 2025 15:21


3 participants
