Add configurable global mean removal transform by mcgibbon · Pull Request #1193 · ai2cm/ace

mcgibbon · 2026-05-26T16:00:00Z

Add an optional global_mean_removal config to SingleModuleStepConfig (and SingleModuleStepperConfig) that removes per-sample global means from fields before normalization and restores them after denormalization. This lets the network operate on anomalies relative to the current global mean, which can improve generalization for temperature-like fields under climate drift.

Changes:

fme.core.step.global_mean_removal: new module with SharedGlobalMeanRemoval (single reference field offset applied to a set of fields) and PerChannelGlobalMeanRemoval (each field's own mean removed independently). Both support optionally appending the removed mean as extra normalized input channels.
fme.core.step.SingleModuleStepConfig: new optional global_mean_removal field; forward transform applied before normalization, inverse transform applied after denormalization
fme.ace.stepper.SingleModuleStepperConfig: passes global_mean_removal through to SingleModuleStepConfig
Tests added
If dependencies changed, "deps only" image rebuilt and "latest_deps_only_image.txt" file updated

Extracts global mean removal from the normalizer into a separate transform that wraps SingleModuleStep, supporting shared-reference and per-channel modes with optional extra input channels. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

…ed field names Remove dead from_state method on GlobalMeanRemovalConfig. Update docstrings to explain that output-only fields are intentionally un-shifted by inverse_transform (the network learns to compensate via end-to-end training). Log a warning when field_names entries appear in neither in_names nor out_names. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

Move result tensors to CPU before comparing with CPU-created expected tensors, fixing failures on GPU CI. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

…onfig ABC Corrector, ocean, and prescribed prognostics now run in physical space (after inverse_transform) when global_mean_removal is active, so they see un-shifted values. Remove the unused GlobalMeanRemovalConfig ABC since only the union type is used in type hints. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

Moves forward_transform/inverse_transform into step_with_adjustments so corrector/ocean/prescribed adjustments stay in one place instead of being duplicated outside. Adds NoGlobalMeanRemoval null class to eliminate the forked code path in SingleModuleStep.step(). Fixes device mismatch in test_per_channel_masked_uses_zero by moving data_mask to the test device. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

The per-channel forward_transform was subtracting each field's per-sample spatial mean in physical space, leaving a post-normalization bias of -clim_mean/clim_std on every input pixel. For fields with significant climatology means (e.g. absolute temperatures) this fed the network large constant offsets and produced NaNs during training. Shift each field by clim_mean - sample_mean instead, mirroring the shared variant, so the post-normalization spatial mean is approximately zero. For masked samples the shift is zero (no forward or inverse shift), and the extra-channel formula becomes -shift/std (the anomaly for unmasked samples, zero for masked). Add regression tests that assert the post-normalization spatial mean is ~0 with realistic climatologies, including the masked case. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

mcgibbon · 2026-05-27T15:42:03Z

The way this is implemented raises some questions about interaction with masked-input training. The way masking is implemented right now, the step requires surface temperature always be provided when this feature is on, so that it can compute the global-mean and use it to normalize the other features.

I think it's also fairly clear we would want to be able to train while independently masking e.g. surface temperature and this new global-mean surface temperature input.

On the question of information leakage, we could later add a feature to noise this removed global-mean so it doesn't contain such reliable information about the global mean surface temperature.

We have two types of masking for inputs - missing data, and "we want to train this batch without this input". For missing data, the current behavior is basically correct - we can't train on a sample without surface temperature, unless we modify the scheme for the temperature removal to be less dependent on that field. For the "we want to train this batch without this input", we could maybe think about this as a dropout instead?

yyexela

Overall looks great! Just some minor comments.

Links the unit-level value coverage in test_global_mean_removal.py to the full step by asserting that enabling global_mean_removal changes the output relative to the baseline with the same seed/weights/inputs. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

yyexela

Issue: The new commit adds a test called test_step_global_mean_removal_affects_output that tests if the outputs of a full step with/without PerChannelGlobalMeanRemovalConfig differ. This test does not verify that SharedGlobalMeanRemovalConfig also produces different outputs.

Suggestion: Copy this test into two:

test_step_per_channel_global_mean_removal_affects_output
and
test_step_shared_global_mean_removal_affects_output

This then verifies that the transform is invoked during the step for both config types.

yyexela · 2026-05-28T15:25:41Z

Question: Should NoGlobalMeanRemoval be tested? I think it's worth verifying that when no global mean removal is selected, forcing the use of NoGlobalMeanRemoval, the functionality doesn't change. Though I guess one could argue that since the rest of the unit-tests were not affected by using NoGlobalMeanRemoval then it's implicitly tested.

mcgibbon · 2026-05-28T16:11:52Z

Though I guess one could argue that since the rest of the unit-tests were not affected by using NoGlobalMeanRemoval then it's implicitly tested.

Yes, I would say this is already covered by the backwards-compatibility inference tests.

Adds symmetric coverage for SharedGlobalMeanRemovalConfig alongside the existing PerChannelGlobalMeanRemovalConfig case via a shared helper. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

yyexela · 2026-05-28T17:35:31Z

Changes look good to me! ✅ I approve

Arcomano1234

Left a question that should be addressed / answered and a nit that I think should also be fixed

Arcomano1234 · 2026-05-28T18:21:56Z

+                    "which is not supported for shared global mean removal."
+                )
+        ref = input[ref_name]
+        sample_mean = ref.mean(dim=tuple(range(1, ref.ndim)))


Question: When we do the stats calculation for variables we use the area averaged mean correct? I was curious if you think this makes a difference / should we use aree-weighted just to match everything else we do

I did think about doing it that way, but I don't think it will make a difference or be worth the added complexity.

Yeah I don't think it should have profound effect but I think we should document it somewhere as I think most if not all of our "global means" in ACE are area-weighted. So if in the doc strings we say something like "subtract the raw global mean" that should be good enough

How about "cellwise" global mean?

I think our convention is generally to call the area-weighted global mean the area-weighted or weighted global mean, and I think cellwise global mean would be more verbose than needed in the variable/configuration names, but I think we could use it in the class names and docstrings.

Yeah we don't need to be that verbose for the variable names / function names just want to make sure its explicitly documented in the docstrings so we have a reference and so its obvious to external users.

Claude: Added cellwise-vs-area-weighted note in docstrings on GlobalMeanRemoval (the primary explanation, with rationale: simpler, network compensates during end-to-end training), and propagated short references to the two concrete impls (SharedGlobalMeanRemoval, PerChannelGlobalMeanRemoval) and both Config dataclasses so the distinction is obvious to external users at the user-facing entry points. Pushed in 786b074.

The method was only used by tests; production code reads ``n_extra_input_channels`` from the built transform object. Drop it from both configs and refactor the one remaining test to use the build path. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

Arcomano1234

I still think we should document it some where that we are using "cellwise" means for the subtraction but other than that this looks good to go. That documentation should be added but I don't need to re-review this, so I am approving the PR.

ACE conventionally uses area-weighted global means for stats and metrics; this transform deliberately uses the simpler cellwise (unweighted) mean and relies on end-to-end training to absorb the difference. Note this in the ABC and propagate to the concrete impl and config docstrings so the distinction is obvious to external users. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

mcgibbon and others added 3 commits May 26, 2026 15:30

Fix device mismatch in per-channel global mean removal tests

e652816

Move result tensors to CPU before comparing with CPU-created expected tensors, fixing failures on GPU CI. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

mcgibbon commented May 26, 2026

View reviewed changes

Comment thread fme/core/step/global_mean_removal.py Outdated

Comment thread fme/core/step/single_module.py Outdated

mcgibbon and others added 3 commits May 26, 2026 17:58

mcgibbon requested a review from yyexela May 27, 2026 15:42

mcgibbon mentioned this pull request May 27, 2026

Add normalizer option to remove global mean surface temperature #1191

Closed

2 tasks

yyexela reviewed May 27, 2026

View reviewed changes

Comment thread fme/core/step/test_global_mean_removal.py

Comment thread fme/core/step/test_step.py

Merge branch 'main' into feature/global-mean-removal-transform

36d637d

mcgibbon requested a review from Arcomano1234 May 27, 2026 21:00

yyexela reviewed May 27, 2026

View reviewed changes

Split step-level transform test for shared and per-channel configs

8f1c77b

Adds symmetric coverage for SharedGlobalMeanRemovalConfig alongside the existing PerChannelGlobalMeanRemovalConfig case via a shared helper. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

Arcomano1234 reviewed May 28, 2026

View reviewed changes

Arcomano1234 approved these changes May 28, 2026

View reviewed changes

mcgibbon and others added 2 commits May 28, 2026 19:01

Merge branch 'main' into feature/global-mean-removal-transform

5a7b1f5

mcgibbon enabled auto-merge (squash) May 28, 2026 19:31

mcgibbon merged commit c65f8db into main May 28, 2026
7 checks passed

mcgibbon deleted the feature/global-mean-removal-transform branch May 28, 2026 19:46

Conversation

mcgibbon commented May 26, 2026

Uh oh!

Uh oh!

Uh oh!

mcgibbon commented May 27, 2026

Uh oh!

yyexela left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

yyexela left a comment

Choose a reason for hiding this comment

Uh oh!

yyexela commented May 28, 2026

Uh oh!

mcgibbon commented May 28, 2026

Uh oh!

yyexela commented May 28, 2026

Uh oh!

Arcomano1234 left a comment

Choose a reason for hiding this comment

Uh oh!

Arcomano1234 May 28, 2026

Choose a reason for hiding this comment

Uh oh!

mcgibbon May 28, 2026

Choose a reason for hiding this comment

Uh oh!

Arcomano1234 May 28, 2026

Choose a reason for hiding this comment

Uh oh!

mcgibbon May 28, 2026

Choose a reason for hiding this comment

Uh oh!

mcgibbon May 28, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Arcomano1234 May 28, 2026

Choose a reason for hiding this comment

Uh oh!

mcgibbon May 28, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Arcomano1234 left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

mcgibbon May 28, 2026 •

edited

Loading