Experimental weighting #2

elindgren · 2025-04-07T11:30:28Z

Adds a weighting parameter "weight" to an Experiment.
The residuals are weighted by a weighting factor. Note that this means that the contribution to chi_squared is weighted by a factor of weight**2.

Skipping error handling for now, ideally the joint_fit object should be converted to a Python object that validates the weights etc. But let's not get ahead of ourselves.

Fixes #5

…impacts chi squared

…ey for weights

…ced chi-squared

…experiments

elindgren · 2025-04-07T11:31:42Z

Migrating some of the discussion here for future reference:

By @AndrewSazonov

I’d suggest the following API structure, following the same logic we use for linked phases. For example:

expt.linked_phases.add("phase1", scale=0.5)
expt.linked_phases.add("phase2", scale=1.3)

In this model, the phase doesn’t know anything about its contribution to an experiment. It’s the experiment that tracks which phases are involved, and their relative contributions (via scale).

For experiments, I propose we follow the same principle. An experiment is just an experiment. It shouldn't know anything about its role or weight in a fitting process. That responsibility should live in the analysis object.

Suppose we have:

analysis.fit_mode = "joint"

When this mode is selected, a collection like analysis.joint_fit becomes accessible. Then we can do:

analysis.joint_fit.add("expt1", weight=0.4)  # Default weight could be 0.5
analysis.joint_fit.add("expt2", weight=0.6)  # Default weight could be 0.5

Only the experiments listed in joint_fit would be used during analysis.fit().

Alternatively, switching analysis.fit_mode from "single" to "joint" could automatically populate analysis.joint_fit with all defined experiments, each with equal default weights.

Then the user can adjust weights like this:

analysis.joint_fit["expt1"].weight = 0.4
analysis.joint_fit["expt2"].weight = 0.6

This keeps the model consistent with how we've already handled phase contributions and keeps responsibility for fit-specific configuration clearly within the analysis object.

elindgren · 2025-04-07T11:33:32Z

By @elindgren

@AndrewSazonov I decided on keeping the weights in the _residual_function, instead of moving them to where reduced chi-squared is computed. It didn't really make sense to have it there, especially when considering other fitting modes than joint. Instead, I multiply the residuals with the square roots of the weights, so that the weights enter linearly into the chi-squared.

Also, I normalized the weights such that they sum up to the number of experiments. A two-experiment fit, with weights 0.5, will thus effectively have weights 1.0 and leave the reduced chi-squared unchanged compared to before. Thus, if one adds multiple experiments, one would expect the chi-squared to grow with the number of expeirments, even if all fits are good. Alternatively, one could have the weights sum up to 1, which would leave the chi-squared unchanged as more experiments are added (if all the fits are equally good). I'm not really sure which of these two is preferred, what do you say @AndrewSazonov?

For reference, we decided that the sum of weights should sum up to the number of experiments. This ensures that one retrieves the same reduced chi_squared when fitting a dataset as when splitting the dataset in two parts and fitting the two parts together.

elindgren · 2025-04-07T11:34:38Z

tests/functional_tests/fitting/test_joint-fit.py


    # Compare fit quality
-    assert_almost_equal(project.analysis.fit_results.reduced_chi_square, 21.1, decimal=1)
+    assert_almost_equal(project.analysis.fit_results.reduced_chi_square, 14.4, decimal=1)


Note that I had to adjust the expected chi_squared after changing the target sum of weights.

src/easydiffraction/experiments/experiments.py

rozyczko · 2025-04-07T12:00:47Z

src/easydiffraction/analysis/analysis.py


        # Run the fitting process
        experiment_ids = list(experiments._items.keys())



the line above should also reuse the new experiments.ids property

Fixed, thanks!

src/easydiffraction/analysis/analysis.py

AndrewSazonov · 2025-04-07T14:08:23Z

Two comments for now:

Changing the weights of the two experiments in joint-fit_split-single-dataset.py doesn’t seem to affect the resulting chi2 value - it consistently remains at 4.66. This value should vary depending on the individual experiment weights.

I suggest implementing the following API for selecting and weighting experiments in a joint fit:

project.analysis.joint_fit_experiments.add("npd", weight=0.6)
project.analysis.joint_fit_experiments.add("xrd", weight=0.4)

And to update the weights later, if needed:

project.analysis.joint_fit_experiments["npd"].weight = 0.7
project.analysis.joint_fit_experiments["xrd"].weight = 0.3

…ciated weights

elindgren · 2025-04-08T14:31:18Z

@AndrewSazonov I've implemented the two points we discussed yesterday.

AndrewSazonov · 2025-04-09T07:59:40Z

It works as expected in terms of the resulting chi2 values. Thank you @elindgren for implementing this.

One minor API detail. The correct way to update experiment weights should be:

project.analysis.joint_fit_experiments["npd"].weight = 0.7
project.analysis.joint_fit_experiments["xrd"].weight = 0.3

instead of:

project.analysis.joint_fit_experiments["npd"] = 0.7
project.analysis.joint_fit_experiments["xrd"] = 0.3

But, since I’m currently working on significant changes in the fix-output branch, I suggest we go ahead and merge this PR, and I’ll handle the adjustment there.

elindgren · 2025-04-09T09:00:44Z

It works as expected in terms of the resulting chi2 values. Thank you @elindgren for implementing this.

One minor API detail. The correct way to update experiment weights should be:
project.analysis.joint_fit_experiments["npd"].weight = 0.7
project.analysis.joint_fit_experiments["xrd"].weight = 0.3
instead of:
project.analysis.joint_fit_experiments["npd"] = 0.7
project.analysis.joint_fit_experiments["xrd"] = 0.3
But, since I’m currently working on significant changes in the fix-output branch, I suggest we go ahead and merge this PR, and I’ll handle the adjustment there.

ah okay, I missed that! Sounds good, I'll merge then

elindgren and others added 10 commits April 2, 2025 16:19

add first example of experimental weighting

d66a0a0

scale expected chi squared by a factor of 4, since default weighting …

c1fd60d

…impacts chi squared

play around with joint_fit api; currently using ID of experiment as k…

58b7120

…ey for weights

take the square root of weights so that they enter linearly into redu…

54184b3

…ced chi-squared

automatically create the joint fit object, with 0.5 as default weight

95828c6

use a sum of weights to 1 instead of N

753a612

update test to actually set the joint_fit object

521070e

Adds two new examples to show expected GOF in joint fit

de43310

increase expected chi_squred to expected value when fitting multiple …

cf13aaf

…experiments

Merge remote-tracking branch 'origin/develop' into weight-experiments

c000f5d

elindgren added the [scope] enhancement Adds/improves features (major.MINOR.patch) label Apr 7, 2025

elindgren requested a review from AndrewSazonov April 7, 2025 11:33

elindgren commented Apr 7, 2025

View reviewed changes

rozyczko reviewed Apr 7, 2025

View reviewed changes

elindgren added 2 commits April 7, 2025 14:17

make better use of ids property

aa83932

use hasattr to check for existance of ids

6075581

AndrewSazonov and others added 3 commits April 7, 2025 16:09

Updates data paths

1ebc893

fix issue where joint_fit was not updated because of setdefault

7962ca7

implement JointFitExperiments as a container for experiments and asso…

6702551

…ciated weights

elindgren merged commit 48f365c into develop Apr 9, 2025

AndrewSazonov deleted the weight-experiments branch August 6, 2025 12:49

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Experimental weighting #2

Experimental weighting #2

Uh oh!

elindgren commented Apr 7, 2025 •

edited

Loading

Uh oh!

elindgren commented Apr 7, 2025

Uh oh!

elindgren commented Apr 7, 2025

Uh oh!

elindgren Apr 7, 2025

Uh oh!

Uh oh!

rozyczko Apr 7, 2025

Uh oh!

elindgren Apr 7, 2025

Uh oh!

Uh oh!

AndrewSazonov commented Apr 7, 2025

Uh oh!

elindgren commented Apr 8, 2025

Uh oh!

AndrewSazonov commented Apr 9, 2025

Uh oh!

elindgren commented Apr 9, 2025

Uh oh!

Uh oh!


		# Run the fitting process
		experiment_ids = list(experiments._items.keys())

Experimental weighting #2

Experimental weighting #2

Uh oh!

Conversation

elindgren commented Apr 7, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

elindgren commented Apr 7, 2025

Uh oh!

elindgren commented Apr 7, 2025

Uh oh!

elindgren Apr 7, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

rozyczko Apr 7, 2025

Choose a reason for hiding this comment

Uh oh!

elindgren Apr 7, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

AndrewSazonov commented Apr 7, 2025

Uh oh!

elindgren commented Apr 8, 2025

Uh oh!

AndrewSazonov commented Apr 9, 2025

Uh oh!

elindgren commented Apr 9, 2025

Uh oh!

Uh oh!

elindgren commented Apr 7, 2025 •

edited

Loading