FIX: Model/calculate: Piecewise branching performance and extrapolation #431

richardotis · 2022-10-13T16:47:58Z

Complex multi-sublattice phases with lots of parameters and using the magnetic ordering model can challenge pycalphad's Just-In-Time (JIT) compiler, especially for computation of second derivatives. In the worst case, the compiler will hang indefinitely and consume RAM until the process is killed.

The reason for this is the increase in algorithmic complexity that comes from having deep piecewise temperature branching in the Model object's representation of the Gibbs energy. However, a common case for the piecewise parameter description is that there is really only one nonzero "branch" for the entire temperature range. While TDB formally wraps every parameter in a piecewise, the Model object is free to discard trivial branches at build time. That is the approach used in the patch for this PR.

This PR includes a test for such a difficult case, where the Model object for the corresponding phase has a Gibbs energy Hessian that cannot be built by the develop JIT compiler. The patch is able to reduce the number of Piecewise nodes in the Gibbs energy's abstract syntax tree by more than half. For the sake of efficiency, instead of a full correctness test we only test that the number of nodes is reduced by half.

In addition, this PR includes a change to the point sampling algorithm in calculate. Currently the sampler (when fixed_grid is True) tries to add additional points between all pairwise combinations of endmembers. For certain multi-component, multi-sublattice phases, there can be thousands of endmembers and, thus, millions of endmember pairs. The proposed change detects this case; when there are more than 100,000 points to be added, the algorithm only adds up to the maximum specified by the constant. All endmembers are still added, and this change does not affect the random sampling portion of the algorithm.

Finally, in addition to trivial branch elimination, the Model class is now updated to extrapolate the lower and upper temperature bounds of all piecewise expressions to negative and positive infinity. This brings pycalphad into line with how TC will extrapolate outside of temperature bounds for parameters. Note that limits specified by the TEMPERATURE-LIMITS TDB command are still not enforced and it is still possible for users to compute in non-physical regions of a database, but this was always possible; this change will allow for compatibility with a greater number of legacy databases that rely on the extrapolation behavior.

For certain pathological cases, this will reduce memory consumption by over 90% and resolve classes of memory errors for users attempting multi-component calculations with complex databases.

This reverts commit 6ef8a4c.

…oo many

codecov · 2022-10-15T01:52:31Z

Codecov Report

Merging #431 (3541397) into develop (dd97b10) will increase coverage by 0.18%.
Report is 4 commits behind head on develop.
The diff coverage is 98.95%.

@@             Coverage Diff             @@
##           develop     #431      +/-   ##
===========================================
+ Coverage    90.30%   90.48%   +0.18%     
===========================================
  Files           50       50              
  Lines         7774     7871      +97     
===========================================
+ Hits          7020     7122     +102     
+ Misses         754      749       -5

Files Changed	Coverage Δ
pycalphad/core/minimizer.pyx	`85.66% <75.00%> (+1.33%)`	⬆️
pycalphad/core/calculate.py	`93.57% <100.00%> (+0.02%)`	⬆️
pycalphad/core/constants.py	`100.00% <100.00%> (ø)`
pycalphad/model.py	`92.35% <100.00%> (+0.41%)`	⬆️
pycalphad/tests/test_calculate.py	`100.00% <100.00%> (ø)`
pycalphad/tests/test_equilibrium.py	`98.47% <100.00%> (+0.02%)`	⬆️
pycalphad/tests/test_model.py	`100.00% <100.00%> (ø)`

... and 3 files with indirect coverage changes

📣 We’re building smart automated test selection to slash your CI/CD build times. Learn more

richardotis · 2022-10-15T01:55:06Z

I didn't manage to get to root cause, but I did verify that you can unwrap Piecewise for every energy contribution except the ordering contribution without hitting the Mac-only failure in test_eq_alni_high_temp and test_eq_alni_low_temp. The problem is then you don't reduce enough of the Piecewise nodes to pass test_model_deep_branching. I switched the unwrapping to happen inside of Model.symbol_replace, and then I can pass the suite (including all new tests) on all platforms.

I have added a test, ~~test_calculation_jitter (jitter isn't the right term, probably needs a rename)~~ test_calculation_symengine_evalf_energy_difference, which should help explicitly detect this strange case in the future if it arises.

pycalphad/model.py

…ping

…airs, not the max

richardotis · 2023-07-14T23:43:41Z

@bocklund ping

bocklund · 2023-08-02T23:22:33Z

One hesitation I have for this PR is that the mixed behavior between expressions that do extrapolate outside of temperature limits automatically and expressions that don't will make it more difficult to debug cases where users are doing calculations outside of temperature limits.

I think this PR is valuable and probably the sensible solution is to extrapolate everywhere. Is it too big of an effort or scope change to do that here? Does it need a separate issue?

richardotis · 2023-08-02T23:36:37Z

I'd argue it's undefined behavior in pycalphad to compute outside of the temperature limits, and it'd still be undefined behavior after this PR was merged. I agree it's a footgun and the user has no way of knowing without examining the database, though. (PR has been updated to enable full extrapolation)

For adjusting extrapolation behavior, there are two approaches:

Database: modify the limits at Database construction time; pros: easy to do; cons: breaks roundtrip consistency
Model: modify the limits at Model construction time; pros: protects Database integrity; cons: more complex algorithm, have to walk the expression tree, unclear whether we can detect all the cases, potentially a performance challenge

…s expected

richardotis · 2023-08-11T00:59:53Z

@bocklund This PR now extrapolates all temperature bounds, including multi-branch piecewise expressions. This uses the Model-based extrapolation approach referenced above. It ended up being a smaller delta than I expected, and the performance seems to be fine (though feel free to test on complex databases). The small performance impact makes sense to me in retrospect, as we expect fewer than 100 Piecewise atoms per model, and many of those will be trivial non-zero branches that hit the fast path in Model.unwrap_piecewise.

You can look at the added test to see how you could test the performance while staying on the same branch. In practice, I'm not sure Model.extrapolate_temperature_bounds should be part of the public API as setting it to False merely disables the multi-branch extrapolation but keeps the trivial non-zero branch pruning proposed here, which we need for performance reasons. I'm actually not sure we could guarantee no extrapolation at all, but if you think it's worth it to try we could move the Model.extrapolate_temperature_bounds check further up in Model.unwrap_piecewise to protect both code branches. This would allow users to fully turn off the Model behavior changes introduced in this PR if desired.

bocklund

What's here seems reasonable to me. Although I agree that it's technically correct, I haven't seen a use case in the wild that used or relied on temperature limits in a useful way. I'm not too worried about going out of our way to support fully correct non-extrapolation for now (YAGNI).

richardotis added 3 commits October 13, 2022 09:32

FIX: Model: Optimize away trivial Piecewise branches for performance

9eed7c9

FIX: Model: Use raw string to avoid escape sequence in docstring

4c0318f

TST: test_model: Add newline at end of file

e4698a4

richardotis added the bug label Oct 13, 2022

richardotis added this to In progress in Path to 1.0 Oct 13, 2022

richardotis added 4 commits October 13, 2022 10:48

WIP: Model: Try removing unnecessary branch to fix Mac errors

ec4bc10

CI test only, should be reverted

6ef8a4c

Revert "CI test only, should be reverted"

671e006

This reverts commit 6ef8a4c.

FIX/TST: calculate: Do not sample between endmembers when there are t…

042af2a

…oo many

richardotis changed the title ~~FIX: Model: Deep piecewise branching performance~~ FIX: Model/calculate: Deep piecewise branching performance Oct 14, 2022

richardotis added 15 commits October 14, 2022 11:37

Debugging code

5c731eb

fix typo

3ff08a0

more debugging code

4a5e587

debugging toggle verbosity for problem tests

c38693f

debugging add energy test

4eed79f

debugging force lambda backend

e662dea

Debugging focus fcc_l12 phase

4c51e1b

debugging piecewise atom level

cacd39a

debugging fix

3d3a28d

debugging force numerical result

5e54abd

debugging switch to energy level evaluation

9fc9bc5

WIP: attempt to fix numerical difference through forced type conversion

ddfa33b

debugging see if disordered fcc has the same issue

644d3df

WIP: Try not replacing the ordering contribution

82ec816

WIP: Try unwrapping symbols only

b5be57c

richardotis requested a review from bocklund October 15, 2022 01:56

richardotis added 2 commits October 15, 2022 09:28

BLD: deploy: Bump versions for wheel build/deploy

771564f

TST: test_calculate: Change test name

aa279f2

MAINT: test_equilibrium: Remove debug verbosity from equilibrium calls

a54faec

bocklund requested changes Feb 17, 2023

View reviewed changes

pycalphad/model.py Outdated Show resolved Hide resolved

pycalphad/model.py Outdated Show resolved Hide resolved

richardotis added 3 commits February 18, 2023 10:20

Merge branch 'develop' into fix-large-branching

ee308d3

MAINT: Model: Remove unnecessary float coercion code

aadfda3

MAINT: Model: Try removing iteration count limit for Piecewise unwrap…

fb140a2

…ping

richardotis requested a review from bocklund February 18, 2023 17:30

richardotis added 3 commits April 6, 2023 17:13

FIX: calculate: Still add some extra_points for very complex phases

0cafa39

Merge branch 'develop' into fix-large-branching

1abd412

FIX: calculate: Compute lingrid based on actual number of endmember p…

acdb576

…airs, not the max

richardotis added 3 commits August 10, 2023 11:31

ENH/FIX: Model: Extrapolate beyond temperature limits by default

0701ee6

Merge branch 'develop' into fix-large-branching

87df9f5

TST/FIX: extrapolate_temperature: Confirm Model extrapolation works a…

3541397

…s expected

richardotis changed the title ~~FIX: Model/calculate: Deep piecewise branching performance~~ FIX: Model/calculate: Piecewise branching performance and extrapolation Aug 11, 2023

bocklund approved these changes Aug 22, 2023

View reviewed changes

richardotis merged commit 7224eb8 into pycalphad:develop Aug 23, 2023
26 checks passed

Path to 1.0 automation moved this from In progress to Done Aug 23, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

FIX: Model/calculate: Piecewise branching performance and extrapolation #431

FIX: Model/calculate: Piecewise branching performance and extrapolation #431

richardotis commented Oct 13, 2022 •

edited

codecov bot commented Oct 15, 2022 •

edited

richardotis commented Oct 15, 2022 •

edited

richardotis commented Jul 14, 2023

bocklund commented Aug 2, 2023 •

edited

richardotis commented Aug 2, 2023 •

edited

richardotis commented Aug 11, 2023 •

edited

bocklund left a comment

FIX: Model/calculate: Piecewise branching performance and extrapolation #431

FIX: Model/calculate: Piecewise branching performance and extrapolation #431

Conversation

richardotis commented Oct 13, 2022 • edited

codecov bot commented Oct 15, 2022 • edited

Codecov Report

richardotis commented Oct 15, 2022 • edited

richardotis commented Jul 14, 2023

bocklund commented Aug 2, 2023 • edited

richardotis commented Aug 2, 2023 • edited

richardotis commented Aug 11, 2023 • edited

bocklund left a comment

Choose a reason for hiding this comment

richardotis commented Oct 13, 2022 •

edited

codecov bot commented Oct 15, 2022 •

edited

richardotis commented Oct 15, 2022 •

edited

bocklund commented Aug 2, 2023 •

edited

richardotis commented Aug 2, 2023 •

edited

richardotis commented Aug 11, 2023 •

edited