
Serialisation of models #3397

Merged (32 commits) on Nov 28, 2023
Conversation

Contributor

@pipliggins pipliggins commented Oct 2, 2023

Description

Creates a serialisation method for writing PyBaMM models to JSON. Models can be written to/read-in from JSON by saving the discretised model properties. Variables, meshes and geometry can be optionally saved if users wish to use PyBaMM's plotting functionality when a model is read back in and solved.

A Serialise class is created to contain the methods, including custom JSON encoders for PyBaMM objects which can iterate through expression trees.
Symbols and meshes have to_json and from_json functions added to enable PyBaMM objects to be serialised using JSON.
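The encoder pattern described above can be sketched in plain Python. This is a minimal illustration of custom JSON encoding for a tree of objects, not PyBaMM's actual Serialise implementation; the `Node` class and its methods here are hypothetical stand-ins for PyBaMM symbols.

```python
import json

class Node:
    """Hypothetical stand-in for a PyBaMM Symbol in an expression tree."""
    def __init__(self, name, children=()):
        self.name = name
        self.children = list(children)

    def to_json(self):
        # Each node serialises its own attributes.
        return {"name": self.name}

class TreeEncoder(json.JSONEncoder):
    """Custom encoder that recurses through an expression tree."""
    def default(self, obj):
        if isinstance(obj, Node):
            entry = obj.to_json()
            # Recurse into children so the whole tree is serialised.
            entry["children"] = [self.default(c) for c in obj.children]
            return entry
        return super().default(obj)

tree = Node("plus", children=[Node("x"), Node("y")])
encoded = json.dumps(tree, cls=TreeEncoder)
```

Reading the model back in would then walk the decoded dictionaries and rebuild the corresponding objects, which is roughly the role of the `_from_json` class methods discussed below.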

Fixes #2787

Type of change

Please add a line in the relevant section of CHANGELOG.md to document the change (include PR #) - note reverse order of PR #s. If necessary, also add to the list of breaking changes.

  • New feature (non-breaking change which adds functionality)
  • Optimization (back-end change that speeds up the code)
  • Bug fix (non-breaking change which fixes an issue)

Key checklist:

  • No style issues: $ pre-commit run (or $ nox -s pre-commit) (see CONTRIBUTING.md for how to set this up to run automatically when committing locally, in just two lines of code)
  • All tests pass: $ python run-tests.py --all (or $ nox -s tests)
  • The documentation builds: $ python run-tests.py --doctest (or $ nox -s doctests)

You can run integration tests, unit tests, and doctests together at once, using $ python run-tests.py --quick (or $ nox -s quick).

Further checks:

  • Code is commented, particularly in hard-to-understand areas
  • Tests added that prove fix is effective or that feature works

Create to_json() functions in corresponding classes
Working basic de/serialisation
Creates _from_json() functionality
Stores save_/load_model functions.
Currently working for default models.

Add errors, make accessible from Simulation
Option to save mesh, variables and geometry

Draft notebook written
Added warning if variables are not provided and try to plot
Put warning in for BaseModel - atm requires more model information
to re-create the model.
allow interpolant to be serialised
fix concatenation with debug mode
switch msmr warnings
allow BaseModel to run without rhs
Add to_from_json test for Events
add to_json tests for meshes
All but 3 int. tests passing, high accuracy diff failures
update integration test to pass at lower accuracy
Remove outputs from example notebook

@pipliggins pipliggins linked an issue Oct 2, 2023 that may be closed by this pull request
Contributor

@martinjrobins martinjrobins left a comment


Thanks @pipliggins, this is great work to serialise/deserialise such large and complicated classes! I've only got up to pybamm/models/event.py in the review and need to stop for now, so I'll add this part as a comment and finish the review later. I also notice there is a failing test to fix, but it looks like a small thing.

Review comments on: pybamm/expression_tree/array.py, pybamm/expression_tree/unary_operators.py, pybamm/meshes/one_dimensional_submeshes.py
Contributor

@martinjrobins martinjrobins left a comment


Finished going through this now, sorry for the delay! Looks excellent @pipliggins; happy to merge once you've dealt with the minor comments below.

Review comments on: pybamm/plotting/quick_plot.py, tests/unit/test_expression_tree/test_interpolant.py, tests/unit/test_expression_tree/test_unary_operators.py
* Add pybamm version to JSON file
* Re-word missing variable message
* Refactor unary_operator _from_json()
@codecov

codecov bot commented Oct 20, 2023

Codecov Report

All modified and coverable lines are covered by tests ✅

Comparison is base (5322895) 99.58% compared to head (df35b91) 99.59%.
Report is 21 commits behind head on develop.

Additional details and impacted files
@@             Coverage Diff             @@
##           develop    #3397      +/-   ##
===========================================
+ Coverage    99.58%   99.59%   +0.01%     
===========================================
  Files          256      257       +1     
  Lines        20131    20639     +508     
===========================================
+ Hits         20048    20556     +508     
  Misses          83       83              


Contributor

@martinjrobins martinjrobins left a comment


Thanks @pipliggins, all looks good except for a few added lines that aren't covered by tests; see the codecov report.

@brosaplanella
Sponsor Member

The failing mac tests are unrelated, so feel free to review and merge if you think it is ready.

Contributor

@martinjrobins martinjrobins left a comment


Looks great, thanks @pipliggins. Happy to merge once the changelog has been added to.

@TomTranter
Contributor

@pipliggins @martinjrobins thanks very much for working on this. I think it will make PyBaMM much more transferable. I checked out the branch, though, and ran into a couple of issues when trying to couple a loaded model with an experiment. All the examples just solve over a time span, assuming a fixed current.

This code fails

import pybamm
model = pybamm.lithium_ion.SPM()
experiment = pybamm.Experiment(["Discharge at 1C for 1 hour"])
sim = pybamm.Simulation(model, experiment=experiment)
sim.solve()
sim.save_model("spm_experiment", mesh=False, variables=False)

with

NotImplementedError:
PyBaMM can only serialise a discretised model.
Ensure the model has been built (e.g. run solve()) before saving.

and this code fails

# create the model
spm_model = pybamm.lithium_ion.SPM()
# set up and discretise ready to solve
geometry = spm_model.default_geometry
param = spm_model.default_parameter_values
param.process_model(spm_model)
param.process_geometry(geometry)
mesh = pybamm.Mesh(geometry, spm_model.default_submesh_types, spm_model.default_var_pts)
disc = pybamm.Discretisation(mesh, spm_model.default_spatial_methods)
disc.process_model(spm_model)
# Serialise the spm_model, providing the variables and the mesh
spm_model.save_model("example_model", variables=spm_model.variables, mesh=mesh)
# read back in
new_spm_model = pybamm.load_model("example_model.json")
experiment = pybamm.Experiment(["Discharge at 1C for 1 hour"])
new_spm_sim = pybamm.Simulation(new_spm_model, experiment=experiment)
# plot the solution
new_spm_sim.solve()
new_spm_sim.plot()

with
AttributeError: 'DomainConcatenation' object has no attribute '_full_mesh'

Is there a plan to implement this, or does it require experiments to be serialised? It would be a shame if you couldn't re-use the model with different simulations.

@pipliggins
Contributor Author

@TomTranter thanks for raising this! Models coupled to experiments aren't a workflow we'd considered yet, and given that a single experiment can hold multiple models, this will require some work and thought around how best to structure the serialised file(s), particularly for import.

Having spoken to @martinjrobins, the plan is to merge the current branch (I've added an explicit warning that serialising models coupled to experiments is not yet supported), close this pull request and create a new one focused on expanding the basic model serialisation established here to support the Experiment class.

@TomTranter
Contributor

I can see why saving a model once it's coupled to a simulation doesn't work without extra development, but the second example I posted (just saving and loading a model before putting it into a sim with an experiment) feels like it should work. It's a bit confusing having the sim save the model, actually. I appreciate why it's been done that way: a sim changes its associated objects, and Simulation probably contains too much logic rather than simply being a way to group those components together.

Member

@Saransh-cpp Saransh-cpp left a comment


Chiming in to review the type hints 🙂

Edit: this can be ignored, given that the main motive of this PR was not to add type hints

@@ -57,6 +57,30 @@ def __init__(
name, domain=domain, auxiliary_domains=auxiliary_domains, domains=domains
)

@classmethod
def _from_json(cls, snippet: dict):

Given that PyBaMM supports Python 3.8+, each file using the newer type hints must import

from __future__ import annotations

at the top to ensure backward compatibility. This can also be automated using the isort rules in ruff.

@@ -244,6 +256,25 @@ class SpecificFunction(Function):
def __init__(self, function, child):
super().__init__(function, child)

@classmethod
def _from_json(cls, function: Callable, snippet: dict):

The type should be narrowed down if possible - Callable[[..., ..., ...], ...]. Similarly, the dict type should also be narrowed down - dict[..., ...].

@rtimms
Contributor

rtimms commented Nov 24, 2023

Thanks @pipliggins, this is a great addition!

My thoughts on serialising models for experiments...

The issue with trying to use a serialised model with an experiment is that the model changes depending on what the operating mode is -- you need to add an extra algebraic constraint to specify power or voltage (at least in the way we have formulated the model). This currently happens in Simulation.build_for_experiment, which creates the dict op_conds_to_built_models. For a serialised model to be used with any future experiment we would need to create the current/voltage/power control versions of the model at the point of serialisation, so they could all be accessed in the future. Not sure what the API for this would look like - Simulation expects to receive a single model, so there'd have to be some new function that sets up a simulation using e.g. a dict mapping the operating mode to the correct model.

An alternative approach would be to create a version of the model where the control is always implemented as an extra algebraic constraint, and then have an option in the simulation that uses this single model instead of creating a new model for each distinct operating mode. The control function would be something like (I_app - I_var)*I_switch + ... with similar switches for voltage and power, and everything would be controlled via input parameters. The drawback of this option is that it would make all models DAEs, but maybe that's not a huge deal in an experiment anyway. Would need to do some solver testing to see how performance is affected.
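The switch idea in this second option can be sketched numerically. Everything below is illustrative: the residual follows the (I_app - I_var)*I_switch pattern from the comment above, but the names and the use of plain floats (rather than PyBaMM expressions and input parameters) are assumptions.

```python
def control_residual(I_var, V_var, P_var, inputs):
    """Single algebraic constraint covering all operating modes.

    Exactly one *_switch input is 1, selecting whether the constraint
    enforces the current, voltage, or power target. Sketch only; a real
    PyBaMM formulation would build this from symbolic input parameters.
    """
    return (
        (inputs["I_app"] - I_var) * inputs["I_switch"]
        + (inputs["V_app"] - V_var) * inputs["V_switch"]
        + (inputs["P_app"] - P_var) * inputs["P_switch"]
    )

# Current control: the residual vanishes only when I_var hits the target,
# and the voltage/power terms drop out because their switches are 0.
inputs = {"I_app": 5.0, "I_switch": 1,
          "V_app": 4.2, "V_switch": 0,
          "P_app": 10.0, "P_switch": 0}
```

Switching operating mode then only requires changing the input dictionary, not rebuilding the model, which is what makes this single-model formulation attractive for serialisation.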

I don't think either of these options are ideal.

As mentioned in #3530, the logic for setting up experiments in the Simulation class is very complex, so perhaps an overhaul of that is in order before we decide how to handle serialising models for use with experiments.

And, of course, this will all break if we allow more generic experiment control in the future (#3530).

"# do the example\n",
"dfn_model = pybamm.lithium_ion.DFN()\n",
"dfn_sim = pybamm.Simulation(dfn_model)\n",
"dfn_sim.solve([0, 3600])\n",
Contributor


This is a very minor comment, but I wonder if the example (and the warning) should encourage the use of sim.build rather than sim.solve? I have a feeling many users aren't aware you can even call sim.build, but it seems like you might want to create a model and save it without actually running a simulation.

I can imagine a workflow where people end up just doing a dummy solve for a short time before saving a model as they think they need to solve it first.

@pipliggins
Contributor Author

pipliggins commented Nov 24, 2023

> Chiming in to review the type hints 🙂
>
> Edit: this can be ignored, given that the main motive of this PR was not to add type hints

Thanks @Saransh-cpp - I'll leave these for this PR but as I'm doing type hints elsewhere for #3497 this is useful!

@TomTranter
Contributor

> @rtimms: An alternative approach would be to create a version of the model where the control is always implemented as an extra algebraic constraint, and then have an option in the simulation that uses this single model instead of creating a new model for each distinct operating mode. [...]

I like option 2 better. The file size for a DFN with variables is already 13MB. Having lots of these will quickly add up

@pipliggins
Contributor Author

Thanks for your thoughts @TomTranter @rtimms - I would tend to agree that something like option 2 would be better. The size/number of files that could be written out if we ended up trying to serialise every model permutation would get very unwieldy pretty fast. The downside is that it probably has a higher overhead to implement.

As already discussed, the issue with going between a single model and an experiment is that the structure of the built model changes. The reason for forcing the model to be discretised before serialising is ultimately to reduce the complexity of what is written out and to make it more portable to other solvers; the downside is that a serialised model cannot be edited (for example, to add experimental constraints) once it's read back in, as the original RHS, algebraic equations, etc. are lost. The with/without-experiment options also follow quite different flows through the Simulation class to set up for solving. If there are already plans afoot to reorganise this, possibly associated with #3530, my suggestion would be to put a pin in integrating the Experiment options with serialisation until those changes have been made (hopefully with an eye on making the with/without-experiment workflows more integrated), and in the meantime merge the current PR as a serialisation V1.

@martinjrobins what do you think?

@martinjrobins
Contributor

I don't think we should do serialisation of model + experiment in this PR. The scope here was to serialise a single already-built model, which is complicated enough, and there clearly needs to be more discussion on how exactly we are going to support serialisation of model + experiment.

It would be great if we could reformulate the models such that we could have a single model for all experiment steps! That would certainly make it simpler if the solve time wasn't affected too much. Definitely something to try out.

I take @TomTranter's point about the sim being the object that saves the model. There is a BaseModel.save_model function, which is the natural place for it, but since users generally interact with PyBaMM models via Simulation we wanted to add one there as well (it just calls save_model on the built model).
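The delegation being described might look roughly like the following sketch. The class bodies are invented toy stand-ins for illustration, not PyBaMM's actual code; only the error message mirrors the real one quoted earlier in the thread.

```python
class BaseModel:
    """Toy stand-in for pybamm.BaseModel."""
    def __init__(self, name):
        self.name = name
        self.is_discretised = False

    def save_model(self, filename):
        # Mirrors the real restriction: only built models can be saved.
        if not self.is_discretised:
            raise NotImplementedError(
                "PyBaMM can only serialise a discretised model."
            )
        return filename + ".json"

class Simulation:
    """Toy stand-in for pybamm.Simulation."""
    def __init__(self, model):
        self._built_model = model

    def build(self):
        # In real PyBaMM, build() parameterises and discretises the model.
        self._built_model.is_discretised = True

    def save_model(self, filename):
        # Thin wrapper: the built model remains the object that
        # serialises itself.
        return self._built_model.save_model(filename)
```

This keeps the serialisation logic in one place (the model) while letting users who only ever touch Simulation still call save_model.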

@rtimms @TomTranter, should we move the discussion on serialisation of model + experiment to a separate issue? We can then merge this in and use it as a basis for future work.

@martinjrobins martinjrobins merged commit 25b1e75 into pybamm-team:develop Nov 28, 2023
35 checks passed
Development

Successfully merging this pull request may close these issues.

proposal for serialisation of models
6 participants