Issue #1698 model splitter optimization #1699

Manangka · 2025-10-20T07:02:07Z

Fixes #

Description

Checklist

Links to correct issue
Update changelog, if changes affect users
PR title starts with Issue #nr, e.g. Issue #737
Unit tests were added
If feature added: Added/extended example
If feature added: Added feature to API documentation
If pixi.lock was changed: Ran pixi run generate-sbom and committed changes

…eading op the nc and zarr files

…al applicalble and some unittests are failing

# Conflicts: # pixi.lock

… the way the solution is updated in the split model

…idn't expect to recieve dask objects. Optimize the flow-transport model matcher

sonarqubecloud · 2025-10-20T07:02:45Z

Quality Gate passed

Issues
9 New issues
0 Accepted issues

Measures
0 Security Hotspots
0.0% Coverage on New Code
0.0% Duplication on New Code

See analysis details on SonarQube Cloud

@Huite

Fixes #1698 # Description This is part 2 of fixing the performance issues with large model. In part 1 #1693 the modelsplitter has been optimized. In this PR the focus is on wiring the partitioned model. As @Huite pointed out in #1686 the performance bottleneck had to do with the fact that the same package had to be loaded from file multiple times while only a part of the file is actually needed. After digging around for a while i discovered that this had to do with the fact how we open de the dataset. `dataset = xr.open_dataset(path, **kwargs)` In the line above we don't specify anything chunk related. That has as a result that when you access the dataset the entire file has to be loaded from disk. By simply adding `chunks="auto"` this is no longer the case and a huge performance gain is achieved. There are some other changes related to setting chunking to auto. There are some parts of the code that don't expect to receive dask arrays. For instance you can use .item() on a dask array. Instead i now use .values[()]. I was also getting some errors when the to_netcdf method were called on the package. All of them had something to do with wrong/unsupported datatypes. In this PR you will find that an encoding is added for float16 types. And that in some packages the from_file method has been updated to ensure that he loaded type is converted to a supported type An unrelated change but performance wise significant change has been applied to the `_get_transport_models_per_flow_model` method. This method is used to match gwf models to gwt models so that gwfgwt exchanges can be created. This method was doing a full comparison of domains, which is expensive. There is also a method available that does the comparison on domain level. By switching to this method the matching algorithm becomes almost instantaneously. **NOTE** This PR has issue #1699 as a base. The base needs to altered to master once that PR is in **NOTE** This PR also improves the `dump` method **NOTE** some timmings: <img width="833" height="739" alt="image" src="https://github.com/user-attachments/assets/974c841c-0413-4433-8486-1abe98dc0715" /> <img width="843" height="215" alt="image" src="https://github.com/user-attachments/assets/c7082975-af35-4143-a6f9-860557b3eb09" /> <img width="842" height="705" alt="image" src="https://github.com/user-attachments/assets/383bf1a6-f028-4cb4-aa72-48ab95e84e3d" />  - [x] Links to correct issue - [ ] Update changelog, if changes affect users - [x] PR title starts with ``Issue #nr``, e.g. ``Issue #737`` - [ ] Unit tests were added - [ ] **If feature added**: Added/extended example - [ ] **If feature added**: Added feature to API documentation - [ ] **If pixi.lock was changed**: Ran `pixi run generate-sbom` and committed changes --------- Co-authored-by: JoerivanEngelen <joerivanengelen@hotmail.com>

Manangka added 17 commits October 13, 2025 19:13

Optimize modelsplitter

9ea9075

Remove old slice_model method

07cad1a

Clean up modelsplitter

0b631fa

Refactor modelsplitter

29b6f8d

Add documentation to the split method. More cleaning up

5457c49

Fix incorrect use of item()

6b71831

Directly call any() on the dask array

f72c774

Fix regrid error due to use of wrong interface

c201314

Rvert to earlier more efficient evaluation of has_overlap. Optimize r…

f19553c

…eading op the nc and zarr files

Revert chunk optimization when opening zarr or nc files. Its not genr…

9a99506

…al applicalble and some unittests are failing

Merge branch 'master' into model_splitter_optimization

372c1f7

# Conflicts: # pixi.lock

Apply review comment

8567903

Fix incorrect order of removing non-spatial dims

cbe5624

Fix lint error

f37972f

Refactor updating of th buy and ssm package after splitting. Refactor…

a32e989

… the way the solution is updated in the split model

Add comments to the ModelSplitter class

04f746e

Add chunking when opening netcdf files. Handle errors for code that d…

9c5134a

…idn't expect to recieve dask objects. Optimize the flow-transport model matcher

Manangka closed this Oct 20, 2025

Manangka mentioned this pull request Oct 20, 2025

Issue #1698 Model write optimization #1700

Merged

7 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Issue #1698 model splitter optimization #1699

Issue #1698 model splitter optimization #1699

Uh oh!

Manangka commented Oct 20, 2025

Uh oh!

sonarqubecloud bot commented Oct 20, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Issue #1698 model splitter optimization #1699

Issue #1698 model splitter optimization #1699

Uh oh!

Conversation

Manangka commented Oct 20, 2025

Description

Checklist

Uh oh!

sonarqubecloud bot commented Oct 20, 2025

Quality Gate passed

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants