Rewrite simple and tiepoint modis interpolation in cython #38

djhoese · 2022-03-19T19:51:03Z

This is a continuation of #36, but it mostly contains changes to the simple modis interpolation made since that PR. Doing this required restructuring the cython modules, so I added a _modis_utils.py. The results of a simple test with some dask-array-based processing:

This is about half the memory and half the processing time of a previous execution with the non-cython version. It also runs faster in the pure numpy case of a full granule, but not as fast as the #36 angle-based interpolation with pure numpy. So to put it another way, simple interpolation is faster and uses less memory than #36 if used with small and/or dask-based arrays, but it is slower than #36 for larger numpy arrays.

mraspaud

Looks good, the split functions make it much easier to follow! Just one inline question.

.github/workflows/ci.yaml

djhoese · 2022-10-24T15:35:20Z

@mraspaud I think this is ready for re-review. The refactoring definitely fixed some bugs that only by luck didn't break the final results.

I'll make more PRs in the future when the coveralls github action is further along and merged with the upstream github action. I will merge this after I do some final performance checks to make sure my refactoring didn't break anything.

djhoese · 2022-10-24T17:13:53Z

Here is creation of a true_color with geotiepoints main:

And here is this branch:

So this branch is 25s faster and uses less memory. Note this is computing the true_color and throwing away the chunks. I'm not exactly sure why the new version shows a near-constant 8GB of memory usage during the second half of processing, but that isn't the interpolation part of the processing anyway.

djhoese · 2022-10-24T18:34:27Z

And here's what that looks like with this PR just before my latest refactorings:

So the graph in my last comment (how this PR is now) is even faster and more memory efficient than before. Amazing.

djhoese · 2022-10-24T18:36:47Z

Nnnoooo...the true_color geotiff is bad! It may be a nearest neighbor thing, let me check.

djhoese · 2022-10-24T18:59:35Z

False alarm, it is a bad default setting in the EWA resampler in pyresample. Separate PR coming for that in pyresample.

djhoese · 2022-10-24T20:20:56Z

So bottom line this PR makes this code better in every way. Let's just merge it. No need to review it. 😉

mraspaud

Looks good, just a couple refactoring requests

geotiepoints/_modis_interpolator.pyx

mraspaud · 2022-10-25T10:43:32Z

Regarding the performance results: did you test both modis_interpolator and simple_modis_interpolator? I think you presented just one result, which one is it?

djhoese · 2022-10-25T12:04:27Z

> Regarding the performance results: did you test both modis_interpolator and simple_modis_interpolator? I think you presented just one result, which one is it?

Good question. I realize now that the PR title has become misleading as this PR has grown; I'll update it. The most recent profiling all used Satpy's defaults, which for the 1km data I was using means modis_interpolator and not the simple one. There were results for the simple interpolator somewhere else, but now I can't find them. I'll do some more profiling after I clean up the code based on your suggestions.

Note: I realized afterwards that some of these results were likely produced with Cython profiling/tracing enabled, so they are likely slower than they will be in production.
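For reference, Cython's `profile` and `linetrace` compiler directives (together with the `CYTHON_TRACE` C macro) are what add this kind of overhead. Below is a minimal sketch of keeping them off by default in a build script. The `CYTHON_COVERAGE` environment variable and the `cython_build_options` helper are hypothetical names chosen for illustration, not necessarily what this repository uses.

```python
import os


def cython_build_options(environ=None):
    """Return (compiler_directives, define_macros) for a Cython build.

    Tracing is only switched on when the (hypothetical) CYTHON_COVERAGE
    environment variable is set, so production builds stay fast.
    """
    environ = os.environ if environ is None else environ
    tracing = environ.get("CYTHON_COVERAGE", "0").lower() in ("1", "true")
    directives = {
        "language_level": "3",
        "profile": tracing,    # function-level profiling hooks
        "linetrace": tracing,  # line-level tracing hooks
    }
    # Line tracing only takes effect if the generated C code is also
    # compiled with the CYTHON_TRACE macro defined.
    macros = [("CYTHON_TRACE", "1")] if tracing else []
    return directives, macros


directives, macros = cython_build_options({})
print(directives, macros)
```

In a `setup.py`, the first value would typically go to `cythonize(..., compiler_directives=directives)` and the second to `Extension(..., define_macros=macros)`.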

mraspaud

LGTM

djhoese · 2022-10-25T15:40:58Z

main tiepoint interpolation:

main simple:

This PR tiepoint:

This PR simple:

So faster and more memory efficient, but some of these results seemed heavily dependent on what the rest of my system was doing. The simple interpolation definitely doesn't benefit from this work as much as the tiepoint-based interpolation does as it uses scipy to do the majority of the interpolation/extrapolation. I'm still very surprised by the simple interpolation in main. It is almost identical to the cython one for this PR.
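Since these numbers varied with system load, one way to make such comparisons more repeatable is to run each case several times and keep the best wall time and the highest peak memory. The following is a minimal standard-library sketch, not the setup used for these plots. `profile_call` and `fake_interpolation` are hypothetical names, and `tracemalloc` only sees allocations reported through Python's allocator hooks and adds overhead of its own.

```python
import time
import tracemalloc


def profile_call(func, *args, repeats=3, **kwargs):
    """Return (best_seconds, peak_bytes) over several runs of func."""
    best_seconds = float("inf")
    peak_bytes = 0
    for _ in range(repeats):
        tracemalloc.start()
        start = time.perf_counter()
        func(*args, **kwargs)
        elapsed = time.perf_counter() - start
        _, peak = tracemalloc.get_traced_memory()
        tracemalloc.stop()
        # The minimum time is the run least disturbed by other processes.
        best_seconds = min(best_seconds, elapsed)
        peak_bytes = max(peak_bytes, peak)
    return best_seconds, peak_bytes


def fake_interpolation(size):
    """Hypothetical stand-in for an interpolation call."""
    data = [float(i) for i in range(size)]
    return [(a + b) / 2.0 for a, b in zip(data, data[1:])]


seconds, peak = profile_call(fake_interpolation, 100_000)
print(f"best time: {seconds:.4f}s, peak traced memory: {peak / 1e6:.1f} MB")
```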

djhoese · 2022-10-25T15:43:15Z

The other thing to keep in mind with the above plots is that the interpolation is only the first 10 seconds or so since I am also doing EWA resampling and then computation of the arrays. So the peak memory at the right side of the plot is an EWA problem not a geotiepoint/modis interpolation problem.

mraspaud · 2022-10-25T16:29:26Z

Great work, thanks for wrapping this up!

djhoese added 30 commits March 12, 2022 20:21

Initial cython version of modis interpolator

bb547f1

More cythonizing

9ea5e10

More cythonizing of expand_tiepoint_array_5km

f06b707

Fix modis interpolation tests to have consistent types

b64aa73

Update modis interpolator tests to pytest

837f7b4

Add minimal typing to modis interpolator cython

9ee6810

Update modis interpolator use of 64-bit floats

9eae62d

Convert 1km coords method to C loop

8c46e0f

Rewrite expand_tiepoints for 1km modis data

5564829

More cython conversion

38f9bba

Optimize cython version of _compute_expansion_alignment

a945952

Rearrange modis interpolation to avoid unnecessary computation

d486fd3

Cythonize xyz2lonlat in modis interpolation

f0b6e84

Refactor modis interpolation cython into more methods

41e9763

Refactor modis interpolation to use more memory views

f0c280e

More memory view work in modis interpolator

2c401c7

More refactoring

af3d737

Remove unnecessary 64-bit functions in modis interpolation

315c857

Convert expand 5km to memory views (ugly)

5604fe7

Refactor modis interpolation to use more memory views

ad9cc95

Modis interpolation cleanup

6a3a8e9

Rearrange modis interpolation cython for easier reading

ddb249e

Optimize a_track calculation

bff6a4f

Optimize 1km coords generation in modis interpolator

763352b

Refactor modis interpolation to put public function at the top of the module

9fa6382

Remove copy of corner arrays in modis interpolation

989db17

Initial conversion of simple modis interpolation to cython

bd82993

Restructure modis interpolation to have shared utils module

257836e

Refactor simple modis interp to use single xyz array

f051470

More memory views in simple modis interp

1bdc93a

djhoese added 5 commits October 20, 2022 15:39

Refactoring 1km expand tiepoint

3215da3

Refactoring 1km expand tiepoint

d351e0c

Refactoring 1km expand tiepoint

0105b65

Rename some methods

ead104d

Start refactoring 5km tiepoint expand

396772e

mraspaud reviewed Oct 24, 2022

.github/workflows/ci.yaml

djhoese added 4 commits October 24, 2022 09:58

More simplifying of 5km expand tiepoint

6d7806b

Separate 5km expand tiepoint into methods

4c165ff

More 5km expand refactoring

2f804fa

More 5km expand refactoring

2dd6a66

djhoese mentioned this pull request Oct 24, 2022

Fix EWA default for 'weight_delta_max' to match docstring pytroll/pyresample#463

Merged

3 tasks

mraspaud reviewed Oct 25, 2022

geotiepoints/_modis_interpolator.pyx (outdated)

geotiepoints/_modis_interpolator.pyx (outdated)

More 5km refactoring to reduce number of methods

ca4acfa

mraspaud approved these changes Oct 25, 2022

djhoese merged commit 1f0ad49 into pytroll:main Oct 25, 2022

djhoese deleted the optimize-simple-modis-cython branch October 25, 2022 15:46

djhoese changed the title ~~Rewrite simple modis interpolation in cython~~ Rewrite simple and tiepoint modis interpolation in cython Oct 25, 2022

This was referenced Aug 30, 2023

Make the interpolators dask-compatible #18

Closed

Upgrade to Cython 3+ in building #57

Merged
