
Reduce ridge filters memory footprints #6509

Merged
merged 8 commits into scikit-image:main on Sep 20, 2022

Conversation

@tkumpumaki (Contributor) commented Sep 8, 2022

Description

Reduce the ridge filters' memory footprint by not storing intermediate per-sigma results for a final max computation. Instead, after each sigma, take the element-wise maximum of the previous running result and the current sigma's result.

Closes #6507
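For illustration, the running-max pattern described above might look roughly like the following sketch (hypothetical helper names, not the actual ridges.py code):

```python
import numpy as np

def multiscale_ridge_response(image, sigmas, single_scale_filter):
    # Sketch of the approach: keep only one per-sigma response in memory
    # and fold it into a running element-wise maximum, instead of storing
    # every per-sigma result and reducing them all at the end.
    filtered_max = None
    for sigma in sigmas:
        response = single_scale_filter(image, sigma)  # one scale at a time
        if filtered_max is None:
            filtered_max = response
        else:
            # In-place running max keeps peak memory at roughly two arrays.
            np.maximum(filtered_max, response, out=filtered_max)
    return filtered_max
```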

For reviewers

  • Check that the PR title is short, concise, and will make sense 1 year
    later.
  • Check that new functions are imported in corresponding __init__.py.
  • Check that new features, API changes, and deprecations are mentioned in
    doc/release/release_dev.rst.
  • There is a bot to help automate backporting a PR to an older branch. For
    example, to backport to v0.19.x after merging, add the following in a PR
    comment: @meeseeksdev backport to v0.19.x
  • To run benchmarks on a PR, add the run-benchmark label. To rerun, the label
    can be removed and then added again. The benchmark output can be checked in
    the "Actions" tab.

@pep8speaks commented Sep 8, 2022

Hello @tkumpumaki! Thanks for updating this PR. We checked the lines you've touched for PEP 8 issues, and found:

Line 113:80: E501 line too long (81 > 79 characters)
Line 126:80: E501 line too long (88 > 79 characters)

Line 462:22: E127 continuation line over-indented for visual indent

Comment last updated at 2022-09-11 12:21:25 UTC

@mkcor (Member) left a comment

@tkumpumaki thank you so much for your contribution!

@mkcor (Member) commented Sep 9, 2022

@scikit-image/core since memory use is greatly improved with this PR, should we include a benchmark?

@rfezzani (Member) left a comment

Excellent! Thank you @tkumpumaki 😉
I simply suggest a small code-formatting change; otherwise everything seems fine. @mkcor, I am not sure a benchmark is necessary, but that is up to the other @scikit-image/core members 🙂

Review comment on skimage/filters/ridges.py (outdated, resolved)
@lagru (Member) commented Sep 9, 2022

> not sure if a benchmark is necessary, but up to other https://github.com/orgs/scikit-image/teams/core members

I'd say yes. @tkumpumaki, let us know if you want to do that yourself, or we will happily do it for you if you are not that familiar with asv.

@tkumpumaki (Contributor, author) commented:

> not sure if a benchmark is necessary, but up to other https://github.com/orgs/scikit-image/teams/core members
>
> I'd say yes. @tkumpumaki, let us know if you want to do that yourself, or we will happily do it for you if you are not that familiar with asv.

I don't have the slightest clue how to test memory usage with asv, as Python is not my main weapon. ;)
I would guess that CPU time is not much different, since the same amount of computation is done.

@grlee77 (Contributor) commented Sep 9, 2022

> I don't have the slightest clue how to test memory usage with asv, as Python is not my main weapon. ;)

No worries, @tkumpumaki. @lagru, are you interested in adding the benchmark? If so, that is great and we can wait for it; but if no one plans to work on it soon, let's not hold up the PR over it.

Certainly the memory usage will be lower with this change, and I doubt there is a substantial change in runtime. There will be more calls to maximum, but each one does less work, and the maximum operations are not the most expensive part of this function in any case. Even if there were a small performance decrease, I would still vote to merge this for the clear memory-efficiency gain!

@lagru (Member) commented Sep 9, 2022

@grlee77 happy to. I'll do it in a minute.
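For readers unfamiliar with asv: a benchmark class along the lines of the RidgeFilters benchmarks discussed in this thread might look roughly like this. This is a sketch, not the exact code in benchmark_filters.py; the test image is hypothetical, peakmem_* methods report peak memory, and time_* methods (requested later in the thread) report wall-clock time.

```python
import numpy as np
from skimage import filters


class RidgeFilters:
    """Sketch of an asv benchmark suite for the ridge filters."""

    def setup(self):
        # Hypothetical test image; the real benchmark's setup may differ.
        rng = np.random.default_rng(0)
        self.image = rng.random((2048, 2048))

    # Peak-memory benchmarks (asv's peakmem_* convention).
    def peakmem_sato(self):
        filters.sato(self.image)

    def peakmem_frangi(self):
        filters.frangi(self.image)

    # Timing benchmarks (asv's time_* convention).
    def time_sato(self):
        filters.sato(self.image)
```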

@lagru (Member) commented Sep 9, 2022

On my machine:
$ asv continuous main 7535320f661c2734321edbfe24f9fb3e5ae3bc6f -b RidgeFilters
· Creating environments
· Discovering benchmarks
·· Uninstalling from virtualenv-py3.10-cython-numpy1.23-pooch-pythran-scipy-virtualenv
·· Installing 7535320f <ridge-operator-less-memory-use~1> into virtualenv-py3.10-cython-numpy1.23-pooch-pythran-scipy-virtualenv.
· Running 10 total benchmarks (2 commits * 1 environments * 5 benchmarks)
[  0.00%] · For scikit-image commit 7535320f <ridge-operator-less-memory-use~1> (round 1/1):
[  0.00%] ·· Benchmarking virtualenv-py3.10-cython-numpy1.23-pooch-pythran-scipy-virtualenv
[ 10.00%] ··· benchmark_filters.RidgeFilters.peakmem_frangi                                        394M
[ 20.00%] ··· benchmark_filters.RidgeFilters.peakmem_hessian                                       394M
[ 30.00%] ··· benchmark_filters.RidgeFilters.peakmem_meijering                                     380M
[ 40.00%] ··· benchmark_filters.RidgeFilters.peakmem_sato                                          269M
[ 50.00%] ··· benchmark_filters.RidgeFilters.peakmem_setup                                         143M
[ 50.00%] · For scikit-image commit 6342ec59 <main> (round 1/1):
[ 50.00%] ·· Building for virtualenv-py3.10-cython-numpy1.23-pooch-pythran-scipy-virtualenv.
[ 50.00%] ·· Benchmarking virtualenv-py3.10-cython-numpy1.23-pooch-pythran-scipy-virtualenv
[ 60.00%] ··· benchmark_filters.RidgeFilters.peakmem_frangi                                        528M
[ 70.00%] ··· benchmark_filters.RidgeFilters.peakmem_hessian                                       523M
[ 80.00%] ··· benchmark_filters.RidgeFilters.peakmem_meijering                                     412M
[ 90.00%] ··· benchmark_filters.RidgeFilters.peakmem_sato                                          332M
[100.00%] ··· benchmark_filters.RidgeFilters.peakmem_setup                                         145M
before           after         ratio
[6342ec59]       [7535320f]
<main>           <ridge-operator-less-memory-use~1>
-            332M             269M     0.81  benchmark_filters.RidgeFilters.peakmem_sato
-            523M             394M     0.75  benchmark_filters.RidgeFilters.peakmem_hessian
-            528M             394M     0.75  benchmark_filters.RidgeFilters.peakmem_frangi

SOME BENCHMARKS HAVE CHANGED SIGNIFICANTLY.
PERFORMANCE INCREASED.

Co-authored-by: Riadh Fezzani <rfezzani@gmail.com>
@lagru (Member) commented Sep 9, 2022

@tkumpumaki, I pushed to your branch. To avoid conflicts, I recommend pulling these changes before you add commits yourself again: git pull origin ridge-operator-less-memory-use (assuming your fork is associated with the origin remote).

@grlee77 (Contributor) commented Sep 9, 2022

The stochastic RANSAC failure on CI is unrelated.

@grlee77 (Contributor) commented Sep 9, 2022

Thanks for the quick update, @lagru. While we are adding the benchmarks, can you also add time_meijering, etc.?

@lagru (Member) commented Sep 11, 2022

It seems that the run-benchmarks label triggers something, but the workflow is nevertheless shown as "Skipped". Does anyone know why?

@lagru (Member) commented Sep 11, 2022

On my machine, with the CPU-time benchmarks added:
$ asv continuous main HEAD -b RidgeFilters
· Creating environments
· Discovering benchmarks
·· Uninstalling from virtualenv-py3.10-cython-numpy1.23-pooch-pythran-scipy-virtualenv
·· Installing 1d0e9d37 <pr6509-ridge-operator-less-memory-use> into virtualenv-py3.10-cython-numpy1.23-pooch-pythran-scipy-virtualenv.
· Running 18 total benchmarks (2 commits * 1 environments * 9 benchmarks)
[  0.00%] · For scikit-image commit bfcb6298 <main> (round 1/2):
[  0.00%] ·· Building for virtualenv-py3.10-cython-numpy1.23-pooch-pythran-scipy-virtualenv..
[  0.00%] ·· Benchmarking virtualenv-py3.10-cython-numpy1.23-pooch-pythran-scipy-virtualenv
[ 16.67%] ··· Running (benchmark_filters.RidgeFilters.time_frangi--)...
[ 25.00%] ··· Running (benchmark_filters.RidgeFilters.time_sato--).
[ 25.00%] · For scikit-image commit 1d0e9d37 <pr6509-ridge-operator-less-memory-use> (round 1/2):
[ 25.00%] ·· Building for virtualenv-py3.10-cython-numpy1.23-pooch-pythran-scipy-virtualenv.
[ 25.00%] ·· Benchmarking virtualenv-py3.10-cython-numpy1.23-pooch-pythran-scipy-virtualenv
[ 41.67%] ··· Running (benchmark_filters.RidgeFilters.time_frangi--)...
[ 50.00%] ··· Running (benchmark_filters.RidgeFilters.time_sato--).
[ 50.00%] · For scikit-image commit 1d0e9d37 <pr6509-ridge-operator-less-memory-use> (round 2/2):
[ 50.00%] ·· Benchmarking virtualenv-py3.10-cython-numpy1.23-pooch-pythran-scipy-virtualenv
[ 52.78%] ··· benchmark_filters.RidgeFilters.peakmem_frangi                                        394M
[ 55.56%] ··· benchmark_filters.RidgeFilters.peakmem_hessian                                       394M
[ 58.33%] ··· benchmark_filters.RidgeFilters.peakmem_meijering                                     380M
[ 61.11%] ··· benchmark_filters.RidgeFilters.peakmem_sato                                          268M
[ 63.89%] ··· benchmark_filters.RidgeFilters.peakmem_setup                                         143M
[ 66.67%] ··· benchmark_filters.RidgeFilters.time_frangi                                        1.35±0s
[ 69.44%] ··· benchmark_filters.RidgeFilters.time_hessian                                       1.36±0s
[ 72.22%] ··· benchmark_filters.RidgeFilters.time_meijering                                     4.89±0s
[ 75.00%] ··· benchmark_filters.RidgeFilters.time_sato                                          935±7ms
[ 75.00%] · For scikit-image commit bfcb6298 <main> (round 2/2):
[ 75.00%] ·· Building for virtualenv-py3.10-cython-numpy1.23-pooch-pythran-scipy-virtualenv.
[ 75.00%] ·· Benchmarking virtualenv-py3.10-cython-numpy1.23-pooch-pythran-scipy-virtualenv
[ 77.78%] ··· benchmark_filters.RidgeFilters.peakmem_frangi                                        527M
[ 80.56%] ··· benchmark_filters.RidgeFilters.peakmem_hessian                                       527M
[ 83.33%] ··· benchmark_filters.RidgeFilters.peakmem_meijering                                     448M
[ 86.11%] ··· benchmark_filters.RidgeFilters.peakmem_sato                                          336M
[ 88.89%] ··· benchmark_filters.RidgeFilters.peakmem_setup                                         145M
[ 91.67%] ··· benchmark_filters.RidgeFilters.time_frangi                                        1.37±0s
[ 94.44%] ··· benchmark_filters.RidgeFilters.time_hessian                                       1.37±0s
[ 97.22%] ··· benchmark_filters.RidgeFilters.time_meijering                                  4.88±0.01s
[100.00%] ··· benchmark_filters.RidgeFilters.time_sato                                          941±4ms
before           after         ratio
[bfcb6298]       [1d0e9d37]
<main>           <pr6509-ridge-operator-less-memory-use>
-            448M             380M     0.85  benchmark_filters.RidgeFilters.peakmem_meijering
-            336M             268M     0.80  benchmark_filters.RidgeFilters.peakmem_sato
-            527M             394M     0.75  benchmark_filters.RidgeFilters.peakmem_hessian
-            527M             394M     0.75  benchmark_filters.RidgeFilters.peakmem_frangi

SOME BENCHMARKS HAVE CHANGED SIGNIFICANTLY.
PERFORMANCE INCREASED.

@grlee77 (Contributor) commented Sep 16, 2022

Hi @tkumpumaki, I merged a larger refactor of the ridge filters by @anntzer via #6446 today. It has created conflicts here, but, more importantly, it also refactored these same sections and removed the upfront memory allocations!

Given that, it may be that no code changes are necessary in ridges.py for this PR. In that case, please just update the PR title to something like "add ridge filter benchmarks".

My apologies for not realizing the full overlap between the two PRs and not pointing it out earlier. Let us know if you need help resolving conflicts here.

@anntzer (Contributor) commented Sep 16, 2022

That PR removed the large upfront memory allocations, but it still computes the filter for all sigmas before taking the max (so memory use peaks just before the max is taken); it may still be useful to replace that with a pairwise max taken after each sigma's computation.
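For contrast with the sketch in the PR description, the pattern @anntzer describes (compute every per-sigma response first, reduce once at the end) looks roughly like this; the helper names are hypothetical, and the point is that peak memory grows with the number of sigmas, which the running max avoids.

```python
import numpy as np

def multiscale_ridge_response_all_at_once(image, sigmas, single_scale_filter):
    # All per-sigma responses are alive at the same time right before the
    # reduction, so peak memory scales with len(sigmas).
    responses = [single_scale_filter(image, sigma) for sigma in sigmas]
    return np.maximum.reduce(responses)
```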

@tkumpumaki (Contributor, author) commented:

I merged the main branch back into this branch and replaced ridges.py with the new version from main, then re-applied the memory fix.

@lagru (Member) commented Sep 20, 2022

I'm kind of confused by the commit hash used by the benchmark workflow, d3e5df6. It looks like a merge commit, but it is not listed on this PR...

@lagru (Member) commented Sep 20, 2022

But it shows decreased memory use locally:

$ asv continuous --show-stderr --split main pr6509-ridge-operator-less-memory-use -b RidgeFilters
· Creating environments
· Discovering benchmarks
·· Uninstalling from virtualenv-py3.10-cython-numpy1.23-pooch-pythran-scipy-virtualenv
·· Installing 4a792387 <pr6509-ridge-operator-less-memory-use> into virtualenv-py3.10-cython-numpy1.23-pooch-pythran-scipy-virtualenv.
· Running 18 total benchmarks (2 commits * 1 environments * 9 benchmarks)
[  0.00%] · For scikit-image commit daa991ad <main> (round 1/2):
[  0.00%] ·· Building for virtualenv-py3.10-cython-numpy1.23-pooch-pythran-scipy-virtualenv................................................................................
[  0.00%] ·· Benchmarking virtualenv-py3.10-cython-numpy1.23-pooch-pythran-scipy-virtualenv
[ 16.67%] ··· Running (benchmark_filters.RidgeFilters.time_frangi--)...
[ 25.00%] ··· Running (benchmark_filters.RidgeFilters.time_sato--).
[ 25.00%] · For scikit-image commit 4a792387 <pr6509-ridge-operator-less-memory-use> (round 1/2):
[ 25.00%] ·· Building for virtualenv-py3.10-cython-numpy1.23-pooch-pythran-scipy-virtualenv.
[ 25.00%] ·· Benchmarking virtualenv-py3.10-cython-numpy1.23-pooch-pythran-scipy-virtualenv
[ 41.67%] ··· Running (benchmark_filters.RidgeFilters.time_frangi--)...
[ 50.00%] ··· Running (benchmark_filters.RidgeFilters.time_sato--).
[ 50.00%] · For scikit-image commit 4a792387 <pr6509-ridge-operator-less-memory-use> (round 2/2):
[ 50.00%] ·· Benchmarking virtualenv-py3.10-cython-numpy1.23-pooch-pythran-scipy-virtualenv
[ 52.78%] ··· benchmark_filters.RidgeFilters.peakmem_frangi                                        332M
[ 55.56%] ··· benchmark_filters.RidgeFilters.peakmem_hessian                                       332M
[ 58.33%] ··· benchmark_filters.RidgeFilters.peakmem_meijering                                     279M
[ 61.11%] ··· benchmark_filters.RidgeFilters.peakmem_sato                                          268M
[ 63.89%] ··· benchmark_filters.RidgeFilters.peakmem_setup                                         145M
[ 66.67%] ··· benchmark_filters.RidgeFilters.time_frangi                                      5.49±0.2s
[ 69.44%] ··· benchmark_filters.RidgeFilters.time_hessian                                    5.58±0.05s
[ 72.22%] ··· benchmark_filters.RidgeFilters.time_meijering                                  5.18±0.04s
[ 75.00%] ··· benchmark_filters.RidgeFilters.time_sato                                       4.80±0.03s
[ 75.00%] · For scikit-image commit daa991ad <main> (round 2/2):
[ 75.00%] ·· Building for virtualenv-py3.10-cython-numpy1.23-pooch-pythran-scipy-virtualenv.
[ 75.00%] ·· Benchmarking virtualenv-py3.10-cython-numpy1.23-pooch-pythran-scipy-virtualenv
[ 77.78%] ··· benchmark_filters.RidgeFilters.peakmem_frangi                                        382M
[ 80.56%] ··· benchmark_filters.RidgeFilters.peakmem_hessian                                       382M
[ 83.33%] ··· benchmark_filters.RidgeFilters.peakmem_meijering                                     329M
[ 86.11%] ··· benchmark_filters.RidgeFilters.peakmem_sato                                          318M
[ 88.89%] ··· benchmark_filters.RidgeFilters.peakmem_setup                                         143M
[ 91.67%] ··· benchmark_filters.RidgeFilters.time_frangi                                     5.14±0.03s
[ 94.44%] ··· benchmark_filters.RidgeFilters.time_hessian                                    5.35±0.09s
[ 97.22%] ··· benchmark_filters.RidgeFilters.time_meijering                                  5.13±0.02s
[100.00%] ··· benchmark_filters.RidgeFilters.time_sato                                        4.55±0.1s
before           after         ratio
[daa991ad]       [4a792387]
<main>           <pr6509-ridge-operator-less-memory-use>
-            382M             332M     0.87  benchmark_filters.RidgeFilters.peakmem_frangi
-            382M             332M     0.87  benchmark_filters.RidgeFilters.peakmem_hessian
-            329M             279M     0.85  benchmark_filters.RidgeFilters.peakmem_meijering
-            318M             268M     0.84  benchmark_filters.RidgeFilters.peakmem_sato

SOME BENCHMARKS HAVE CHANGED SIGNIFICANTLY.
PERFORMANCE INCREASED.

@rfezzani (Member) left a comment

LGTM, simply recommending the use of np.zeros_like when possible.

Three review comments on skimage/filters/ridges.py (outdated, resolved)
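Regarding the np.zeros_like suggestion above: it allocates a zero-filled array with the shape and dtype of an existing array, so explicit shape/dtype arguments can be dropped. A minimal illustration (not the PR's actual code):

```python
import numpy as np

image = np.empty((4, 5), dtype=np.float32)

# Spelling out shape and dtype explicitly ...
filtered_max = np.zeros(image.shape, dtype=image.dtype)

# ... versus letting np.zeros_like pick both up from the existing array.
filtered_max = np.zeros_like(image)
```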
Co-authored-by: Riadh Fezzani <rfezzani@gmail.com>
@lagru merged commit d549c79 into scikit-image:main on Sep 20, 2022
@lagru (Member) commented Sep 20, 2022

Thanks everyone!

@mkcor (Member) commented Sep 24, 2022

> I'm kind of confused by the commit hash used by the benchmark workflow, d3e5df6. It looks like a merge commit, but it is not listed on this PR...

Right... It looks as if CI, instead of running directly on the latest commit (i.e., 4a79238), merges that very commit into an ad hoc branch before running.

@jarrodmillman added this to the 0.20 milestone on Oct 4, 2022
@grlee77 added a commit to grlee77/cucim that referenced this pull request on Oct 28, 2022
The rapids-bot bot pushed a commit to rapidsai/cucim that referenced this pull request on Nov 16, 2022
…y footprint) (#423)

related to #419

A large overhaul of the ridge filters, addressing inaccuracies and errors, was implemented for scikit-image 0.20. This PR ports the same changes to these functions in cuCIM.

upstream PRs: 
 - scikit-image/scikit-image#6149
 - scikit-image/scikit-image#6440 
 - scikit-image/scikit-image#6446
 - scikit-image/scikit-image#6509

These fix various bugs, simplify the implementation, and reduce the memory footprint.

Authors:
  - Gregory Lee (https://github.com/grlee77)

Approvers:
  - Gigon Bae (https://github.com/gigony)

URL: #423
Successfully merging this pull request may close the following issue: Sato filter memory usage with multiple sigmas (#6507).

8 participants