
Implementation of ndimage filters #3184

Merged
merged 35 commits into cupy:master on Jun 25, 2020

Conversation

@coderforlife (Contributor) commented Mar 11, 2020

So far this PR includes improved correlate and convolve functions (in terms of speed, using a technique similar to #3179), along with implementations of correlate1d and convolve1d and tests for them. The underlying kernel creation has been generalized so that it can be used to implement all of the other filters; those will be progressively added to this PR (I have implementations of them but haven't tested them yet).

This works to address #2099 and #3111.

Current status of tests: 7120 tests pass (in test_filters.py) and 8 fail (all of them with the 1D functions along axis 0 using mode='mirror' on 4D images; no other axis, mode, or lower-dimensional image fails).

This is the first step towards a complete ndimage.filters suite. The functions here are designed to be flexible so that all other ndimage.filters functions can be created.
All other filters will be based on this core set of code.
@emcastillo emcastillo self-assigned this Mar 11, 2020
@jakirkham (Member):

I wonder if we should expose the 1-D convolve as cupy.convolve (analogous to numpy.convolve). Thoughts? 🙂

@coderforlife (Contributor, Author):

@jakirkham It is possible; a quick sketch of how it could be done is below. Note that I have never used numpy.convolve or numpy.correlate, so I don't know details like what they do with >1D data or how some modes center the arrays. However, it gives the basic idea.

def convolve(a, v, mode='full'):
    if mode not in ('full', 'same', 'valid'): raise ValueError('mode') 

    # Make both 1D arrays and make sure `a` is the larger array
    a, v = a.ravel(), v.ravel()
    if len(a) == 0: raise ValueError('a cannot be empty')
    if len(v) == 0: raise ValueError('v cannot be empty')
    if len(v) > len(a): a, v = v, a

    # Deal with full padding of data
    if mode == 'full':
        pad = len(v) - 1
        a = cupy.pad(a, (pad // 2, (pad + 1) // 2))
    
    # Perform convolution
    out = cupyx.scipy.ndimage.convolve1d(a, v, mode='constant')

    # Return result
    if mode == 'valid':
        remove = 1 - len(v)
        out = out[remove//2 : -(remove+1)//2]
    return out

The downsides of this compared to numpy.convolve():

  • The default mode of 'full' requires padding the data (and thus copying the data).
  • The ndimage.convolve1d() function assumes that one of a or v is relatively small. If you were to use this for a cross-correlation of a large dataset (and thus len(a) == len(v)), it would likely have low performance.

@coderforlife (Contributor, Author):

The 8 tests that are not passing are actually due to a bug in SciPy. I have submitted a bug report over there: scipy/scipy#11661. SciPy does not handle mirror correctly for length-1 dimensions with the 1D functions.

First change (the w[] to weights[] I forgot to do earlier) was the most major.
Bad copy-paste naming of the tests along with avoiding a scipy bug.
@awthomp commented Mar 12, 2020

@coderforlife -- I'm interested in CuPy's support/implementation of numpy.convolve and numpy.correlate for cusignal. I'd be more than happy to test and guide your implementation.

> I don't know details like what do they do with >1D data or how some mode center the arrays
For NumPy's convolve/correlate functions, only 1D arrays are supported. This is actually probably the motivation for creating 2D support in ndimage. As you mentioned in your post, the key component here is how padding is handled in the implementation; and I see that as the big difference between doing a 1D ndimage.convolve or ndimage.correlate as well.

I took a quick-ish spin of ndimage.correlate and noticed that it doesn't do the 'sliding window' thing that numpy.convolve and numpy.correlate do. This 'sliding dot product' is the key difference between the ndimage and standard numpy implementation, IMO.

Anyway, as you may know, numpy.correlate and numpy.convolve are basically the same algorithm. For convolve, the filter is mirrored left-to-right before beginning the sliding dot product. To step through an example:

a = [1, 2, 3]
b = [0, 1, 0.5]
y = np.convolve(a, b, mode="full")

Yields:
array([0. , 1. , 2.5, 4. , 1.5])

Convolution/correlation is a point by point overlapping and sliding dot product that starts with a mirrored version of the filter (b) and the data (a). The last element in the mirrored filter is overlapped with the first element of the data, and the dot product is taken as the filter moves element by element over the data. To be explicit:

mirrored_b (denoted by m_b), used for calculation = [0.5, 1, 0]

The convolution works like:

y[0] = m_b[2] * a[0] = 0 * 1 = 0
y[1] = m_b[2] * a[1] + m_b[1] * a[0] = 0 * 2 + 1 * 1 = 1
y[2] = m_b[2] * a[2] + m_b[1] * a[1] + m_b[0] * a[0] = 0 * 3 + 1 * 2 + 0.5 * 1 = 2.5
y[3] = m_b[1] * a[2] + m_b[0] * a[1] = 1 * 3 + 0.5 * 2 = 4
y[4] = m_b[0] * a[2] = 0.5 * 3 = 1.5

Here's a visualization of what's going on too: https://en.wikipedia.org/wiki/Convolution#/media/File:Convolution_of_box_signal_with_itself2.gif

Again, correlation is very similar, but you don't have to flip the filter component (b, in this case).
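The sliding dot product described here can be written out directly in NumPy (a small illustration of the same arithmetic, not code from the PR):

```python
import numpy as np

a = np.array([1.0, 2.0, 3.0])
b = np.array([0.0, 1.0, 0.5])

m_b = b[::-1]               # mirrored filter: [0.5, 1.0, 0.0]
pad = len(b) - 1
padded = np.pad(a, pad)     # zero-pad so every overlap position exists
# slide the mirrored filter across the data, taking a dot product at
# each position -- this is 'full'-mode convolution
y = np.array([padded[i:i + len(b)] @ m_b for i in range(len(a) + pad)])

print(y)                    # matches np.convolve(a, b, mode='full')
assert np.allclose(y, np.convolve(a, b, mode='full'))
```

The same loop with b (not m_b) would compute the correlation instead.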

@coderforlife (Contributor, Author):

@awthomp I know that convolution and correlation are essentially the same thing with mirrored 'kernels', 'weights', or 'filters' (depending on what you call the smaller of the two arrays); ndimage uses that fact. Also, the ndimage algorithm does use overlap-add. The ElementwiseKernel does the following for each element: sum every neighbourhood value multiplied by the corresponding weight. This is exactly what you describe. It may not look like it is doing this since it isn't "sliding" across but instead doing it per-element.

For the 1D case, it does this along an axis instead of in all dimensions. It handles the edges differently from numpy.convolve/correlate.

The following code demonstrates the equality:

a = np.arange(24)
b = np.ones(3)
(np.convolve(a, b, mode="same") == ndi.convolve1d(a, b, mode="constant")).all()

where ndi is scipy.ndimage. The only other reasonable way to do a convolution is through FFT, and ndimage definitely doesn't do that.

I have corrected the code from above to properly align the output with what numpy.convolve generates and to forbid non-1D arrays (and fixed one other bug I had), so cupy.convolve could be implemented as:

def convolve(a, v, mode='full'):
    if mode not in ('full', 'same', 'valid'): raise ValueError('mode') 

    # Check both arrays and make sure `a` is the larger array
    if a.ndim != 1: raise ValueError('a must be 1D')
    if v.ndim != 1: raise ValueError('v must be 1D')
    if len(a) == 0: raise ValueError('a cannot be empty')
    if len(v) == 0: raise ValueError('v cannot be empty')
    if len(v) > len(a): a, v = v, a

    # Deal with full padding of data
    if mode == 'full':
        pad = len(v) - 1
        a = cupy.pad(a, ((pad+1)//2, pad//2), 'constant')
    
    # Perform convolution
    out = cupyx.scipy.ndimage.convolve1d(a, v, mode='constant')

    # Return result
    if mode == 'valid':
        remove = len(v) - 1
        out = out[(remove + 1) // 2 : len(out) - remove // 2]
    return out

This should handle all cases but may not be the most efficient for all of them. In all likelihood, correlate will be implemented like the above and then convolve will be implemented in terms of correlate.
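For reference, the sketch above can be sanity-checked on the CPU by substituting numpy/scipy calls for the cupy/cupyx ones (convolve_cpu is a hypothetical stand-in with the same logic; checked with an odd-length kernel):

```python
import numpy as np
from scipy.ndimage import convolve1d

def convolve_cpu(a, v, mode='full'):
    # CPU stand-in for the cupy sketch: np.pad for cupy.pad and
    # scipy.ndimage.convolve1d for cupyx.scipy.ndimage.convolve1d
    if mode not in ('full', 'same', 'valid'):
        raise ValueError('mode')
    if a.ndim != 1 or v.ndim != 1:
        raise ValueError('arrays must be 1D')
    if len(a) == 0 or len(v) == 0:
        raise ValueError('arrays cannot be empty')
    if len(v) > len(a):
        a, v = v, a
    if mode == 'full':
        pad = len(v) - 1
        a = np.pad(a, ((pad + 1) // 2, pad // 2), 'constant')
    out = convolve1d(a, v, mode='constant')
    if mode == 'valid':
        remove = len(v) - 1
        out = out[(remove + 1) // 2 : len(out) - remove // 2]
    return out

a = np.arange(8, dtype=float)
v = np.array([1.0, 2.0, 3.0])
for mode in ('full', 'same', 'valid'):
    # agrees with numpy.convolve for all three modes
    assert np.allclose(convolve_cpu(a, v, mode), np.convolve(a, v, mode))
```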

Implementing scipy.signal.convolve/correlate is a whole different can of worms. There are also fftconvolve and oaconvolve (OA = 'overlap-add', which is what we are using here). Both of those functions say they are much faster than convolve, but convolve just uses those functions... they do confirm that the OA method is only efficient when len(a) >> len(b) or vice-versa. There is also choose_conv_method for determining which to call.
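For reference, choose_conv_method can be queried directly to see which algorithm SciPy would pick for given sizes (a small sketch; the exact choice depends on the SciPy version and the input sizes):

```python
import numpy as np
from scipy import signal

rng = np.random.default_rng(0)
data = rng.standard_normal(10_000)
small_filt = rng.standard_normal(8)
large_filt = rng.standard_normal(10_000)

# choose_conv_method inspects the input sizes and returns either
# 'direct' (sliding dot product) or 'fft' (FFT-based convolution)
print(signal.choose_conv_method(data, small_filt))
print(signal.choose_conv_method(data, large_filt))
```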

@awthomp commented Mar 12, 2020

Hi @coderforlife -- We've actually taken care of correlate/convolve in cusignal. For large arrays/filters, we're getting a 1-2 order of magnitude perf gain over SciPy Signal, but -- as you can see -- we default to NumPy (rather than invoking an FFT) for small array sizes. This isn't ideal.

Much of cusignal is based on swapping SciPy Signal's NumPy calls for CuPy; where there's no support (like correlate2d or upfirdn), we wrote custom Numba CUDA kernels and are now exploring raw CuPy modules.

Usage examples are here: https://github.com/rapidsai/cusignal/blob/d64df2fc2c40bbdba4aad34641460176552cd5f6/notebooks/api_guide/signaltools_examples.ipynb

@grlee77 (Contributor) commented Mar 12, 2020

Hey @awthomp, I had implemented scipy.signal.upfirdn a little over a year ago (prior to discovering your version in cusignal). At that time I had also reused it to implement numpy.convolve and numpy.correlate (I had added a couple arguments to control the extent and origin in upfirdn). The implementation of upfirdn differs from cusignal's in that it uses RawKernel rather than Numba. I have finally also made this version public (see #2099 (comment)).

@awthomp commented Mar 12, 2020

@grlee77 Awesome! Looking forward to seeing your work. In the meantime, we also implemented the 1D double-precision raw kernel for upfirdn in cusignal earlier this week (rapidsai/cusignal#25). @mnicely is currently looking at optimizing it in this WIP PR: rapidsai/cusignal#31. We'd love your eyes on it, if you have the opportunity and desire to contribute.

This results in a single style of kernel to maintain while also maintaining the speed of 1D kernels.
            format(j=j, type=int_type))
    else:
        boundary = _generate_boundary_condition_ops(
            mode, 'ix_{}'.format(j), 'xsize_{}'.format(j))
        loops.append('''

Review comment (Contributor):

Suggested change: fix the indentation of the loops.append(''' line.

@grlee77 (Contributor) commented Mar 15, 2020

Thanks for refactoring to avoid duplicate 1d and nd code paths!

I can confirm good performance of the refactored kernel. The only problem I encountered was the indentation error above.
The 1D case now performs approximately the same as what I had in cupyimg; the 3D cases I tested are up to 25% faster with this kernel.

Review comment (Member):

please address this!
thanks 🙂

Review comment (Contributor):

Completed in #3390

if not hasattr(origin, '__getitem__'):
    origin = [origin, ] * input.ndim
def _fix_sequence_arg(arg, ndim, name, conv=lambda x: x):
    if hasattr(arg, '__iter__') and not isinstance(arg, str):
Review comment (Member):

we favor try/except AttributeError over hasattr

Review comment (Contributor):

@emcastillo Can you point me to some example code you want me to use?

Just curious why y'all prefer try/except over hasattr?
https://stackoverflow.com/questions/903130/hasattr-vs-try-except-block-to-deal-with-non-existent-attributes

Review comment (Member):

Sorry, yes, we use try/except AttributeError because it is faster.
If you think that in this routine speed doesn't matter you can leave it as is 🙂
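For what it's worth, the usual argument can be illustrated with a micro-benchmark (hypothetical names; absolute timings vary by machine and Python version):

```python
import timeit

class Config:
    # hypothetical object with the attribute present
    x = 1

obj = Config()

def via_hasattr(o):
    # two lookups: hasattr probes the attribute, then we fetch it
    if hasattr(o, 'x'):
        return o.x
    return None

def via_try(o):
    # one lookup: fetch it and catch the failure if it is missing
    try:
        return o.x
    except AttributeError:
        return None

assert via_hasattr(obj) == via_try(obj) == 1
print('hasattr:', timeit.timeit(lambda: via_hasattr(obj), number=100_000))
print('try:    ', timeit.timeit(lambda: via_try(obj), number=100_000))
```

try/except wins when the attribute usually exists; hasattr avoids exception overhead when it is usually missing.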

Review comment (Member):

I wonder if it would be cleaner and faster to check if arg is not a str and then just do arg = iter(arg).

@mnicely (Contributor) commented Jun 2, 2020

So I tried swapping

if hasattr(arg, '__iter__') and not isinstance(arg, str):

with

if not isinstance(arg, str):
    try:
        arg = iter(arg)
    except AttributeError:
        return NotImplemented

Now most/all tests are failing. Are these not equivalent?

Review comment (Member):

The exception raised by iter() for an invalid (non-iterable) argument is

TypeError: 'object' object is not iterable

that is, a TypeError, not the AttributeError being caught. 😅 this might be confusing
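A minimal demonstration of the mismatch (coerce is a hypothetical stand-in for the proposed swap):

```python
# Why the swap is not equivalent: iter() raises TypeError for a
# non-iterable, not AttributeError, so the except clause never fires.
# And when arg IS iterable, it is silently replaced by a one-shot
# iterator, which downstream code expecting a sequence cannot index
# or iterate twice.
def coerce(arg):
    if not isinstance(arg, str):
        try:
            arg = iter(arg)
        except AttributeError:  # iter() never raises this
            return NotImplemented
    return arg

try:
    coerce(3)  # an int is not iterable
except TypeError as exc:
    print(exc)  # 'int' object is not iterable

result = coerce([1, 2, 3])
print(type(result))  # a list_iterator, no longer a list
```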

@grlee77 (Contributor) commented Mar 23, 2020

> The 8 tests that are not passing are actually apparently a bug in Scipy. I have submitted a bug report over there: scipy/scipy#11661. It does not handle mirror correctly for length-1 dimensions with the 1D functions.

A fix for this will be included in SciPy 1.5 (scipy/scipy#11683). You can avoid the failing test cases by decorating those cases with:

@testing.with_requires('scipy>=1.5.0')

@emcastillo added the cat:feature (New features/APIs) and st:awaiting-author (Awaiting response from author) labels Mar 25, 2020
@chainer-ci (Member):

Jenkins CI test (for commit 693a2cc, target branch master) failed with status FAILURE.

@emcastillo (Member):

Jenkins, test this please

@pfn-ci-bot (Collaborator):

Successfully created a job for commit 04e6ab3:

@coderforlife (Contributor, Author):

It looks like I forgot to decorate the TestFortranOrder tests with @testing.with_requires('scipy'). Sorry about that. I just committed that fix.

However, the vast majority of the failures were from the morphology code.

@chainer-ci (Member):

Jenkins CI test (for commit 04e6ab3, target branch master) failed with status FAILURE.

@coderforlife (Contributor, Author):

Looks good (except the morphology tests, which someone else needs to fix). I am curious about the coverage test: is it being run without scipy? It says none of my code (except the function declarations and imports) is covered...

@emcastillo (Member) commented Jun 24, 2020

Do you happen to know why the morphology tests are failing?

    _min_or_max_filter() takes 8 positional arguments but 9 were given
    module 'cupyx.scipy.ndimage.filters' has no attribute '_normalize_sequence'

_normalize_sequence was deleted in this PR; let's just restore it and clean it up in another PR. I can't merge this until the CIs are ok.

Don't care too much about the coverage test :)

@coderforlife (Contributor, Author):

_normalize_sequence was renamed to _fix_sequence_arg - I had no idea someone else was using these "protected" functions... it just needs a third argument now for the name of the argument (used in the exception message).

_min_or_max_filter doesn't have the structure argument anymore, I will have to figure that one out (it seems to be specifically created for the morphology functions). Additionally, the final argument is not a bool anymore but one of two strings ('min' or 'max').

So 2 easy things to fix and 1 thing I will have to look into, could be medium or hard.

@emcastillo (Member):

If you want you can restore the old functions and someone can fix them later in a follow-up PR

@asi1024 modified the milestones: v8.0.0b4, v8.0.0b5 Jun 24, 2020
@coderforlife (Contributor, Author):

I have fixed all of the issues. One other issue that didn't come up in the tests is that their tests used pytest.skip(), which caused problems when running the tests with unittest, as recommended in the contribution guide. So I switched them to raise unittest.SkipTest(...).
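For illustration, the change amounts to something like this (hypothetical test case; runnable under both the plain unittest runner and pytest):

```python
import unittest

class TestFilters(unittest.TestCase):
    # hypothetical test illustrating the skip mechanism
    def test_needs_scipy(self):
        scipy_available = False  # stand-in for the real import check
        if not scipy_available:
            # unittest.SkipTest is honoured by both the plain unittest
            # runner and pytest, whereas pytest.skip() only works when
            # the tests are run under pytest
            raise unittest.SkipTest('scipy is not available')
        # ... test body would go here ...

result = unittest.TestResult()
unittest.TestLoader().loadTestsFromTestCase(TestFilters).run(result)
print(result.skipped)  # the test is reported as skipped, not failed
```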

@emcastillo (Member):

Awesome work and thanks once again for patiently addressing our comments!🙇‍♂️

@emcastillo (Member):

Jenkins, test this please

@pfn-ci-bot (Collaborator):

Successfully created a job for commit 97c1c94:

@coderforlife (Contributor, Author):

It's okay - I already have the next PR planned and it will hopefully go smoother. It adds all of the other basic filters on top of this core set (uniform_filter1d, uniform_filter, gaussian_filter1d, gaussian_filter, prewitt, sobel, generic_laplace, laplace, gaussian_laplace, generic_gradient_magnitude, gaussian_gradient_magnitude, rank_filter, median_filter, and percentile_filter). I have them mostly written (I just need to update them for recent changes to this core code). However, there are no tests for them yet.

I am thinking of doing them in two batches: the non-linear filters (the last three in that list), since they actually have a bit of new CUDA code with them, then all of the linear filters (which just call the convolve or convolve1d functions).

I have also started working on generic_filter, but that is going to be a bit different since it has to parse another reduction kernel to build it. I have it working with a few simple cases of reduction kernels and fused functions, but it needs a bit more work. Of course, it can't quite match the scipy function (which takes any Python function, as well as LowLevelCallable objects wrapping Cython or C functions), but I think taking fuseable/fused Python functions and ReductionKernels would match as closely as possible.

@chainer-ci (Member):

Jenkins CI test (for commit 97c1c94, target branch master) succeeded!

@emcastillo (Member):

wow, that's awesome!

This PR will be merged after the release today; right now the code is frozen.

@emcastillo added the st:test-and-merge (deprecated; Ready to merge after test pass) label Jun 25, 2020
@mergify bot merged commit 60c620d into cupy:master Jun 25, 2020
@jakirkham (Member):

This is great! Thanks everyone for pushing this forward 😄

@asi1024 removed the st:awaiting-author (Awaiting response from author) label Jan 19, 2022