
WIP, ENH: signal: test_spectral array API support #19001

Draft · wants to merge 22 commits into base: main
Conversation

@tylerjereddy (Contributor) commented Aug 1, 2023

Towards gh-20678

TODO:

signal array API test module checklist (probably to be moved to an issue, since this PR now focuses on a more limited scope than all of signal):
  • test_result_type.py
  • test_dltisys.py
  • test_waveforms.py
  • test_fir_filter_design.py
  • test_windows.py
  • test_czt.py
  • test_max_len_seq.py
  • test_wavelets.py (deprecated?)
  • test_spectral.py
  • test_ltisys.py
  • test_upfirdn.py
  • test_filter_design.py
  • test_savitzky_golay.py
  • test_array_tools.py
  • test_signaltools.py
  • test_bsplines.py
  • test_peak_finding.py
  • test_cont2discrete.py
  • test_short_time_fft.py

@ilayn (Member) left a comment:

Some comments; my ignorance about the array API notwithstanding, it seems we have a lot of work ahead to adopt this API. Unfortunately, the same seems to be true for array vendors. This is exactly what I was hoping to get out of my comments in #18915, so thanks a lot for this. Very illustrative.

A[:, 0] = xp.arange(1, Npts + 1, dtype=dtype) / Npts
sl = slice(int(bp[m]), int(bp[m + 1]))
# NOTE: lstsq isn't in the array API standard
if "cupy" in xp.__name__ or "torch" in xp.__name__:

Isn't this library-specific conditional an anti-pattern already?

new_bp[:size(bp)] = bp
new_bp[size(bp)] = N
bp = xp.sort(xp.unique(xp.asarray([0] + new_bp)))
if xp.any(bp > N):

We sorted bp already, so you can compare just the last element instead to save some pennies.

The error message is also a bit cryptic, though that's not directly relevant here.

new_bp = xp.empty(size(bp) + 1)
new_bp[:size(bp)] = bp
new_bp[size(bp)] = N
bp = xp.sort(xp.unique(xp.asarray([0] + new_bp)))

Isn't there some way we can avoid doing this kind of back and forth?

I found this https://data-apis.org/array-api/latest/API_specification/generated/array_api.unique_values.html?highlight=unique#array_api.unique_values but it doesn't have the NumPy functionality.

For example, numpy.unique has a return_index keyword that removes the need for sorting, but I don't know whether the array API supports it. That is my biggest concern about adopting this API: potential code divergence later in different packages. Mini demo:

>>> import numpy as np
>>> arr = np.array([1.0, np.sqrt(2), np.sqrt(2), 1/3, 1/3, np.pi])
>>> arrun, ind = np.unique(arr, return_index=True)
>>> arr[ind]
array([0.33333333, 1.        , 1.41421356, 3.14159265])

So indeed you can embed bp into a larger array:

new_bp = xp.empty(size(bp)+2, dtype=dtype)
new_bp[0] = 0
new_bp[-1] = N  # -1 is not working anymore in array_api?
new_bp[1:-1] = bp 

I know bp is typically a small array, but the code still gets really complicated for a relatively simple operation.

Reply (Member):

Yes, this code should not get this complicated. np.unique(..., return_index=True) can be replaced by xp.unique_all. I'm not sure whether that gets rid of the sorting operation here, because sorting is already in the original code; but if that original code can be improved first, there is a matching unique_* function that can be used.
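For illustration, the embedding suggested above can be written without the list round-trip. This is a sketch only, with NumPy standing in for the `xp` namespace; `normalize_breakpoints` is a hypothetical helper name, not code from this PR:

```python
import numpy as np  # stand-in for an array API namespace ``xp``

def normalize_breakpoints(bp, N):
    # Embed bp between the mandatory endpoints 0 and N instead of
    # concatenating Python lists, then deduplicate. Under the array
    # API standard, np.unique would become xp.unique_values; NumPy's
    # unique also returns the values sorted, which the caller relies on.
    bp = np.asarray(bp)
    new_bp = np.empty(bp.size + 2, dtype=bp.dtype)
    new_bp[0] = 0
    new_bp[-1] = N
    new_bp[1:-1] = bp
    new_bp = np.unique(new_bp)  # xp.unique_values(new_bp), plus sorting
    # new_bp is now sorted, so checking the last element suffices
    if new_bp[-1] > N:
        raise ValueError(f"breakpoints must not exceed data length N = {N}")
    return new_bp
```

With `bp = [3, 1, 3]` and `N = 10` this yields `[0, 1, 3, 10]`.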

Pxy = np.median(Pxy, axis=-1)
Pxy /= bias
Pxy = xp.median(Pxy, axis=-1)
# for PyTorch, Pxy is torch.return_types.median

Same anti-pattern comment here and a bit below. Isn't the array API unapologetically expecting all arrays to uniformly have the same methods and properties?

try:
is_complex = xp.iscomplexobj(x)
except AttributeError:
# torch shim

Same troubling line

if "torch" in xp.__name__:
strides = x.stride()[:-1]+(step*x.stride()[-1], x.stride()[-1])
result = xp.as_strided(x, size=shape, stride=strides)
elif "cupy" in xp.__name__:

This is more significant to me than the performance hit below. I would instead keep the NumPy code branch, to preserve fair-weather SciPy performance first, and deal with other libraries later, whether or not they support this.

if type in ['constant', 'c']:
ret = data - np.mean(data, axis, keepdims=True)
ret = data - xp.mean(xp.asarray(data, dtype=dtype), axis=axis, keepdims=True)

Can't we just do the conversion with the right dtype once at the top? It seems like a lot of asarray calls are creeping in unintentionally.
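A sketch of what that could look like, with the dtype conversion done once up front (NumPy stands in for `xp`; `detrend_constant` is a hypothetical name, not SciPy's actual implementation):

```python
import numpy as np  # stand-in for ``xp``

def detrend_constant(data, axis=-1):
    data = np.asarray(data)
    # Convert to a floating dtype once, at the top, instead of
    # sprinkling asarray calls through the function body.
    if data.dtype not in (np.float32, np.float64,
                          np.complex64, np.complex128):
        data = data.astype(np.float64)  # xp.astype(data, xp.float64)
    # 'constant' detrending: subtract the mean along ``axis``
    return data - np.mean(data, axis=axis, keepdims=True)
```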

@ev-br (Member) commented Aug 1, 2023

One other, potentially tangentially related thing while we're on the subject of supporting other array types: there is an ongoing effort to provide a clone of the scipy.signal API in CuPy, as cupyx.scipy.signal. Some of these functions might be implemented using dedicated CUDA kernels, etc.

So there might be value in trying to detect upfront whether all arguments are CuPy-compatible and delegating to the matching cupyx.scipy.signal function. This does not, of course, preclude transforming the internal implementations to the array API so that other implementors can benefit; just something to keep in mind for the future. Maybe we'd want to invent a way to register an alternative implementation and its preconditions.

@h-vetinari (Member) commented Aug 2, 2023

So there might be value in trying to detect upfront if all arguments are cupy-compatible and delegate to the matching cupyx.scipy.signal function.

I'm not sure that's such a good thing. It would give us an optional dependency on CuPy, which itself optionally depends on SciPy. That's a kind of weak dependency cycle (though solvable), but mainly it would complicate our testing here (at least one job each with and without CuPy; and to really test the GPU path, we'd need a GPU in our CI), and for, IMO, very little gain.

If it stays only in cupy, the message is clear: use the cupy compat layer to utilize CUDA. If we start optionally dispatching to cupy, it becomes much harder to explain to users what they can expect.

I'm not saying it's a dealbreaker, but a priori I'm not enamoured with the idea.

@rgommers rgommers added the array types Items related to array API support and input array validation (see gh-18286) label Aug 16, 2023
@lucascolley (Member) left a comment:

Hi Tyler, looks like the progress we have made elsewhere will put this PR in much better shape after a few changes!

newdata_shape = newdata.shape
newdata = newdata.reshape(N, -1)

if not overwrite_data:
newdata = newdata.copy() # make sure we have a copy
if newdata.dtype.char not in 'dfDF':
newdata = xp.asarray(newdata, copy=True) # make sure we have a copy

We can use copy from _array_api.py now.

newdata = newdata.copy() # make sure we have a copy
if newdata.dtype.char not in 'dfDF':
newdata = xp.asarray(newdata, copy=True) # make sure we have a copy
if newdata.dtype not in [xp.float64, xp.float32, xp.complex128, xp.complex64]:
newdata = newdata.astype(dtype)

this should be xp.astype(newdata, dtype).

if type in ['constant', 'c']:
ret = data - np.mean(data, axis, keepdims=True)
ret = data - xp.mean(xp.asarray(data, dtype=dtype), axis=axis, keepdims=True)

xp.astype(data, dtype) is clearer

A[:, 0] = xp.arange(1, Npts + 1, dtype=dtype) / Npts
sl = slice(int(bp[m]), int(bp[m + 1]))
# NOTE: lstsq isn't in the array API standard
if "cupy" in xp.__name__ or "torch" in xp.__name__:

Perhaps it's best to just use scipy.linalg in this PR and let it fail? Then, once lstsq works with these libraries, no changes will be needed here.

A separate PR could address this library-specific special-casing for now, like Matt's gh-19023. I think it makes sense to separate the array API work, which is here to stay, and this sort of "dispatching" which will hopefully be replaced by something more robust eventually.

Pxy = np.median(Pxy, axis=-1)
Pxy /= bias
Pxy = xp.median(Pxy, axis=-1)
# for PyTorch, Pxy is torch.return_types.median

Looks like we need to write an array-agnostic median à la cov (or push for it to be added to the standard). Again, I would remove the special-casing for now and let this fail, so that it's clear where work needs to be done.

x = np.asarray(x)
# Ensure we have xp.arrays, get outdtype
x = xp.asarray(x)
# https://github.com/data-apis/array-api-compat/issues/43

This should be possible to clean up now, after data-apis/array-api-compat#55.


if return_onesided:
if np.iscomplexobj(x):
try:
is_complex = xp.iscomplexobj(x)

Looks like we should add an is_complex to _array_api.py, which I think would be x.dtype in {xp.complex64, xp.complex128}.
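A minimal sketch of such a helper (hypothetical, written against NumPy as the `xp` stand-in; the eventual `_lib._array_api` version may differ):

```python
import numpy as np  # stand-in for ``xp``

def is_complex(x, xp=np):
    # Portable replacement for np.iscomplexobj: the array API standard
    # guarantees exactly these two complex dtypes exist in a namespace.
    return x.dtype == xp.complex64 or x.dtype == xp.complex128
```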


# Detrend each data segment individually
result = detrend_func(result)

# Apply window by multiplication
# NOTE: torch device shim -- needs

This should be fixed after pytorch/pytorch#106773 I think.

x = np.zeros(16)
assert_allclose(array_api_compat.to_device(p, "cpu"),
q, atol=1e-7, rtol=1e-7)
assert_allclose(array_api_compat.to_device(f, "cpu"),
@lucascolley (Member) commented Sep 7, 2023:

Quite a few changes to be made here to use the new xp_assert_close.

f, p = welch([])
@array_api_compatible
def test_empty_input(self, xp):
val = xp.asarray([]) if SCIPY_ARRAY_API else []

Suggested change
val = xp.asarray([]) if SCIPY_ARRAY_API else []
val = xp.asarray([])

@tylerjereddy (Contributor, Author):

I rebased on the latest main and made a few fixups, but CI should be skipped entirely for now because I still need to address tons of comments and modernizations following all the recent array API infra progress.

For SCIPY_DEVICE=cuda python dev.py test -j 32 -b all on this branch, I get 39 failures, vs. 9 on main locally.

Most of the failures are of the form AttributeError: module 'numpy.array_api' has no attribute 'fft'. There are also some threading failures I saw with CUDA + high concurrency that Ralf and Lucas did not; in any case, this branch still needs more work.

@ilayn (Member) commented Oct 28, 2023

You can ignore most of my comments, as we discussed bits and pieces of this in different issues (mainly the linalg one), and my understanding is now better than it was when I was punching out those comments above. Hence, all good from my side.

freqs, Pxx = csd(x, x, fs=fs, window=window, nperseg=nperseg,
noverlap=noverlap, nfft=nfft, detrend=detrend,
return_onesided=return_onesided, scaling=scaling,
axis=axis, average=average)

return freqs, Pxx.real
if Pxx.dtype in {xp.complex64, xp.complex128}:

We have an is_complex helper in _lib._array_api.py

Comment on lines 1919 to 1921
is_complex = x.dtype in {xp.complex64, xp.complex128}

if is_complex:

This is in _lib now.

Comment on lines 1936 to 1945
if hasattr(xp, 'fft'):
freqs = xp.fft.fftfreq(nfft, 1/fs)
else:
freqs = xp.asarray(np.fft.fftfreq(nfft, 1/fs))

elif sides == 'onesided':
freqs = sp_fft.rfftfreq(nfft, 1/fs)
if hasattr(xp, 'fft'):
freqs = xp.fft.rfftfreq(nfft, 1/fs)
else:
freqs = xp.asarray(np.fft.rfftfreq(nfft, 1/fs))

I think that freqs = scipy.fft.fftfreq(..., xp=xp) will work here, with negligible overhead.

else:
result = result.real
func = sp_fft.rfft
if result.dtype in {xp.complex64, xp.complex128}:

Can use is_complex here too.

Comment on lines 2056 to 2059
if hasattr(xp, "fft"):
func = xp.fft.fft
else:
func = np.fft.fft

scipy.fft.fft?

Comment on lines 2063 to 2066
if hasattr(xp, "fft"):
func = xp.fft.rfft
else:
func = np.fft.rfft

scipy.fft.rfft?

newdata = newdata.copy() # make sure we have a copy
if newdata.dtype.char not in 'dfDF':
newdata = xp.asarray(newdata, copy=True) # make sure we have a copy
if newdata.dtype not in [xp.float64, xp.float32, xp.complex128, xp.complex64]:

Maybe worth a new is_floating helper?

if np.iscomplexobj(Pxy):
Pxy = (np.median(np.real(Pxy), axis=-1)
+ 1j * np.median(np.imag(Pxy), axis=-1))
if Pxy.dtype in [xp.complex64, xp.complex128]:

is_complex here too.

sides = 'twosided'
warnings.warn('Input data is complex, switching to '
'return_onesided=False')
else:
sides = 'onesided'
if not same_data:
if np.iscomplexobj(y):
if xp.iscomplexobj(y):

Looks like this isn't in the standard? Do we just need is_complex here too?

nperseg - noverlap)/float(fs)
if boundary is not None:
time -= (nperseg/2) / fs

result = result.astype(outdtype)
result = xp.asarray(result, dtype=outdtype)

xp.astype(...) is clearer

@tylerjereddy (Contributor, Author) replied:

This one was intentional because astype fails with E AttributeError: 'numpy.ndarray' object has no attribute '_array'. I didn't look into it any deeper, but it is for the numpy.array_api backend.

Reply (Member):

hmm maybe result is arriving as a regular np array? IIUC it should be a np.array_api array so astype should work.

result = xp.as_strided(x, size=shape, stride=strides)
elif "cupy" in xp.__name__:
strides = x.strides[:-1]+(step*x.strides[-1], x.strides[-1])
result = xp.lib.stride_tricks.as_strided(x, shape=shape,

As mentioned above, I think the special-casing should be saved for a separate PR.

@tylerjereddy (Contributor, Author):

I addressed a few more comments (not all, yet). Some notes on where I'm seeing challenges:

  • some of scipy.fft needed shims for device keyword handling with cupy and NumPy it seems, at least in my hands
  • xp.mean() strictly requires "real" type input, though most backends let us cheat on that for now
  • the xp.unique reorganization isn't widely adopted yet, I don't think? Added a shim for that (unique_all, unique_values, ...)
  • some test skips added for numpy.array_api because of much stricter casting rules in some cases
  • perhaps most importantly, almost all current failures related to TestWelch come from the lack of moveaxis in the array API standard (numpy.array_api; with other backends we can cheat because they have it). Any suggestion for a graceful substitute?
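One possible graceful substitute, since `permute_dims` is in the standard: a single-axis `moveaxis` built from a transposition (a sketch only, with NumPy standing in for `xp`; this is not SciPy's eventual shim):

```python
import numpy as np  # stand-in for ``xp``

def moveaxis(x, source, destination):
    # Build the permutation that moves one axis, then apply it with
    # transpose (xp.permute_dims under the array API standard).
    ndim = x.ndim
    source %= ndim        # normalize negative axis indices
    destination %= ndim
    order = [n for n in range(ndim) if n != source]
    order.insert(destination, source)
    return np.transpose(x, order)  # xp.permute_dims(x, tuple(order))
```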

if hasattr(xp, 'fft') and xp.__name__ != 'numpy':
if (hasattr(xp, 'fft') and xp.__name__ != 'numpy' and
"array_api_compat.numpy" not in xp.__name__ and
"cupy" not in xp.__name__):
@lucascolley (Member) commented Oct 29, 2023:

The compat numpy shim looks like a correct bug fix - you can use is_numpy from _lib to make this cleaner. We didn't catch this since we only test this function with 'raw' np arrays.

Edit: you may want is_numpy etc. to accept the raw versions as well as the compat versions. IIRC this shouldn't break anything in fft, I left it as just the compat versions as that was all we needed at the time.

For CuPy, I think we should remove this shim and skip any tests on CuPy which fail because of it. Then once array-api-compat implements fft, we can just remove the skip. The temporary shims seem unnecessary since there is no rush to get CuPy working partially in a release, but that's just my opinion.

We could add a comment into the code here to state that array-api-compat hasn't yet implemented fft if you want to; there will be a few issues like this until it has.

@@ -205,7 +207,9 @@ def rfftfreq(n, d=1.0, *, xp=None, device=None):
xp = np if xp is None else xp
# numpy does not yet support the `device` keyword
# `xp.__name__ != 'numpy'` should be removed when numpy is compatible
if hasattr(xp, 'fft') and xp.__name__ != 'numpy':
if (hasattr(xp, 'fft') and xp.__name__ != 'numpy' and
"array_api_compat.numpy" not in xp.__name__ and

is_numpy here too.

dtype = 'd'
data = xp.asarray(data)
dtype = data.dtype
if data.dtype not in [xp.float32, xp.float64, xp.complex64, xp.complex128]:

A new is_floating helper could be used here too, as not is_floating(...).

@tylerjereddy (Contributor, Author):

From the latest commit:

    MAINT: PR 19001 revisions
    
    * replace an occurrence of `xp.is_complex()` with
    `is_complex()` (our internal shim) in `_spectral_helper()`;
    `is_complex` is not standard
    
    * many tests in `TestWelch` are now skipped with
    `numpy.array_api` backend because of the lack of the
    non-standard `moveaxis`
    
    * use our internal `is_complex()` helper in `csd()` as
    well, based on reviewer feedback
    
    * based on reviewer feedback, use `is_numpy(xp)` in the
    adjusted `fft._helper` filters, and don't filter `cupy`
    here; as a result, a number of `TestWelch` tests have
    a conditional skip added for `cupy` backend until `array_api_compat`
    shims around `fft`
    
    * clean up some unused imports based on linter whining
    
    * this patch appears to place this branch on parity with `main`
    for CUDA-backend array API full suite testing, though major
    tasks remain to be addressed, including switching to newer
    API testing machinery and probably covering more of `signal`
    before we'd seriously consider moving forward
    
    [skip ci] [skip circle]

@tylerjereddy (Contributor, Author):

I think I've pushed in most of the modernization to new xp_assert.. testing machinery now. Some things that came up while doing that:

  • assert_array_almost_equal_nulp doesn't have an equivalent array API testing shim yet?
  • in some cases I have to turn off namespace or dtype checking since these new tests are stricter (I bet some of those we could enforce with careful design, others probably need to stay for now...)
  • a small number of cases I couldn't replace because of non-compliant ops in the assertion lines themselves (or, at least, it may require more thought... like np.trapz and so on)

@tylerjereddy (Contributor, Author):

The draft conversion of TestCSD to array API testing was pushed, though it is slightly depressing because a huge number of skips were needed for moveaxis, np.pad, and array-api-compat fft support. That said, I believe there is still some value in clearly labeling the active blockers for each test, since it makes it clearer which things are waiting on the ecosystem, and makes it easier for reviewers to identify anything they don't agree should be a blocker/skipped.

Perhaps I'll try migrating TestPeriodogram next.

@tylerjereddy tylerjereddy changed the title WIP, ENH: signal (welch) array API support WIP, ENH: signal array API support Nov 8, 2023
@lucascolley (Member) commented Nov 14, 2023

Want to move from numpy.testing.assert_* to bare assert while we're here? (Maybe just for what is already diffed, in which case line 1545.)

@tylerjereddy (Contributor, Author):

All tests in test_spectral.py should now have draft array API testing implementations/conversions, along with skip markers where I needed them. I've also updated the original PR comment above to contain a checklist of the signal test module migration progress. So, that's just the first one.

@@ -18,120 +17,200 @@
from scipy.signal.tests._scipy_spectral_test_shim import stft_compare as stft
from scipy.signal.tests._scipy_spectral_test_shim import istft_compare as istft
from scipy.signal.tests._scipy_spectral_test_shim import csd_compare as csd
from scipy.conftest import array_api_compatible, skip_if_array_api_backend
from scipy._lib.array_api_compat import array_api_compat

This import path was cleaned up recently.

Suggested change
from scipy._lib.array_api_compat import array_api_compat
from scipy._lib import array_api_compat

@lucascolley lucascolley added the enhancement A new feature or improvement label Dec 23, 2023
@lucascolley lucascolley marked this pull request as draft March 14, 2024 21:33
@tylerjereddy tylerjereddy changed the title WIP, ENH: signal array API support WIP, ENH: signal array API support (test_spectral) Mar 27, 2024
* use `xp.real(x)` instead of `x.real` for API compat
(similar shim for `mean`)

* guard use of `fft` optional extension

[skip ci] [skip circle]
* handle "dispatch" to FFT functionality via `scipy.fft`,
based on reviewer feedback; this also allowed usage of
`xp.astype()` in at least one additional place because
my manual FFT shims did not convert back to the appropriate
array type

* I did need some `scipy.fft` shims for `device` keyword
handling though...

* add a shim for `unique` vs. `unique_values`

* some test skips for `numpy.array_api` dtype/casting
issues

* use `is_complex()` in several places, based on reviewer
feedback

[skip ci] [skip circle]
* replace an occurrence of `xp.is_complex()` with
`is_complex()` (our internal shim) in `_spectral_helper()`;
`is_complex` is not standard

* many tests in `TestWelch` are now skipped with
`numpy.array_api` backend because of the lack of the
non-standard `moveaxis`

* use our internal `is_complex()` helper in `csd()` as
well, based on reviewer feedback

* based on reviewer feedback, use `is_numpy(xp)` in the
adjusted `fft._helper` filters, and don't filter `cupy`
here; as a result, a number of `TestWelch` tests have
a conditional skip added for `cupy` backend until `array_api_compat`
shims around `fft`

* clean up some unused imports based on linter whining

* this patch appears to place this branch on parity with `main`
for CUDA-backend array API full suite testing, though major
tasks remain to be addressed, including switching to newer
API testing machinery and probably covering more of `signal`
before we'd seriously consider moving forward

[skip ci] [skip circle]
* start modernizing the array API tests to use the new
`xp_assert..` functions

[skip ci] [skip circle]
* continue (finish?) modernizing the array API tests to use the new
`xp_assert..` functions

[skip ci] [skip circle]
* start converting `TestCSD` to array API testing; most of the tests
I've converted require substantial backend skipping because of i.e.,
`moveaxis` and `pad` usage, or lack of `fft` coverage by
array-api-compat

[skip ci] [skip circle]
* finish the draft conversion of `TestCSD` to array
API testing; unfortunately, a large number of backend
skips were needed across the board (many of these are
known issues like `moveaxis` and array_api_compat fft support)

[skip ci] [skip circle]
* convert 8 tests in `TestPeriodogram` to array API testing approach,
along with appropriate skips where needed

[skip ci] [skip circle]
* convert 4 tests in `TestPeriodogram` to array API testing approach,
along with appropriate skips where needed

[skip ci] [skip circle]
* convert remaining tests in `TestPeriodogram` to array API testing approach,
along with appropriate skips where needed

[skip ci] [skip circle]
* convert tests in `TestSpectrogram` to array API testing approach,
along with appropriate skips where needed

[skip ci] [skip circle]
* convert tests in `TestLombscargle` to array API testing approach,
along with appropriate skips where needed

[skip ci] [skip circle]
* convert `TestSTFT::test_input_validation` to array API testing approach,
along with appropriate skips where needed

* this required a fairly substantial number of changes, and we still
end up having to skip the usual three backends, for now

[skip ci] [skip circle]
* convert the next two `TestSTFT` tests to array API testing approach

[skip ci] [skip circle]
* convert the remaining `test_spectral.py` tests to array API testing approach

[skip ci] [skip circle]
* deal with large rebasing mess and the need
to adopt modern array API testing procedures
that have changed a few times in SciPy now...

* there are still some array API test failures
in `signal` to deal with...

[skip ci] [skip circle] [skip cirrus]
* fix failing `test_window_correction`, `test_window_external`,
`test_roundtrip_real`

* add some missing imports to `test_spectral.py` (these were
causing test failures in the regular suite)

* restore some custom/fast paths for `sliding_window_view`

[ci skip] [skip ci]
* haven't flushed the CI on this branch in months

[skip cirrus] [skip circle]
* fix linter issues reported in CI

* clean up a test failure related to `np.trapz`
usage detected by the CI

[skip cirrus] [skip circle]
* fix a few testing issues: NumPy `trapezoid` usage
is now conditional on NumPy version, `test_roundtrip_not_nola`
skipping reasons are now properly formatted, and `_spectral_helper`
has a broader check on the array namespace

[skip cirrus] [skip circle]
@tylerjereddy (Contributor, Author) left a comment:

@lucascolley @mdhaber this is passing the array API tests in CI again, and locally with GPU, but clearly still has issues given some of my recent comments and the fact that things have evolved over the months since I started working on this.

I believe a firm decision on how we're going to handle the sliding window view would be useful. Should we special-case CuPy, torch, and NumPy so they are a few orders of magnitude faster with welch, but error out for other hypothetical array API backends? Or add the slow for loops for any other array API backends, so that they likely suffer a major performance hit but can at least execute?
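For concreteness, the slow-but-portable branch could look something like the sketch below (`sliding_windows_fallback` is a hypothetical name; the fast as_strided / sliding_window_view paths for NumPy, CuPy, and torch would stay special-cased). It gathers each segment with plain indexing and stacks them, using only operations the standard provides:

```python
import numpy as np  # stand-in for ``xp``

def sliding_windows_fallback(x, nperseg, step):
    # One slice per window; xp.stack is in the array API standard,
    # unlike as_strided / sliding_window_view. This makes O(n_windows)
    # copies, hence orders of magnitude slower than the strided view.
    starts = range(0, x.shape[-1] - nperseg + 1, step)
    return np.stack([x[..., s:s + nperseg] for s in starts], axis=-2)
```

For `x = np.arange(8)`, `nperseg=4`, `step=2`, this produces windows starting at 0, 2, and 4.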

# see: https://github.com/data-apis/array-api/issues/641#issuecomment-1604884351
if is_cupy(xp):
result = xp.lib.stride_tricks.sliding_window_view(
x, window_shape=nperseg, axis=-1, writeable=True
@tylerjereddy (Contributor, Author):

Trying to re-run the benchmarks from the SciPy conference array API proceedings paper (https://github.com/data-apis/scipy-2023-presentation) with CuPy 13.1.0 (which actually has sliding_window_view), I see:

  File "/home/treddy/python_venvs/py_311_scipy_dev/lib/python3.11/site-packages/cupy/lib/stride_tricks.py", line 119, in sliding_window_view
    raise NotImplementedError("Writeable views are not supported.")
NotImplementedError: Writeable views are not supported.

@@ -2007,10 +2060,11 @@ def _fft_helper(x, win, detrend_func, nperseg, noverlap, nfft, sides):
if sides == 'twosided':
func = sp_fft.fft
else:
result = result.real
if is_complex(result, xp=xp):
@tylerjereddy (Contributor, Author):

Just above here, torch on GPU fails the proceedings paper benchmarks as well with:

Running:  python bench.py scipy torch_gpu
Traceback (most recent call last):
  File "/home/treddy/github_projects/scipy-2023-presentation/benchmarks/bench.py", line 130, in <module>
    main(benchmark, namespace)
  File "/home/treddy/github_projects/scipy-2023-presentation/benchmarks/bench.py", line 78, in main
    bench(namespace, print_times=False)
  File "/home/treddy/github_projects/scipy-2023-presentation/benchmarks/bench.py", line 119, in bench_scipy
    f, p = welch(x, nperseg=8)
           ^^^^^^^^^^^^^^^^^^^
  File "/home/treddy/python_venvs/py_311_scipy_dev/lib/python3.11/site-packages/scipy/signal/_spectral_py.py", line 467, in welch
    freqs, Pxx = csd(x, x, fs=fs, window=window, nperseg=nperseg,
                 ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/home/treddy/python_venvs/py_311_scipy_dev/lib/python3.11/site-packages/scipy/signal/_spectral_py.py", line 610, in csd
    freqs, _, Pxy = _spectral_helper(x, y, fs, window, nperseg, noverlap,
                    ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/home/treddy/python_venvs/py_311_scipy_dev/lib/python3.11/site-packages/scipy/signal/_spectral_py.py", line 1963, in _spectral_helper
    result = _fft_helper(x, win, detrend_func, nperseg, noverlap, nfft, sides)
             ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/home/treddy/python_venvs/py_311_scipy_dev/lib/python3.11/site-packages/scipy/signal/_spectral_py.py", line 2057, in _fft_helper
    result = win * result
             ~~~~^~~~~~~~
RuntimeError: Expected all tensors to be on the same device, but found at least two devices, cuda:0 and cpu!

# violating the array API standard gives us 4 orders of magnitude
# better performance via views/striding, so preserving some special
# casing
# see: https://github.com/data-apis/array-api/issues/641#issuecomment-1604884351
Contributor Author

I believe this branch used to have shims for array API support for libraries other than CuPy and torch, along the lines of the suggestion at #18286 (comment).

The underlying code in main also switched from as_strided to sliding_window_view a few months ago, so this code path has mutated for the Nth time. Either way, this passes the array API tests locally with my current skips, but it doesn't really support the array API standard unless we restore a codepath that, per the linked comment above, is a few orders of magnitude slower.
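A standard-compliant fallback for other backends — the orders-of-magnitude-slower codepath mentioned above — could look something like this sketch (`_segment_fallback` is a hypothetical name, not the branch's actual code):

```python
import numpy as np

def _segment_fallback(x, nperseg, noverlap, xp=np):
    # Array-API-friendly fallback: build each overlapping segment
    # by slicing along the last axis and stack the copies, instead
    # of relying on non-standard strided views.
    step = nperseg - noverlap
    starts = range(0, x.shape[-1] - nperseg + 1, step)
    return xp.stack([x[..., s:s + nperseg] for s in starts], axis=-2)

segs = _segment_fallback(np.arange(10.0), nperseg=4, noverlap=2)
```

This uses only standard array API operations (indexing and `stack`), so any conforming namespace could execute it — at the cost of one copy per segment.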

Member

This looks pretty good. I'd only suggest moving the sliding_window_view function to scipy/_lib/_array_api.py. Then it will work with a one-line change here, and later in any other place where we may want to use sliding_window_view.

Failing for now on non-numpy/cupy/torch and just adding a TODO comment seems okay.
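A sketch of what such a shared helper in scipy/_lib/_array_api.py might look like (hypothetical; the cupy and torch branches assume cupy.lib.stride_tricks.sliding_window_view and Tensor.unfold respectively, and other namespaces fail with a TODO as suggested):

```python
import numpy as np

def sliding_window_view(x, window_shape, xp=np):
    # Hypothetical shared helper: dispatch to a native zero-copy
    # sliding-window implementation where one is known to exist.
    name = xp.__name__
    if name == 'numpy':
        return np.lib.stride_tricks.sliding_window_view(x, window_shape)
    if 'cupy' in name:
        return xp.lib.stride_tricks.sliding_window_view(x, window_shape)
    if 'torch' in name:
        # torch spells the same view as Tensor.unfold(dim, size, step)
        return x.unfold(-1, window_shape, 1)
    # TODO: fall back to a slow copying implementation for other
    # array API namespaces instead of failing outright.
    raise NotImplementedError(f"no sliding_window_view for {name}")

out = sliding_window_view(np.arange(6.0), 3, xp=np)
```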

@tylerjereddy
Contributor Author

I guess I could also scope the relevant parts of signal into array API CI testing here, though there's still a fair bit of code work to do before it's worth churning the CI just yet.

@lucascolley
Member

> Should we special case CuPy, torch, and NumPy so they are a few orders of magnitude faster with welch?
>
> Or add the slow for loops for any other hypothetical array API backends, so they likely suffer a major performance hit but at least can execute?

'Both' sounds like the right answer to me.

@tylerjereddy
Contributor Author

One other question as I scan through the diff of test skips: any movement or guidance on assert_array_almost_equal_nulp-style tests for the array API?
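One possible backend-agnostic stand-in, assuming no official guidance yet: translate an n-ulp tolerance into a relative tolerance of roughly n * eps, usable with any assert_close-style helper (`nulp_as_rtol` is a hypothetical name, not an established scipy pattern):

```python
import numpy as np

def nulp_as_rtol(dtype, nulp=1):
    # Near 1.0 one ulp equals the machine epsilon, so an n-ulp
    # check maps roughly onto a relative tolerance of n * eps.
    return nulp * np.finfo(dtype).eps

rtol = nulp_as_rtol(np.float64, nulp=2)
np.testing.assert_allclose(1.0, 1.0 + np.finfo(np.float64).eps, rtol=rtol)
```

This is only an approximation: ulp spacing varies across the exponent range, so it is looser than a true nulp comparison away from 1.0.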

return ret
else:
dshape = data.shape
N = dshape[axis]
bp = np.sort(np.unique(np.concatenate(np.atleast_1d(0, bp, N))))
if np.any(bp > N):
if isinstance(bp, int):
Member

This if-else doesn't look like it should be here. bp is an array-like of ints; just coercing it to an array on its own line should be the way to go.
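The unconditional coercion being suggested could be a single line, mirroring the breakpoint handling already shown in the diff context above (sketch with made-up values for N and bp):

```python
import numpy as np

N = 10   # number of samples along the detrend axis
bp = 4   # breakpoints: a scalar or array-like of ints
# Single unconditional coercion instead of an int/array if-else;
# atleast_1d handles scalars and sequences alike.
bp = np.sort(np.unique(np.concatenate(([0], np.atleast_1d(bp), [N]))))
if np.any(bp > N):
    raise ValueError("Breakpoints must be less than length "
                     "of data along given axis.")
```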

@lucascolley lucascolley changed the title WIP, ENH: signal array API support (test_spectral) WIP, ENH: signal: test_spectral array API support May 17, 2024
@DietBru
Contributor

DietBru commented May 23, 2024

Sorry for being a bit late to the discussion. An alternative solution to the sliding_window_view problem would be to implement a _spectral_helper-free csd function. This would allow deprecating the following functions in file _spectral_py.py:

For testing feature parity between the existing _spectral_helper and the new ShortTimeFFT class, I had implemented _spect_helper_csd, which is a ShortTimeFFT-based _spectral_helper implementation for wrapping the csd function in unit testing. It could be used to replace the existing csd function (since it already passes all unit tests).

Until now, I did not really see the need, since the replacement would not improve the functionality of csd.
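The idea of computing csd without _spectral_helper — averaging the windowed per-segment FFT products directly — can be sketched in plain NumPy (a toy Welch-style estimator, not DietBru's _spect_helper_csd; no detrending, and only Hann/50%-overlap defaults):

```python
import numpy as np
from numpy.lib.stride_tricks import sliding_window_view

def toy_csd(x, y, fs=1.0, nperseg=8):
    # Hann-windowed, 50%-overlap cross spectral density: average
    # the per-segment conj(X) * Y products over all segments.
    win = np.hanning(nperseg)
    step = nperseg // 2
    xs = sliding_window_view(x, nperseg)[::step] * win
    ys = sliding_window_view(y, nperseg)[::step] * win
    X, Y = np.fft.rfft(xs), np.fft.rfft(ys)
    scale = 1.0 / (fs * (win * win).sum())
    Pxy = scale * (np.conj(X) * Y).mean(axis=0)
    Pxy[1:-1] *= 2  # one-sided spectrum: fold in negative frequencies
    return np.fft.rfftfreq(nperseg, 1 / fs), Pxy
```

Passing the same signal for x and y reduces this to a Welch power spectral density estimate, which is how welch wraps csd in scipy.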

Labels
- array types: items related to array API support and input array validation (see gh-18286)
- enhancement: a new feature or improvement
- scipy.signal