BUG: Fix underflow error in AVX512 implementation of ufunc exp/f64 #18933

seiko2plus · 2021-05-07T08:22:58Z

closes #18932, related to #18920, #18916.

seiko2plus · 2021-05-07T09:21:02Z

@seberg, CI(use_wheel (pull_request)) ufunc tests complaining about:

 >               assert_array_equal(res_num.astype("O"), res_obj)
E               AssertionError: 
E               Arrays are not equal
E               
E               Mismatched elements: 1 / 1 (100%)
E               Max absolute difference: 2.7755575615628914e-17
E               Max relative difference: 1.1489924412333298e-16
E                x: array([-0.2415644752704905], dtype=object)
E                y: array([-0.24156447527049046], dtype=object)

MyFloat    = <class 'numpy.core.tests.test_ufunc.TestUfuncGenericLoops.test_unary_PyUFunc_O_O_method_full.<locals>.MyFloat'>
num_arr    = array([0.78539816])
obj_arr    = array([0.7853981633974483], dtype=object)
res_num    = array([-0.24156448])
res_obj    = array([-0.24156447527049046], dtype=object)
self       = <numpy.core.tests.test_ufunc.TestUfuncGenericLoops object at 0x7f897731c950>
ufunc      = <ufunc 'log'>
val        = 0.7853981633974483

why we don't use assert_array_almost_equal in here:

numpy/numpy/core/tests/test_ufunc.py

Lines 177 to 178 in 6ad4650

    
           res_obj = ufunc(obj_arr) 
        
           assert_array_equal(res_num.astype("O"), res_obj)

mattip · 2021-05-07T09:22:16Z

Yes, that seems too strong a requirement.

seiko2plus · 2021-05-07T09:51:22Z

@mattip, Should I replace it with assert_array_almost_equal? I don't think this error is related to this patch.

seiko2plus · 2021-05-07T09:59:05Z

close/open for another round.

charris · 2021-05-07T13:18:58Z

LGTM. Go ahead and fix the failing test so everything starts working again.

seberg · 2021-05-07T13:49:31Z

Oh, you changed the test to almost-equal. I had problems elsewhere and changed it to 0-D, but this is just as well. EDIT: Sorry, that was unclear. That was in a work-in-progress, lets go with this! :)

(I think I may have changed 0-stride vs. contiguous-stride for 0-D, which probably can change whether the SIMD loop is used sometimes?)

seberg · 2021-05-07T14:15:42Z

Thanks for the quick fix, Sayed!

seiko2plus · 2021-05-07T14:21:13Z

@seberg,

which probably can change whether the SIMD loop is used sometimes?)

That probably what happened, but avoid using almost_equal is kinda strict especially for CPUs that don't have native support of FMA/Rounding.

r-devulap · 2021-05-07T16:09:44Z

numpy/core/src/umath/loops_exponent_log.dispatch.c.src

@@ -776,7 +776,7 @@ AVX512F_exp_DOUBLE(npy_double * op,
        nearzero_mask = _mm512_kxor(nearzero_mask, nan_mask);
        overflow_mask = _mm512_kor(overflow_mask,
                                _mm512_kxor(xmax_mask, inf_mask));
-        underflow_mask = _mm512_kor(underflow_mask, xmax_mask);
+        underflow_mask = _mm512_kor(underflow_mask, xmin_mask);


oh yikes! /me hides in shame.
Thanks @seiko2plus for fixing this.

BUG: Fix underflow error in AVX512 of ufunc exp

dc062bd

github-actions bot added the 00 - Bug label May 7, 2021

seiko2plus changed the title ~~BUG: Fix underflow error in AVX512 of ufunc exp~~ BUG: Fix underflow error in AVX512 implementation of ufunc exp/f64 May 7, 2021

seiko2plus added the component: SIMD Issues in SIMD (fast instruction sets) code or machinery label May 7, 2021

seiko2plus closed this May 7, 2021

seiko2plus reopened this May 7, 2021

TST: Use almost equal to get rid of object -> float64 comparison error

a25f1ab

seiko2plus merged commit cd73ab7 into numpy:main May 7, 2021

r-devulap reviewed May 7, 2021

View reviewed changes

h-vetinari mentioned this pull request May 25, 2021

remove obsolete test skip conda-forge/numpy-feedstock#235

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

BUG: Fix underflow error in AVX512 implementation of ufunc exp/f64 #18933

BUG: Fix underflow error in AVX512 implementation of ufunc exp/f64 #18933

seiko2plus commented May 7, 2021 •

edited

seiko2plus commented May 7, 2021 •

edited

mattip commented May 7, 2021

seiko2plus commented May 7, 2021

seiko2plus commented May 7, 2021

charris commented May 7, 2021

seberg commented May 7, 2021 •

edited

seberg commented May 7, 2021

seiko2plus commented May 7, 2021 •

edited

r-devulap May 7, 2021

BUG: Fix underflow error in AVX512 implementation of ufunc exp/f64 #18933

BUG: Fix underflow error in AVX512 implementation of ufunc exp/f64 #18933

Conversation

seiko2plus commented May 7, 2021 • edited

seiko2plus commented May 7, 2021 • edited

mattip commented May 7, 2021

seiko2plus commented May 7, 2021

seiko2plus commented May 7, 2021

charris commented May 7, 2021

seberg commented May 7, 2021 • edited

seberg commented May 7, 2021

seiko2plus commented May 7, 2021 • edited

r-devulap May 7, 2021

Choose a reason for hiding this comment

seiko2plus commented May 7, 2021 •

edited

seiko2plus commented May 7, 2021 •

edited

seberg commented May 7, 2021 •

edited

seiko2plus commented May 7, 2021 •

edited