Add zero-checks to axpy-like operations #1573

upsj · 2024-03-18T02:11:21Z

Operations like $\alpha A x + 0 y$ may propagate NaNs from y to the output despite the 0 coefficient. This can be avoided by checking the beta scaling factors for zero explicitly.

TODO:

add reference tests for matrices
add generic tests for Dense

This prevents NaNs from polluting the output

fritzgoebel

LGTM

MarcelKoch

lgtm in general. Maybe there are still some static_cast missing to make it compile.

MarcelKoch · 2024-04-08T11:38:54Z

common/cuda_hip/matrix/csr_kernels.hpp.inc

+        [&beta_val](const type& x) {
+            return is_zero(beta_val) ? zero(beta_val) : beta_val * x;
+        });


minor performance comment, you could try switching the lambda and zero check, i.e.

is_zero(beta) ? [&beta_val](const type& x) { return zero(beta); } : [&beta_val](const type& x) { return beta_val * x; }

But this might not work, since the two branches of the ?: operator have different types. And it might increase compile times, since it might compile the kernel two times

MarcelKoch · 2024-04-08T12:12:20Z

reference/matrix/ell_kernels.cpp

-            arithmetic_type result = c->at(row, j);
-            result *= beta_val;
+            arithmetic_type result =
+                is_zero(beta_val) ? zero(beta_val) : beta_val * c->at(row, j);


Suggested change

is_zero(beta_val) ? zero(beta_val) : beta_val * c->at(row, j);

is_zero(beta_val) ? zero(beta_val) : beta_val * static_cast<arithmetic_type>(c->at(row, j));

to make it compile

yhmtsai · 2024-04-15T07:03:53Z

Is there any reason to avoid propagation of NaN? If there's no performance penalty, I think propagation of NaN is easier to know the algorithm does not work out due to some arthmetic error.

MarcelKoch · 2024-04-15T08:07:43Z

@yhmtsai if you are computing for example y = 0.5 * A * x + 0.0 * y then propagating NaNs from y is unnecessary, since it's mathematical equivalent to just leaving y out. This can easily happen, if y is not initialized before that computation.

yhmtsai · 2024-04-15T08:19:13Z

no, 0 * NaN should be NaN not zero, so it is not mathimatical equality by just leaving them out.
Yes, it might happen in unitialized memory, but I would say it should be properly initialized or using the call without touching the unitialization put. (for us, we may proparbably provide A->apply(alpha, x, y) for y = alpha * A * x)

yhmtsai · 2024-04-15T08:23:25Z

I know current vendor library usually treat 0 as do not touch due to BLAS.
I am not sure the other routines hold the same rule

upsj · 2024-04-15T11:20:35Z

0 * NaN should be NaN not zero

that makes calculations more fragile, we already do similar things (special cases) for zeros inside our solver kernels

MarcelKoch · 2024-04-15T11:49:52Z

@yhmtsai we treat 0 * NaN = 0 only in the case of axpy-like operation. So there will still be NaN propagation in normal SpMV, dot-products, etc. But for these axpy-like operations, users will not care about the IEEE standard. For them, initializing x by setting it to 0 and multiplying it with 0 should be equivalent.

add zero-checks to axpy-like operations

f7c8214

This prevents NaNs from polluting the output

upsj self-assigned this Mar 18, 2024

fritzgoebel approved these changes Mar 25, 2024

View reviewed changes

MarcelKoch self-requested a review April 5, 2024 09:27

yhmtsai self-requested a review April 5, 2024 09:33

MarcelKoch added this to the Ginkgo 1.8.0 milestone Apr 5, 2024

MarcelKoch approved these changes Apr 8, 2024

View reviewed changes

tcojean modified the milestones: Ginkgo 1.8.0, Ginkgo 1.9.0 May 31, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add zero-checks to axpy-like operations #1573

Add zero-checks to axpy-like operations #1573

upsj commented Mar 18, 2024

fritzgoebel left a comment

MarcelKoch left a comment

MarcelKoch Apr 8, 2024

MarcelKoch Apr 8, 2024

yhmtsai commented Apr 15, 2024

MarcelKoch commented Apr 15, 2024

yhmtsai commented Apr 15, 2024

yhmtsai commented Apr 15, 2024

upsj commented Apr 15, 2024

MarcelKoch commented Apr 15, 2024

	is_zero(beta_val) ? zero(beta_val) : beta_val * c->at(row, j);
	is_zero(beta_val) ? zero(beta_val) : beta_val * static_cast<arithmetic_type>(c->at(row, j));

Add zero-checks to axpy-like operations #1573

Are you sure you want to change the base?

Add zero-checks to axpy-like operations #1573

Conversation

upsj commented Mar 18, 2024

fritzgoebel left a comment

Choose a reason for hiding this comment

MarcelKoch left a comment

Choose a reason for hiding this comment

MarcelKoch Apr 8, 2024

Choose a reason for hiding this comment

MarcelKoch Apr 8, 2024

Choose a reason for hiding this comment

yhmtsai commented Apr 15, 2024

MarcelKoch commented Apr 15, 2024

yhmtsai commented Apr 15, 2024

yhmtsai commented Apr 15, 2024

upsj commented Apr 15, 2024

MarcelKoch commented Apr 15, 2024