Use stream in mul_add if given and allocator in subset_sum #438

vincefn · 2024-05-05T18:09:23Z

Hi Andreas,

Somehow gpuarray's mul_add did not use the supplied stream parameter. Similarly, the allocator parameter of subset_sum was not used.

This PR should fix that, and also adds an optional out parameter to mul_add.

subset_sum(): use allocator if given.
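For readers without the diff at hand, here is a minimal pure-Python sketch of the pattern described above: forward the caller's stream to the kernel launch, and write into an optional out array instead of always allocating a new one. FakeArray, _launch_axpby, and launches are hypothetical stand-ins invented for illustration; this is not PyCUDA's implementation and nothing here touches a GPU.

```python
launches = []  # records the stream passed to every simulated kernel launch


def _launch_axpby(a, x, b, y, z, stream=None):
    """Hypothetical kernel launch: z = a*x + b*y, remembering the stream."""
    launches.append(stream)
    z.data[:] = [a * xi + b * yi for xi, yi in zip(x.data, y.data)]


class FakeArray:
    """Hypothetical stand-in for a GPU array; holds a plain Python list."""

    def __init__(self, data):
        self.data = list(data)

    def _new_like_me(self):
        return FakeArray([0] * len(self.data))

    def mul_add(self, selffac, other, otherfac, stream=None, out=None):
        """Compute selffac*self + otherfac*other, writing into out if given."""
        if out is None:
            out = self._new_like_me()
        # Forward the caller's stream instead of silently dropping it.
        _launch_axpby(selffac, self, otherfac, other, out, stream=stream)
        return out


x = FakeArray([1, 2, 3])
y = FakeArray([10, 20, 30])
dest = FakeArray([0, 0, 0])
result = x.mul_add(2, y, 3, stream="s0", out=dest)
```

In this sketch result is the same object as dest, so a caller can reuse a preallocated buffer across calls, and the recorded stream shows that the argument reached the launch.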

inducer

Thanks! A couple nits, then this should be good to go.

inducer · 2024-05-06T00:29:53Z

pycuda/gpuarray.py

@@ -2087,7 +2087,8 @@ def subset_sum(subset, a, dtype=None, stream=None, allocator=None):
    from pycuda.reduction import get_subset_sum_kernel

    krnl = get_subset_sum_kernel(dtype, subset.dtype, a.dtype)
-    return krnl(subset, a, stream=stream)
+    return krnl(subset, a, stream=stream,
+                allocator=drv.mem_alloc if allocator is None else allocator)


This should try to get the allocator off of one of the two arrays.

vincefn · 2024-05-06 (edited)

It could use a's allocator.

On the other hand, the other reduction functions (sum, all, any, ...) all just pass allocator to the kernel, even if it's None. So maybe we should do that here for consistency.

(I actually added the if allocator is None... by mistake: the kernel call works with allocator=None, and only some functions like to_gpu need a real allocator.)

vincefn · 2024-05-07

Hi @inducer - I've pushed a87b22f, which gives the same behavior as the other functions (just pass the function's allocator parameter, even if None).
Let me know if you'd rather use the first array's allocator, but in that case the other functions may need to be changed as well.

inducer · 2024-05-08

That sounds good. Thanks!
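The option settled on in this thread (pass the allocator argument through unchanged, even if None, and let the kernel decide) can be sketched in plain Python. Every name below (default_alloc, RecordingAllocator, fake_reduction_kernel, subset_sum_sketch) is a hypothetical stand-in, and the fallback inside the fake kernel is an assumption made for illustration rather than a description of what PyCUDA's reduction kernel does.

```python
def default_alloc(nbytes):
    """Hypothetical default allocator (stand-in for a driver-level allocation)."""
    return bytearray(nbytes)


class RecordingAllocator:
    """Allocator that counts its calls so a test can check it was used."""

    def __init__(self):
        self.calls = 0

    def __call__(self, nbytes):
        self.calls += 1
        return bytearray(nbytes)


def fake_reduction_kernel(values, stream=None, allocator=None):
    """Hypothetical kernel: accepts allocator=None and picks its own default."""
    alloc = default_alloc if allocator is None else allocator
    result_buffer = alloc(8)  # storage for the scalar result
    return sum(values), result_buffer


def subset_sum_sketch(subset, a, stream=None, allocator=None):
    """Sum a[i] for i in subset, forwarding allocator unchanged (even if None)."""
    picked = [a[i] for i in subset]
    return fake_reduction_kernel(picked, stream=stream, allocator=allocator)


rec = RecordingAllocator()
total, _ = subset_sum_sketch([0, 2], [5, 6, 7], allocator=rec)
total_default, buf = subset_sum_sketch([1], [5, 6, 7])
```

Keeping the None check in one place (the kernel) means the wrapper functions stay uniform, which is the consistency argument made above. A counting allocator like rec is also one way a test can assert that a supplied allocator was really used.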


vincefn added 2 commits May 5, 2024 19:28

mul_add(): use stream if given, add an optional destination array.

1b85fd7

subset_sum(): use allocator if given.

Remove unused import, add whitespace to make flake8 happy

846affb

inducer force-pushed the main branch from 479a030 to 846affb May 6, 2024 00:28

inducer reviewed May 6, 2024

vincefn and others added 3 commits May 6, 2024 10:27

Update pycuda/gpuarray.py

1cc39be

Co-authored-by: Andreas Klöckner <inform@tiker.net>

test_subset_sum: also assert if allocator is used

88acc4e

subset_sum: use given allocator value (even if None) like other functions

a87b22f

inducer approved these changes May 8, 2024

inducer merged commit 8aa0766 into inducer:main May 8, 2024
1 check passed
