Fixes to the indexed reduce function for the cpu, opencl and cuda back ends #3658

christophe-murphy · 2025-06-03T20:41:46Z

These back ends incorrectly assumed a linear array when finding the max/min value and its index. This gives wrong results when the input is a sub-array.

Description

For the cuda and opencl back ends, the problem was confined to the CPU fall-back methods, which are used when the total number of elements in the array is less than or equal to 4096. The oneapi back end was unaffected because it has no such fall-back method. The fix for the cuda and opencl back ends is to use the fall-back method only for linear arrays, which avoids a complex sub-array copy from device to host memory.
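To illustrate why a linear assumption breaks on a sub-array, here is a minimal, self-contained sketch. It does not use ArrayFire's internal types: `View2D`, `is_linear`, `imax_linear` and `imax_strided` are hypothetical names invented for this example, and the sketch assumes a column-major parent buffer in which a view is described by an offset, dimensions and strides. It also assumes the naive scan starts at the view's offset and reads contiguous elements, which may differ from what the original code did.

```cpp
#include <cassert>
#include <cstddef>
#include <utility>
#include <vector>

// Hypothetical 2-D view into a column-major parent buffer
// (illustration only, not an ArrayFire type).
struct View2D {
    const float* data;       // start of the parent buffer
    std::size_t offset;      // element offset of view(0,0) in the parent
    std::size_t dims[2];     // rows, cols of the view
    std::size_t strides[2];  // element strides of the view in the parent
};

// A view is linear when its elements are contiguous in memory.
bool is_linear(const View2D& v) {
    return v.strides[0] == 1 && v.strides[1] == v.dims[0];
}

// Naive scan: reads dims[0]*dims[1] contiguous elements and ignores strides.
// Only valid when is_linear(v) is true.
std::pair<float, std::size_t> imax_linear(const View2D& v) {
    const std::size_t n = v.dims[0] * v.dims[1];
    assert(n > 0);
    const float* p = v.data + v.offset;
    float best = p[0];
    std::size_t best_idx = 0;
    for (std::size_t i = 1; i < n; ++i) {
        if (p[i] > best) { best = p[i]; best_idx = i; }
    }
    return {best, best_idx};
}

// Stride-aware scan: visits exactly the view's elements and reports the
// column-major index relative to the view, not to the parent.
std::pair<float, std::size_t> imax_strided(const View2D& v) {
    assert(v.dims[0] * v.dims[1] > 0);
    const float* p = v.data + v.offset;
    float best = p[0];
    std::size_t best_idx = 0;
    for (std::size_t c = 0; c < v.dims[1]; ++c) {
        for (std::size_t r = 0; r < v.dims[0]; ++r) {
            const float x = p[r * v.strides[0] + c * v.strides[1]];
            const std::size_t idx = r + c * v.dims[0];
            if (x > best) { best = x; best_idx = idx; }
        }
    }
    return {best, best_idx};
}
```

For a 4x4 parent holding the values 0 to 15 with element 7 overwritten by 100, the 2x2 view covering rows 1 to 2 and columns 1 to 2 has offset 5 and strides {1, 4}. The naive scan picks up the 100 that lies outside the view, while the stride-aware scan finds the true maximum 10 at view index 3. Guarding the fast path with a check like `is_linear` mirrors the approach described above of using the fall-back only for linear arrays.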

Changes to Users

The af_imax_all / af_imin_all functions now work for sub-arrays.

Checklist

Rebased on latest master
Code compiles
Tests pass

…k ends. These back ends were incorrectly assuming a linear array. In the case of the cuda and opencl back end this was just for the cpu-fallback methods which are used when the total number of elements in the array is less than or equal to 4096.

src/backend/cpu/ireduce.cpp

Added an eval() to the input array on the CPU back end for the ireduce method to ensure that the array has been evaluated before reducing.

christophe-murphy linked an issue Jun 3, 2025 that may be closed by this pull request

[BUG] af::max of subarray returns wrong location #3656

Closed

christophe-murphy added this to the 3.10 milestone Jun 3, 2025

syurkevi previously approved these changes Jun 3, 2025

willyborn reviewed Jun 9, 2025

src/backend/cpu/ireduce.cpp (review thread resolved)

Ensure array is evaluated before reducing

3a081e9

christophe-murphy dismissed syurkevi’s stale review via 3a081e9 June 13, 2025 16:43

syurkevi approved these changes Jun 24, 2025

christophe-murphy merged commit 0e8a690 into master Jun 25, 2025
1 of 4 checks passed
