Added support for multidimensional array indexing #8491

kc611 · 2022-10-05T12:20:52Z

This PR builds on top of #8238

As titled, this PR adds support for multidimensional indices while indexing NumPy arrays in Numba JIT functions:

import numba
import numpy as np

@numba.njit
def func(x, idx):
    return x[:, 2, idx, :, :]

a = np.random.randint(0, 100, (10, 11, 12, 13, 14))
b = np.random.randint(0, 10, (4, 5)) # Previously, these were limited to 1-D.

print(func(a, b).shape) 
# (10, 4, 5, 13, 14)
print(np.allclose(func(a, b), func.py_func(a, b)))
# True

kc611 · 2022-11-07T17:53:42Z

The memory leak I was facing in this issue has been isolated:

import numpy as np
import unittest
from numba import njit
from numba.tests.support import MemoryLeakMixin, TestCase

class TestFancyIndexingMultiDim(MemoryLeakMixin, TestCase):
    shape = (4, 5, 6)
    
    def check_setitem_indices(self, arr_shape, index):
        @njit     
        def set_item(array, idx, item):
            array[idx] = item

        arr = np.random.randint(0, 11, size=arr_shape)
        src = arr[index]
        expected = np.zeros_like(arr)
        got = np.zeros_like(arr)

        set_item.py_func(expected, index, src)
        set_item(got, index, src)

        np.testing.assert_equal(got, expected)

    def test_setitem_with_tuple(self):
        # When index is an array within a tuple
        idx = (np.array([1,2,3]),)
        # No memory leak
        self.check_setitem_indices(self.shape, idx)

    def test_setitem_without_tuple(self):
        # When index itself is an array, it is
        # supposed to be implicitly cast to same form as
        # the example above i.e. both should and do 
        # give same answer, except for the leak
        idx = np.array([1,2,3])
        # Memory leaks
        self.check_setitem_indices(self.shape, idx)
    
if __name__ == '__main__':
    unittest.main()

This I presume is a consequence of a fix for memory leak that was happening earlier and was 'fixed' by having the following decrefs in place

https://github.com/numba/numba/pull/8491/files#diff-d0aefd08783016c4e2a19e29d4725c7870806ea2aa1ab4caa6e8f0adc08c6e64R1717-R1721

I heavily suspect that for some reason these decrefs aren't being triggered in the particular case given above.

Co-authored-by: stuartarchibald <stuartarchibald@users.noreply.github.com>

…ng pass

stuartarchibald · 2023-03-17T16:28:15Z

@kc611 please could you resolve conflicts against main? As #8238 is merged the diff will hopefully shrink. Many thanks.

stuartarchibald

Thanks for the PR @kc611, great to see this implemented. I've given this an initial review and have also done some initial manual testing. Once the comments are address I'll look at the implementation details more closely, but from a cursory inspection the approach seems like it should work. Thanks again!

numba/core/typing/arraydecl.py

numba/np/arrayobj.py

stuartarchibald · 2023-04-18T10:03:59Z

numba/np/arrayobj.py

-    # is being accessed, during setitem they are used as source
-    # indices
-    counts = list(counts)
+    if src_type == 'buffer':


I don't think that this path in its present form can be supported on CUDA as the flat_imp* function compiled below relies on:

NumPy functions

CPU only implementations for reshape

returns an array

needs the NRT

I'm not sure of the best way to "detect" this, immediate thoughts are to either require the NRT and raise if the context doesn't have it, or alternatively declare these functions as overloads targetting CPU only which will then fail for CUDA users. CC @gmarkall do you have an opinion on this?

stuartarchibald · 2023-04-18T10:14:11Z

numba/np/arrayobj.py

+            context.nrt.decref(builder, _indexer.idxty,
+                               _indexer.idxary_instr)


As per a later comment, this requires the presence of the NRT which isn't guaranteed and needs guarding against.

Resolving this can be postponed as the 3 points in #8491 (review) will cover it.

stuartarchibald · 2023-04-18T10:22:51Z

numba/np/arrayobj.py

@@ -1658,25 +1684,8 @@ def fancy_setslice(context, builder, sig, args, index_types, indices):
            msg = "cannot assign slice from input of different size"
            context.call_conv.return_user_exc(builder, ValueError, (msg,))

-        # Check for array overlap
-        src_start, src_end = get_array_memory_extents(context, builder, srcty,


Given this check is removed, does the proposed implementation correctly handle "overlap"?

Define overlap in context of arrays ?

I think in this case it would be if the source and destination share any of the same memory location.

With the current implementation, I don't think that's possible since fancy indexing always creates a copy of the data. i.e. In fancy indexing the source and destination array never share memory. This is stated in the NumPy documentation as well.

I'm in-fact curious as of why it was the case over here that they did.

I suppose the question is... in Numba, what if in practice they do share memory? Assessing and resolving this can be deferred to a subsequent PR. @DrTodd13 was asking about functions for assessing whether arrays overlap at the Numba public meeting last week. The associated functions and algorithms in NumPy look like they are something Numba could support, but are out of scope for implementing in this PR.

stuartarchibald · 2023-04-18T10:29:23Z

numba/tests/test_fancy_indexing.py

It would be good for the testing to now also include indexing into F-order and A-order arrays?

Alright will add that.

Co-authored-by: stuartarchibald <stuartarchibald@users.noreply.github.com>

sklam · 2023-04-26T15:27:29Z

numba/np/arrayobj.py

+            self.idxty = idxty
+            self.idxary = idxary
+
+        assert self.idxty.ndim == 1, <message>


…nt for buffer type indices

stuartarchibald

Thanks for the updates @kc611. I've given them a review and I think there's a few things to be resolved but otherwise looks good. The unit tests that are failing CI are failing due to the use of compile_isolated and Flags(). To fix, this:

numba/numba/tests/test_indexing.py

Lines 15 to 16 in 74cb715

    
           enable_pyobj_flags = Flags() 
        
           enable_pyobj_flags.enable_pyobject = True

could have:

enable_pyobj_flags.nrt = True

adding. Or alternatively, as object mode fallback is deprecated, forced object mode testing could be achieved by using alternative flags such as those available through importing these from testing support:

numba/numba/tests/support.py

Lines 70 to 71 in 74cb715

    
           force_pyobj_flags = Flags() 
        
           force_pyobj_flags.force_pyobject = True

Hope this helps?

docs/source/reference/numpysupported.rst

numba/core/typing/arraydecl.py

numba/np/arrayobj.py

stuartarchibald · 2023-05-01T16:36:11Z

numba/np/arrayobj.py

+            res = context.compile_internal(builder, flat_imp, sig,
+                                           (idxary._getvalue(),))


I think the second part of this still needs doing, i.e. move the "flat" implementations to module level so as to make use of the compilation cache.

numba/np/arrayobj.py

stuartarchibald · 2023-05-01T16:40:28Z

numba/np/arrayobj.py

+                self.context.typing_context, {idxty}, {},
+            )
+            impl = self.context.get_function(fnop, callsig)
+            res = impl(self.builder, (idxary._getvalue(),))
            self.idxty = retty


To be consistent, I think retty should be derived from the callsig, specifically, it should be callsig.return_type?

stuartarchibald · 2023-05-01T16:51:43Z

numba/tests/test_fancy_indexing.py

@@ -306,29 +306,36 @@ class TestFancyIndexingMultiDim(MemoryLeakMixin, TestCase):
    shape = (5, 6, 7, 8, 9, 10)
    indexing_cases = [
        # Slices + Integers
-        (slice(4, 5), 3, np.array([0,1,3,4,2]), 1),
+        (slice(4, 5), 3, np.array([0, 1, 3, 4, 2]), 1),


Please could this be done throughout? Numba uses spaces after commas in this context.

numba/core/typing/arraydecl.py

numba/np/arrayobj.py

Co-authored-by: stuartarchibald <stuartarchibald@users.noreply.github.com>

stuartarchibald

Thanks for the updates @kc611, few minor things to resolve now else looks good. OOB I spoke with @gmarkall about the impact of the use of the NRT on the CUDA target. The conclusion is as follows:

It's probably best to concentrate on getting this and the next related PR ([WIP] Advanced Indexing #3: Added support for multiple multidimensional Indices #8912) merged as the implementation needs to be figured out to start with and it's almost certainly easier to do this on the CPU with the NRT present.
Go back to e.g. the release0.57 branch and see what sort of "fancy indexing" actually worked on CUDA, write tests for this.
Get the tests in 2. working on main once these PRs are merged. I suspect it'll only be the case where the index is a non-contiguous array that's a problem as the copy is currently needed as part of "flattening" it. Once we have the tests I expect the options and limitations will become more clear.

numba/tests/test_fancy_indexing.py

numba/tests/test_indexing.py

gmarkall · 2023-05-03T16:03:22Z

gpuci run tests

Co-authored-by: stuartarchibald <stuartarchibald@users.noreply.github.com>

stuartarchibald

Many thanks for all your efforts on this @kc611. It's great to see this feature implemented. There's a few outstanding issues that I've left as a "review", but these should be captured in a new issue for completion in subsequent work. Most of the outstanding items relate to use of the Numba runtime which is going to take careful assessment. As alluded to previously, this will manifest in having to write a number of tests for the CUDA target to ensure that no regressions are introduced. Thanks again for work on this challenging implementation and feature, patch is approved!

stuartarchibald · 2023-05-04T15:14:08Z

numba/np/arrayobj.py

+            if not context.enable_nrt:
+                raise NotImplementedError("This type of indexing is not"
+                                          " currently supported for"
+                                          " given compiler target.")


Ideally this would be tested, but I'm inclined to leave this to a subsequent PR as I am relatively sure it's going to need to be moved.

stuartarchibald · 2023-05-04T15:14:30Z

numba/np/arrayobj.py

+            raise NotImplementedError("This type of indexing is not currently"
+                                      " supported for given compiler target.")


As above, ideally this would be tested, but I'm inclined to leave this to a subsequent PR as I am relatively sure it's going to need to be moved.

stuartarchibald · 2023-05-04T15:15:48Z

numba/np/arrayobj.py

+            context.nrt.decref(builder, _indexer.idxty,
+                               _indexer.idxary_instr)


Resolving this can be postponed as the 3 points in #8491 (review) will cover it.

stuartarchibald · 2023-05-04T15:20:29Z

numba/np/arrayobj.py

@@ -1658,25 +1684,8 @@ def fancy_setslice(context, builder, sig, args, index_types, indices):
            msg = "cannot assign slice from input of different size"
            context.call_conv.return_user_exc(builder, ValueError, (msg,))

-        # Check for array overlap
-        src_start, src_end = get_array_memory_extents(context, builder, srcty,


I suppose the question is... in Numba, what if in practice they do share memory? Assessing and resolving this can be deferred to a subsequent PR. @DrTodd13 was asking about functions for assessing whether arrays overlap at the Numba public meeting last week. The associated functions and algorithms in NumPy look like they are something Numba could support, but are out of scope for implementing in this PR.

stuartarchibald · 2023-05-04T15:21:47Z

numba/np/arrayobj.py


-    # Cast to the destination dtype (cross-dtype slice assignment is allowed)
-    val = context.cast(builder, val, src_dtype, aryty.dtype)
+        flat_imp = njit(flat_imp)


This needs moving to a module global in a subsequent PR.

stuartarchibald · 2023-05-04T15:29:08Z

numba/np/arrayobj.py

+        for _indexer in indexer.indexers:
+            if isinstance(_indexer, IntegerArrayIndexer) \
+               and hasattr(_indexer, "idxary_instr"):
+                context.nrt.decref(builder, _indexer.idxty,


Use of NRT needs assessing, defer to later PR.

stuartarchibald · 2023-05-04T15:33:15Z

numba/tests/test_fancy_indexing.py

+        (Ellipsis, 1, np.array([0, 1, 3, 4, 2], order='A'), 3, slice(1, 5)),
+        (np.array([0, 1, 3, 4, 2], order='F'), 3, Ellipsis, slice(1, 5)),
+        (np.array([[0, 1, 3, 4, 2], [0, 1, 2, 3, 2], [3, 1, 3, 4, 1]], order='A'),


Setting order='A' in the NumPy constructor won't have the effect of it being A ordered within Numba's type system (assuming that was the intent?) Suggest deferring fixing this to a later PR.

kc611 requested review from sklam and stuartarchibald as code owners October 5, 2022 12:20

sklam added the 2 - In Progress label Oct 6, 2022

kc611 added 3 - Ready for Review and removed 2 - In Progress labels Oct 24, 2022

stuartarchibald assigned gmarkall, stuartarchibald and sklam Oct 25, 2022

kc611 and others added 11 commits November 7, 2022 23:26

Refactored Fancy Index selection and tests

4e4edc6

Fixed Error message in tests

eca602f

Apply suggestions from code review

f2ca8bb

Co-authored-by: stuartarchibald <stuartarchibald@users.noreply.github.com>

Readded the old test cases and refactored the index generation methods

415f7a7

Fixed boolean testing

ed8b600

Removed whitespaces

1d680f1

Added new tests as a separate class

1aee4d8

Corrected flake8 violations

f8a2e82

Added support for multidimensional array indexing

b81dc0b

Fixed setitem logic to accomodate multidimensional arrays

3297fef

Added fix for memory leak in setitem

1ee0d99

kc611 force-pushed the adv_idx_2 branch from 8ae2124 to 1ee0d99 Compare November 7, 2022 17:58

kc611 mentioned this pull request Nov 22, 2022

Meta Issue: Support for Advanced/Fancy Indexing in Numba #8616

Open

5 tasks

kc611 added 5 commits January 26, 2023 11:09

Fix for memory leak: Added metadata to ignore unexpected nrt refpruni…

cdc663e

…ng pass

Fixed setiitem logic for scalar arrays

cee23af

Fixed sequence type handling in fancy setslice

f0b0713

Blocked implicit multidimensional boolean indexing

479703b

Merge remote-tracking branch 'upstream/main' into adv_idx_2

29f0fab

stuartarchibald added 4 - Waiting on author Waiting for author to respond to review and removed 3 - Ready for Review labels Mar 17, 2023

Merge remote-tracking branch 'upstream/main' into adv_idx_2

b1065fc

stuartarchibald reviewed Apr 18, 2023

View reviewed changes

stuartarchibald added the Effort - long Long size effort needed label Apr 18, 2023

stuartarchibald added this to the Numba 0.58 RC milestone Apr 18, 2023

kc611 mentioned this pull request Apr 22, 2023

[WIP] Advanced Indexing #3: Added support for multiple multidimensional Indices #8912

Closed

Apply suggestions from code review

8a2922d

Co-authored-by: stuartarchibald <stuartarchibald@users.noreply.github.com>

sklam reviewed Apr 26, 2023

View reviewed changes

numba/np/arrayobj.py Outdated

self.idxty = idxty

self.idxary = idxary

assert self.idxty.ndim == 1, <message>

Copy link

Member

sklam Apr 26, 2023

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Bad syntax

kc611 added 2 commits May 1, 2023 17:38

Fixed error messages

fdd36ba

Added testing for differently ordered arrays and made NRT a requireme…

5f90155

…nt for buffer type indices

stuartarchibald reviewed May 1, 2023

View reviewed changes

kc611 and others added 6 commits May 2, 2023 16:08

Apply suggestions from code review

d8c4c09

Co-authored-by: stuartarchibald <stuartarchibald@users.noreply.github.com>

Apply suggestions from code review

69d5958

Co-authored-by: stuartarchibald <stuartarchibald@users.noreply.github.com>

Addressed review comments

e482c91

Localize function definitions for flattening functions in fancy indexing

251f97d

Removed compile isolated API from indexing tests

50238c0

Corrected flags in indexing tests

172c21a

stuartarchibald reviewed May 3, 2023

View reviewed changes

kc611 and others added 3 commits May 4, 2023 14:40

Removed unnnecessary double testing

fc6d98f

Apply suggestions from code review

33df606

Co-authored-by: stuartarchibald <stuartarchibald@users.noreply.github.com>

Removed dead imports

6b6e364

stuartarchibald approved these changes May 4, 2023

View reviewed changes

stuartarchibald added 5 - Ready to merge Review and testing done, is ready to merge and removed 4 - Waiting on author Waiting for author to respond to review labels May 4, 2023

sklam mentioned this pull request May 4, 2023

error from getitem with optional array #7903

Open

2 tasks

kc611 mentioned this pull request May 4, 2023

Checklist of deferred items from Advanced Indexing PRs #8941

Open

4 tasks

sklam changed the base branch from main to fea/adv_indexing May 9, 2023 16:51

sklam merged commit 5c70e21 into numba:fea/adv_indexing May 9, 2023
21 checks passed

esc mentioned this pull request Jun 6, 2023

Indexing subspace regression #8999

Closed

2 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Added support for multidimensional array indexing #8491

Added support for multidimensional array indexing #8491

kc611 commented Oct 5, 2022

kc611 commented Nov 7, 2022 •

edited

Loading

stuartarchibald commented Mar 17, 2023

stuartarchibald left a comment

stuartarchibald Apr 18, 2023

stuartarchibald Apr 18, 2023

stuartarchibald May 4, 2023

stuartarchibald Apr 18, 2023

kc611 Apr 24, 2023

stuartarchibald Apr 26, 2023

kc611 May 1, 2023

stuartarchibald May 4, 2023

stuartarchibald Apr 18, 2023

kc611 Apr 24, 2023

sklam Apr 26, 2023

stuartarchibald left a comment

stuartarchibald May 1, 2023

stuartarchibald May 1, 2023

stuartarchibald May 1, 2023

stuartarchibald left a comment

gmarkall commented May 3, 2023

stuartarchibald left a comment

stuartarchibald May 4, 2023

stuartarchibald May 4, 2023

stuartarchibald May 4, 2023

stuartarchibald May 4, 2023

stuartarchibald May 4, 2023

stuartarchibald May 4, 2023

stuartarchibald May 4, 2023

		context.nrt.decref(builder, _indexer.idxty,
		_indexer.idxary_instr)

	enable_pyobj_flags = Flags()
	enable_pyobj_flags.enable_pyobject = True

	force_pyobj_flags = Flags()
	force_pyobj_flags.force_pyobject = True

		res = context.compile_internal(builder, flat_imp, sig,
		(idxary._getvalue(),))

		raise NotImplementedError("This type of indexing is not currently"
		" supported for given compiler target.")

Added support for multidimensional array indexing #8491

Added support for multidimensional array indexing #8491

Conversation

kc611 commented Oct 5, 2022

kc611 commented Nov 7, 2022 • edited Loading

stuartarchibald commented Mar 17, 2023

stuartarchibald left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

stuartarchibald left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

stuartarchibald left a comment

Choose a reason for hiding this comment

gmarkall commented May 3, 2023

stuartarchibald left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

kc611 commented Nov 7, 2022 •

edited

Loading