Implement `array` for sequence types #615

IvanIsCoding · 2022-05-22T01:16:40Z

Fixes #614

Implements an explicit conversion from our custom types to Numpy arrays

This a quick fix to support converting using np.array and np.asarray using the API specified by NumPy. The solution is short and not too hard to maintain, the only burden will be remembering to add py_convert_to_py_array_obj_impl! for each new sequence return type we create.

Tasks:

Return 1d array of objects intead of raising not implemented error for some types
Handle dtype argument
Test things more thoroughly

coveralls · 2022-05-22T01:41:39Z

Pull Request Test Coverage Report for Build 2521772788

30 of 31 (96.77%) changed or added relevant lines in 1 file are covered.
No unchanged relevant lines lost coverage.
Overall coverage increased (+0.02%) to 97.171%

Changes Missing Coverage	Covered Lines	Changed/Added Lines	%
src/iterators.rs	30	31	96.77%

Totals
Change from base Build 2494694599:	0.02%
Covered Lines:	12674
Relevant Lines:	13043

💛 - Coveralls

georgios-ts

I'd rather see this fixed in pyo3 since numpy can automatically handle nested sequence objects but in general the approach here looks good.

georgios-ts · 2022-05-23T10:33:56Z

src/iterators.rs

+    ($($t:ty)*) => ($(
+        impl PyConvertToPyArray for Vec<$t> {
+            fn convert_to_pyarray(&self, py: Python) -> PyResult<PyObject> {
+                Ok(self.clone().to_pyarray(py).into())


Is there any downside of using IntoPyArray which doesn't allocate more memory? https://docs.rs/numpy/latest/numpy/array/struct.PyArray.html#memory-location

Suggested change

Ok(self.clone().to_pyarray(py).into())

Ok(self.clone().into_pyarray(py).into())

Yeah it depends on what we want to do here, if we want to return a copy of the inner data structure I would say lets drop the clone() and just call to_pyarray(). That will copy the data into a python/numpy allocated array so numpy will have full read/write on it. If we want the fastest return but at the cost of having some function restricted on the array we should drop the clone() and just do into_pyarray(py). I'm not sure what is more expected from numpy's __array__ I'm thinking probably the to_pyarray() path so it's an array copy of the data. But either way I don't think we need a clone() here

Actually, talking this over with @jakelishman I think we should just do self.into_pyarray(py).into() here and return a numpy view directly into the buffer without copying. If the numpy function requires writing inplace or something they can use the copy() method explicitly.

We can not drop clone() if we use IntoPyArray since it needs to take ownership of the vector but __array__ receives a ref &self (and pyo3 doesn't allow to receive self and for a good reason since you can not move data from Python).

At the C API level of Numpy, you can have an array wrap a raw pointer, and tell Numpy that some other Python-managed object owns the lifetime of that array (so it won't attempt to free the memory). You can also set flags in Numpy to make an array non-writeable. I guess it likely would need unsafe Rust to achieve that from an &self borrow because you can't tell the Rust compiler about Python/Numpy's guarantees, but that behaviour (with appropriate flags set) is sort-of compatible with an &self borrow, if you use the base attribute to tie the lifetime of the array to the lifetime of the underlying Rust object.

Yeah, the underlying trait method into_pyarray (which is part of the IntoPyArray @georgios-ts linked to) is doing exactly that for us. It uses an unsafe block to get the pointer: https://github.com/PyO3/rust-numpy/blob/19bfc9d2d0e72bb3ed30c08244ee81abd02d7386/src/convert.rs#L67

I think that's missing the lifetime/read-only ties, though, since it takes self not &mut self/&self.

@jakelishman Yeah, this is indeed possible and rust-numpy provides the unsafe borrow_from_array (which results in a writeable array but you can't resize it) so we can avoid cloning the data.

I switched to using into_pyarray

…y-fix

mtreinish

This LGTM, I agree with @georgios-ts having a fix in PyO3 would be the better path forward here. But having this in place now will avoid us from blocking the 0.12 release on this issue. We can revisit if we want this after the PYO3 issue is fixed and included in a release (I meant to try and tackle it but haven't had the time lately). The only thing I was on the fence about is having a release note to document we natively implement __array__ now on the custom return types. But I think I'm ok without it as I don't think most users will notice vs older releases where having the full sequence protocol implemented implicitly exposed an array form.

coveralls · 2022-08-09T17:24:16Z

Pull Request Test Coverage Report for Build 2826944969

30 of 31 (96.77%) changed or added relevant lines in 1 file are covered.
No unchanged relevant lines lost coverage.
Overall coverage increased (+0.02%) to 97.102%

Changes Missing Coverage	Covered Lines	Changed/Added Lines	%
src/iterators.rs	30	31	96.77%

Totals
Change from base Build 2806523038:	0.02%
Covered Lines:	12498
Relevant Lines:	12871

💛 - Coveralls

IvanIsCoding added 3 commits May 21, 2022 17:05

Add __array__ method that returns empty array

629ced6

Make __array__ method work for trivial cases

e48a42b

Add conversion for (usize, usize)

7703a87

IvanIsCoding requested a review from mtreinish May 22, 2022 01:16

Change name of the test

fe442ba

Add support for dtype and 1d array of objects

11aa9da

IvanIsCoding changed the title ~~[DRAFT] Implement __array__ for sequence types~~ Implement __array__ for sequence types May 22, 2022

georgios-ts reviewed May 23, 2022

View reviewed changes

IvanIsCoding and others added 9 commits May 23, 2022 13:49

Add tests

dee77d0

Merge branch 'main' into numpy-array-fix

38eda83

Merge branch 'main' into numpy-array-fix

bee5d6c

Merge branch 'main' into numpy-array-fix

5812367

Merge branch 'main' into numpy-array-fix

c8bba76

Merge remote-tracking branch 'origin/main' into numpy-array-fix

0667682

Merge remote-tracking branch 'origin/numpy-array-fix' into numpy-arra…

8aa9e29

…y-fix

Add tests to rustworkx as well

9732c50

Black

e475b17

mtreinish added this to the 0.12.0 milestone Aug 9, 2022

mtreinish approved these changes Aug 9, 2022

View reviewed changes

mtreinish added the automerge Queue a approved PR for merging label Aug 9, 2022

Merge branch 'main' into numpy-array-fix

526e021

mergify bot merged commit c6067ba into Qiskit:main Aug 9, 2022

IvanIsCoding deleted the numpy-array-fix branch August 9, 2022 21:47

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Implement `array` for sequence types #615

Implement `array` for sequence types #615

IvanIsCoding commented May 22, 2022 •

edited

coveralls commented May 22, 2022 •

edited

georgios-ts left a comment

georgios-ts May 23, 2022

mtreinish May 23, 2022

mtreinish May 23, 2022

georgios-ts May 23, 2022

jakelishman May 23, 2022 •

edited

mtreinish May 23, 2022

jakelishman May 23, 2022 •

edited

georgios-ts May 23, 2022

IvanIsCoding May 23, 2022

mtreinish left a comment

coveralls commented Aug 9, 2022

	Ok(self.clone().to_pyarray(py).into())
	Ok(self.clone().into_pyarray(py).into())

Implement __array__ for sequence types #615

Implement __array__ for sequence types #615

Conversation

IvanIsCoding commented May 22, 2022 • edited

coveralls commented May 22, 2022 • edited

Pull Request Test Coverage Report for Build 2521772788

💛 - Coveralls

georgios-ts left a comment

Choose a reason for hiding this comment

georgios-ts May 23, 2022

Choose a reason for hiding this comment

mtreinish May 23, 2022

Choose a reason for hiding this comment

mtreinish May 23, 2022

Choose a reason for hiding this comment

georgios-ts May 23, 2022

Choose a reason for hiding this comment

jakelishman May 23, 2022 • edited

Choose a reason for hiding this comment

mtreinish May 23, 2022

Choose a reason for hiding this comment

jakelishman May 23, 2022 • edited

Choose a reason for hiding this comment

georgios-ts May 23, 2022

Choose a reason for hiding this comment

IvanIsCoding May 23, 2022

Choose a reason for hiding this comment

mtreinish left a comment

Choose a reason for hiding this comment

coveralls commented Aug 9, 2022

Pull Request Test Coverage Report for Build 2826944969

💛 - Coveralls

Implement `array` for sequence types #615

Implement `array` for sequence types #615

IvanIsCoding commented May 22, 2022 •

edited

coveralls commented May 22, 2022 •

edited

jakelishman May 23, 2022 •

edited

jakelishman May 23, 2022 •

edited