Duckarray tests for constructors and properties #6903

TomNicholas · 2022-08-09T18:36:56Z

Builds on top of #4972 to add tests for Variable/DataArray/Dataset constructors and properties when wrapping duck arrays.

Adds a file xarray/tests/duckarrays/base/constructors.py which contains new test base classes.

Also uses those new base classes to test Sparse array integration (not yet tried for pint integration).

Closes part of Public testing framework for duck array integration #6894
Tests added (tests for tests?? Maybe...)
User visible changes (including notable bug fixes) are documented in whats-new.rst
New functions/methods are listed in api.rst

for more information, see https://pre-commit.ci

…Nicholas/xarray into duckarray-tests-constructors

TomNicholas · 2023-12-14T04:42:10Z

@keewis I've come back to this PR now that #8404 is merged. I have:

Rewritten the reduce tests to use the new variables strategy
Used the array_strategy_fn pattern everywhere
Added a new file for testing xarray compatibility with the minimal implementation of the array API standard numpy.array_api
Rewritten the sparse tests to match
Removed the pint tests / DataArray tests for now
Reorganised the files to put all the "base" test classes in xarray/testing, and all the actual concrete tests that get run in xarray/tests. This follows the pattern I used in Hypothesis strategy for generating Variable objects #8404.

I just wanted to show you to see what you thought of this approach now.

TomNicholas · 2023-12-14T04:43:37Z

xarray/testing/duckarrays.py

+    @staticmethod
+    @abstractmethod
+    def array_strategy_fn(
+        *, shape: "_ShapeLike", dtype: "_DTypeLikeNested"
+    ) -> st.SearchStrategy[T_DuckArray]:
+        # TODO can we just make this an attribute?
+        ...
+


I would like to just do this in the subclassed tests:

class VariableConstructorTests array_strategy_fn = ...

but then would I need to make array_strategy_fn an abstract property?

TomNicholas · 2023-12-14T04:44:15Z

xarray/testing/duckarrays.py

+    @pytest.mark.parametrize(
+        "method",
+        (
+            "all",
+            "any",
+            # "cumprod",  # not in array API
+            # "cumsum",  # not in array API
+            # "max",  # only in array API for real numeric dtypes
+            # "max",  # only in array API for real floating point dtypes
+            # "median",  # not in array API
+            # "min",  # only in array API for real numeric dtypes
+            # "prod",  # only in array API for numeric dtypes
+            # "std",  # TypeError: std() got an unexpected keyword argument 'ddof'
+            # "sum",  # only in array API for numeric dtypes
+            # "var",  # TypeError: std() got an unexpected keyword argument 'ddof'


Clearly we need to be able to test certain reductions with only certain dtypes

TomNicholas · 2023-12-14T04:47:44Z

xarray/testing/duckarrays.py

+    def test_construct(self, data) -> None:
+        shape = data.draw(self.shapes)
+        dtype = data.draw(self.dtypes)
+        arr = data.draw(self.array_strategy_fn(shape=shape, dtype=dtype))


You can't use strategies.variables to test the constructor because strategies.variables has a call to the constructor inside it, so you end up not being able to access the original raw array that you want to use as the "expected" value.

for more information, see https://pre-commit.ci

TomNicholas · 2023-12-31T19:01:16Z

Okay I'm a bit stumped by how to do this. I want some test setup where:

We define sets of tests that a downstream library can import and run using just a couple of lines of code,
The downstream library can override the generated array type and the exact way equality is checked (e.g. to test sparse type arrays),
Certain tests are parametrized over many cases, which we ideally set upstream (e.g. over sum, mean, std, etc.),
But the downstream library can choose to xfail certain cases (e.g. because they haven't implemented std yet),
Different cases are tested for different dtypes automatically (e.g. because the array API standard supports string dtypes in any but not in max),
But the downstream library can also override those dtypes too (e.g. because they support more/less than what's currently in the array API).

I was hoping to do this using some combination of upstream test classes, pytest.parametrize on the test method, and xfail, but I can't really see how to get (4) and (6) to work with that approach. I might have to resort to using globals or something...

keewis · 2023-12-31T19:08:33Z

If you look at #4972, there's a new mark that applies other marks, like skip. The API is a bit awkward, and the implementation might be out of date, but it demonstrates that this is possible. Similarly, if you pass the dtypes as a fixture you should be able to override it on a per-module level (pytest has a whole bunch of docs on overriding fixtures).

Edit: although fixtures are not that different from glorified globals

TomNicholas · 2023-12-31T19:13:47Z

If you look at #4972, there's a new mark that applies other marks, like skip.

Interesting, thank you!

Similarly, if you pass the dtypes as a fixture you should be able to override it on a per-module level (pytest has a whole bunch of docs on overriding fixtures).

Also a good idea, but I think I need to override the dtype fixtures on a per-parametrization level...

keewis · 2023-12-31T19:40:38Z

Depends on what you want to parametrize, I guess. In general, though, I'm starting to lean towards not parametrizing the functions to test: pytest.mark.parametrize creates "variants" of the same test (i.e. it should test the same thing with different inputs), where having one function per "test" makes the summaries a bit more useful. Plus, running the tests of a single function are much easier that way compared to if they were variants. The downside is code duplication, but I don't think that's too much of an issue, especially for tests.

keewis added 30 commits February 28, 2021 00:51

add a initial, tiny draft of the automatic duckarray test machinery

21696bf

add missing comma

f14ba29

fix the global marks

90f9c41

don't try to apply marks if marks is None

aa4a457

only set pytestmark if the value is not None

9fa2eca

skip the module if pint is not installed

7994bad

filter UnitStrippedWarnings

c4a35f0

also test sparse

0efbbbb

add a test for the test extractor

73499b5

move the selector parsing code to a new function

532f213

also skip the sparse tests

f44aafa

move the utils tests into a different file

d651438

don't keep the utils tests in a test group

f84894a

split apply_marks into two separate functions

0090db5

add a mark which attaches marks to test variants

ef05c7d

move the duckarray testing module to tests

20334d9

move the utils to a separate module

f7acc0f

fix the existing tests

e41a15b

completely isolate the apply_marks tests

1f095a1

add a test for applying marks to test variants

2503af7

skip failing test variants

b229645

fix the import path

0723418

rename the duckarray testing module

6c4ccb0

use Variable as example

c4aa05a

fix the skips

fc97e90

only use dimensionless for cumprod

31e577a

also test dask wrapped by pint

8d80212

add a function to concatenate mappings

7c43e91

add tests for preprocess_marks

b6a90df

fix the tests

a95b5c4

TomNicholas and others added 17 commits December 13, 2023 17:43

Merge branch 'main' into duckarray-tests-constructors

b8af1e6

move testing framework to testing module

2aba7bc

absolute imports

9c38519

test_units -> test_pint

8b89911

first variable constructor tests pass for numpy.array_api

ab64e5e

constructor tests now don't use xarray strategies

f4dd250

constructor tests for sparse

626efdf

use new strategies in reduce test and remove old code

ff08473

reinstate strategies module which I accidentally git rmed

5ab5a74

reduce tests for numpy array api now pass

ec7f726

use suppress_warning utility

843217e

test sparse reductions

9d585ce

remove old utilities

7dc832d

[pre-commit.ci] auto fixes from pre-commit.com hooks

32cbdc2

for more information, see https://pre-commit.ci

remove pint tests for now

a002d0b

Merge branch 'duckarray-tests-constructors' of https://github.com/Tom…

52682bd

…Nicholas/xarray into duckarray-tests-constructors

remove conftest stuff

bfc3fe7

TomNicholas commented Dec 14, 2023

View reviewed changes

TomNicholas and others added 3 commits December 16, 2023 23:23

single class can test variable/dataarray/dataset

d23eaec

test numpy outside of array API

8e3c655

[pre-commit.ci] auto fixes from pre-commit.com hooks

c07c690

for more information, see https://pre-commit.ci

TomNicholas mentioned this pull request Dec 18, 2023

Support non-str Hashables in DataArray #8559

Merged

3 tasks

narrow dtypes

d2b35c5

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Duckarray tests for constructors and properties #6903

Duckarray tests for constructors and properties #6903

TomNicholas commented Aug 9, 2022 •

edited

Loading

TomNicholas commented Dec 14, 2023 •

edited

Loading

TomNicholas Dec 14, 2023

TomNicholas Dec 14, 2023

TomNicholas Dec 14, 2023

TomNicholas commented Dec 31, 2023

keewis commented Dec 31, 2023 •

edited

Loading

TomNicholas commented Dec 31, 2023

keewis commented Dec 31, 2023

Duckarray tests for constructors and properties #6903

Are you sure you want to change the base?

Duckarray tests for constructors and properties #6903

Conversation

TomNicholas commented Aug 9, 2022 • edited Loading

TomNicholas commented Dec 14, 2023 • edited Loading

TomNicholas Dec 14, 2023

Choose a reason for hiding this comment

TomNicholas Dec 14, 2023

Choose a reason for hiding this comment

TomNicholas Dec 14, 2023

Choose a reason for hiding this comment

TomNicholas commented Dec 31, 2023

keewis commented Dec 31, 2023 • edited Loading

TomNicholas commented Dec 31, 2023

keewis commented Dec 31, 2023

TomNicholas commented Aug 9, 2022 •

edited

Loading

TomNicholas commented Dec 14, 2023 •

edited

Loading

keewis commented Dec 31, 2023 •

edited

Loading