Dev/fix latest numpy types #403

Merged · jklaise merged 30 commits into SeldonIO:master on Jan 6, 2022

Conversation

@jklaise (Member) commented on Dec 6, 2021:

This PR is based on the latest numpy release, which ships type information. This flags a few issues with our types, which have been resolved in this PR. It is a prerequisite for upgrading to tensorflow 2.7.

Most of this PR is pretty straightforward; however, I've left TODO: TBD notes where I would welcome discussion. These mostly concern implicit narrowing of types that either mypy can't handle (fixed easily with an appropriate type: ignore[code]) or, more seriously, gaps in our logic where we haven't considered that a type might not be narrowed down sufficiently (e.g. is the user's list always an np.ndarray by the time it reaches a function that only works on np.ndarray?).
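As a minimal sketch of the narrowing issue described above (the names are hypothetical, not the actual library code):

from typing import Union
import numpy as np

def needs_array(x: np.ndarray) -> np.ndarray:
    return x * 2

def process(x: Union[np.ndarray, list]) -> np.ndarray:
    # mypy flags needs_array(x) here with [arg-type], because x may still
    # be a list. Two ways out:
    #   1. suppress the specific error code once verified by hand:
    #      return needs_array(x)  # type: ignore[arg-type]
    #   2. narrow explicitly, so both mypy and readers see the intent:
    if isinstance(x, list):
        x = np.asarray(x)
    return needs_array(x)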

There will be a follow-up PR enabling strict optional typing, which will likely reveal a few more issues with the current typing.

@ascillitoe (Contributor) left a comment:

Generally LGTM! The only bit I wonder about is whether to allow type redefinitions now?

@@ -247,10 +249,12 @@ def score(self, X: np.ndarray, batch_size: int = int(1e10), return_predictions:
# model predictions
y = predict_batch(X, self.model, batch_size=batch_size)
y_recon = predict_batch(X_recon, self.model, batch_size=batch_size)
y = cast(np.ndarray, y) # help mypy out
@ascillitoe (Contributor) commented:

Is there a specific reason not to use a type annotation like the one below? Or is it just a convention we are going with?

y: np.ndarray = predict_batch(X, self.model, batch_size=batch_size)

@jklaise (Member, Author) replied:

It doesn't work because y is already defined and --allow-redefinition is not enabled.
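A minimal sketch of this mypy behaviour (the predict function is hypothetical, not the real predict_batch signature):

from typing import Union, cast
import numpy as np

def predict(x: np.ndarray) -> Union[np.ndarray, tuple]:
    return x

def score(x: np.ndarray) -> np.ndarray:
    y = predict(x)                # inferred as Union[np.ndarray, tuple]
    # y: np.ndarray = predict(x)  # rejected: "y" already has a type from the
    #                             # line above; re-annotation requires
    #                             # --allow-redefinition
    y = cast(np.ndarray, y)       # no-op at runtime; narrows the type for mypy
    return y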

@ascillitoe (Contributor) replied:

Ah that makes sense, thanks!

@@ -83,16 +83,20 @@ def __init__(
else:
self.device = torch.device('cpu')

# TODO: TBD: the several type:ignore's below are because x_ref is typed as an np.ndarray
@ascillitoe (Contributor) commented:

Do you think --allow-redefinition would cause mypy to miss anything significant elsewhere? I wonder if it's best to activate it now, as redefining variable types is pretty common for us?

Otherwise, we could go with a convention of renaming variables instead, i.e. something like

x_ref_torch = cast(torch.Tensor, torch.as_tensor(self.x_ref).to(self.device))

Although the latter seems undesirable to me...

@jklaise (Member, Author) replied:

Renaming wouldn't need the cast at all, I think.

I'm a bit wary of enabling --allow-redefinition, as it's unclear, as you say, what we could potentially miss. Even so, it is quite limited, as redefinitions would be allowed only within the same scope.
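For concreteness, a sketch of the rename option (using the x_ref_torch name from the snippet above; the wrapper function is hypothetical):

import numpy as np
import torch

def to_tensor(x_ref: np.ndarray, device: torch.device) -> torch.Tensor:
    # Option A: rename, so each name keeps a single type and no cast is needed
    x_ref_torch = torch.as_tensor(x_ref).to(device)
    return x_ref_torch
    # Option B, reassigning x_ref itself, would need --allow-redefinition,
    # since x_ref is annotated as np.ndarray:
    #   x_ref = torch.as_tensor(x_ref).to(device)  # error: [assignment]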

@ascillitoe (Contributor) replied:

Sorry, yes, cast is making more sense to me now!

I'd have thought that "within the same scope" would mostly be fine for us? But in any case, it's not clear to me what "within the same block and nesting depth" in the mypy docs means. Also, I take your point that it's probably worth being cautious about changing this.

@jklaise (Member, Author) commented:

Well, I don't know what happened, but I just removed all these ignores and the mypy check now passes locally...

@jklaise (Member, Author) commented:

Never mind, I was being silly, as I had downgraded back to numpy 1.19...

@ascillitoe (Contributor) replied:

Too good to be true!

@@ -20,6 +20,7 @@ exclude =
[mypy]
ignore_missing_imports = True
strict_optional = False
show_error_codes = True
@ascillitoe (Contributor) commented:
Good idea!
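For reference, a small sketch of what show_error_codes enables (the snippet is hypothetical):

# With show_error_codes = True, mypy appends the error code to each message:
#   error: Incompatible types in assignment (expression has type "str",
#          variable has type "int")  [assignment]
# so a suppression can target exactly that code instead of the whole line:
x: int = 0
x = "oops"  # type: ignore[assignment]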

@@ -245,7 +245,8 @@ def logp(self, dist, X: np.ndarray, return_per_feature: bool = False, batch_size
Log probabilities.
"""
logp_fn = partial(dist.log_prob, return_per_feature=return_per_feature)
return predict_batch(X, logp_fn, batch_size=batch_size)
# TODO: TBD: can this be any of the other types from predict_batch? i.e. tf.Tensor or tuple
@ascillitoe (Contributor) commented:

I'm not sure about this one. I'd have guessed that predict_batch won't return a tf.Tensor here, since we don't explicitly set dtype (so return_np=True), but it looks like a tuple might be possible? Hopefully @arnaudvl has more input!
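To illustrate the kind of narrowing the TODO is asking about (the Union below is an assumption for illustration; the real predict_batch return type may differ):

from typing import Tuple, Union
import numpy as np
import tensorflow as tf

# Assumed return type, for illustration only:
BatchResult = Union[np.ndarray, tf.Tensor, Tuple[np.ndarray, ...]]

def as_array(result: BatchResult) -> np.ndarray:
    # if a tuple is possible, callers must narrow before treating the
    # result as a single array
    if isinstance(result, tuple):
        return np.concatenate([np.asarray(r) for r in result])
    return np.asarray(result)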

@jklaise (Member, Author) commented on Jan 5, 2022:

There are a few more issues with the recently released numpy 1.22, for which I will push another commit shortly.

@jklaise merged commit d2d413c into SeldonIO:master on Jan 6, 2022.