[issue-799] coco split into train/test; updated coco dataset module api #805

yromanyshyn · 2022-02-02T11:48:57Z

resolves #799

review-notebook-app · 2022-02-02T11:49:02Z

Check out this pull request on

See visual diffs & provide feedback on Jupyter Notebooks.

Powered by ReviewNB

deepchecks/vision/datasets/detection/coco.py

yromanyshyn · 2022-02-02T15:22:31Z

update:

@ItayGabbay @nirhutnik
please take a look at the deepchecks/vision/metrics_utils/detection_precision_recall.py
I had to make a small fix to it, not sure if it is correct
The reason why I added it is because it was failing at that point

In general _compute_ap_recall was returning a dict with all values set to None, but
tensor cannot be created with none values, you will get an error if you try

RuntimeError: Could not infer dtype of NoneType

nirhutnik · 2022-02-02T15:34:38Z

Seems good to me, but would prefer Gabbay take a look as its his code. If I understand correctly, this means that sometimes we get average precision = None (why?), and in those cases you want it to count as 0. Not sure what is done later with that information - if its summed up, then ok, but I guess it is averaged again and in that case maybe should be just ignored and not counted in the denominator as well.

But didn't dive into the code yet.

yromanyshyn · 2022-02-02T16:27:47Z

also, are those two lines within VisionContext correct?
(that is a reason why notebook CI/CD check fails)

80: if train and test:
81:      train.validate_shared_label(test)

if I got it right, in object-detection task label is a 2d array each row of which
represent a separate object (bbox), and each image(label) could contain different
number of objects

we probably should only verify that number of columns is the same?

nirhutnik · 2022-02-02T16:50:02Z

@yromanyshyn I think that Gabbay's PR solves that anyway as he transfers label validation to the LabelEncoder classes

ItayGabbay · 2022-02-03T07:27:28Z

deepchecks/vision/datasets/detection/coco.py

+    pin_memory: bool = True,
+    object_type: Literal['Dataset', 'DataLoader'] = 'DataLoader'
+) -> t.Union[DataLoader, vision.VisionDataset]:
+    """Get the COCO dataset and return a dataloader.


I would call it the COCO 128 dataset, as the COCO dataset is much larger than that.

Also, did we validate the license of this dataset? and model?

changed,
it is GNU Version 3

ItayGabbay · 2022-02-03T07:33:37Z

makefile

-	 xargs -P4 -I'{}' $(JUPYTER) nbconvert --execute '{}' \
-	  --to notebook --stdout > /dev/null
+
+	$(JUPYTER) nbconvert --execute $$(find ./docs/source/examples -name "*.ipynb") --to notebook --stdout > /dev/null


weird but I initially thought that this was causing 'notebook check' failure. I have undone it

Maybe you are correct. I'm now checking this

ItayGabbay · 2022-02-03T07:36:21Z

deepchecks/vision/metrics_utils/detection_precision_recall.py

@@ -79,7 +79,10 @@ def compute(self):
                **self._compute_ap_recall(ev["scores"], ev["matched"], ev["NP"])
            }
        if self.return_ap_only:
-            res = torch.tensor([res[k]["AP"] for k in sorted(res.keys())])
+            res = torch.tensor([


I think that's OK. This may happen in the case the model detected a class that doesn't exist in the test set.
Anyway, we are planning to replace this module in the near future.

…into issue-799

update

a89c754

yromanyshyn added the refactoring Making significant changes to structure of code label Feb 2, 2022

yromanyshyn requested review from ItayGabbay and nirhutnik February 2, 2022 11:48

yromanyshyn self-assigned this Feb 2, 2022

nirhutnik reviewed Feb 2, 2022

View reviewed changes

deepchecks/vision/datasets/detection/coco.py Outdated Show resolved Hide resolved

yromanyshyn added 3 commits February 2, 2022 16:12

update

dc4e338

update

aba316c

update

6b4d640

makefile update

e161615

ItayGabbay requested changes Feb 3, 2022

View reviewed changes

yromanyshyn added 3 commits February 3, 2022 10:00

update

f1881e4

comments update

fac1d96

Merge branch 'main' into issue-799

24886f2

ItayGabbay approved these changes Feb 3, 2022

View reviewed changes

yromanyshyn and others added 4 commits February 4, 2022 01:00

Merge branch 'main' into issue-799

2bd09b9

Merge branch 'main' into issue-799

7b63492

Merge branch 'issue-799' of https://github.com/deepchecks/deepchecks …

e4032a9

…into issue-799

label validation fix

d0c8612

yromanyshyn merged commit d546f38 into main Feb 8, 2022

delete-merged-branch bot deleted the issue-799 branch February 8, 2022 10:16

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[issue-799] coco split into train/test; updated coco dataset module api #805

[issue-799] coco split into train/test; updated coco dataset module api #805

yromanyshyn commented Feb 2, 2022

review-notebook-app bot commented Feb 2, 2022

yromanyshyn commented Feb 2, 2022 •

edited

nirhutnik commented Feb 2, 2022

yromanyshyn commented Feb 2, 2022

nirhutnik commented Feb 2, 2022

ItayGabbay Feb 3, 2022

ItayGabbay Feb 3, 2022

yromanyshyn Feb 3, 2022

ItayGabbay Feb 3, 2022

yromanyshyn Feb 3, 2022

ItayGabbay Feb 3, 2022

ItayGabbay Feb 3, 2022

yromanyshyn Feb 3, 2022

[issue-799] coco split into train/test; updated coco dataset module api #805

[issue-799] coco split into train/test; updated coco dataset module api #805

Conversation

yromanyshyn commented Feb 2, 2022

review-notebook-app bot commented Feb 2, 2022

yromanyshyn commented Feb 2, 2022 • edited

nirhutnik commented Feb 2, 2022

yromanyshyn commented Feb 2, 2022

nirhutnik commented Feb 2, 2022

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

yromanyshyn commented Feb 2, 2022 •

edited