[AI-1190] Implement and test augmentations #665

ChristofferEdlund · 2023-09-22T12:17:43Z

Problem

We want to introduce the possibility to use Albumentation transforms with darwin-py torch datasets.

Solution

Introducing an AlbumentationsTransform class in torch.transforms that can be be used in the following manner:

from darwin.torch.dataset import (
    ClassificationDataset,
    InstanceSegmentationDataset,
    ObjectDetectionDataset,
    SemanticSegmentationDataset,
)

from darwin.torch.transforms import AlbumentationsTransform

dataset_path = Path(r"/path/to/local/darwin/dataset")
transform = AlbumentationsTransform.from_path('/tmp/transform.json')

inst_dataset = InstanceSegmentationDataset(dataset_path=dataset_path, transform=transform)

One can initilize the AlbumentationsTransform in three ways:

AlbumentationsTransform(albumentation_transform) <- needs an albumentations transformation
AlbumentationsTransform.frompath(path) <- a path pointing to a .json or .yaml file defining the transformation
AlbumentationsTransform.fromdict(dict) <- a dictionary defining the transform

To read more about the dictionary and file formats supported, we refer to the albumentations documentation.

Further.

Instance segmentation dataset output is changed for bounding boxes to be coco format (X, Y, W, H) to be consistent with darwin-json annotations and the ObjectDetecion torch dataset. [BREAKING CHANGE]
Clamping is introduced to ObjectDetection bboxes that is outside of image for more robust data loading.

Changelog

Introduced albumentation transform support for darwin torch datasts
[BREAKING CHANGE] darwin.torch.dataset.InstanceSegmentationDataset has bbox coordinates changes from pascal_voc to coco format (X, Y, H, W)
darwin.torch.dataset.ObjectDetectionDataset clamps bbox coordinates out-of-bound.

…nstance seg to x,y,w,h format

linear · 2023-09-22T12:17:45Z

AI-1190 Implement and test augmentations

This task is about implementing and benchmarking augmentations for model training. We will first test it for object detection on HF models and benchmark against not using augmentations.

If this improve model performance, let's generate a more general solution where any integration can import and use the augmentations.

darwin/torch/transforms.py

owencjones · 2023-09-26T09:05:23Z

Updated title so that it's easier for me on deployment 😉

almazan

All changes look good to me.

Edit: noticed something in a second pass. See below.

almazan

Some comments need to be addressed before approval

darwin/torch/transforms.py

almazan · 2023-09-27T18:23:14Z

darwin/torch/transforms.py

        boxes = torch.as_tensor(boxes, dtype=torch.float32).reshape(-1, 4)
-        boxes[:, 2:] += boxes[:, :2]
        boxes[:, 0::2].clamp_(min=0, max=w)
        boxes[:, 1::2].clamp_(min=0, max=h)


This is not clamping bboxes anymore, since now indexes 2 and 3 are height and width instead of x2 and y2. Boxes now could go outside of the image.

…g test

ChristofferEdlund added 5 commits September 20, 2023 14:41

added albumentation transform, clamp on bbox in obj det and changed i…

574ed68

…nstance seg to x,y,w,h format

removed xmin < xmax check

9c0256b

now albumentations supports instance segmentation and bbox

d1b636c

updated tests

36b82a2

should work for classificaiton now as well

0a180ce

ChristofferEdlund requested a review from almazan September 22, 2023 14:40

dorfmanrobert reviewed Sep 22, 2023

View reviewed changes

darwin/torch/transforms.py Outdated Show resolved Hide resolved

ChristofferEdlund added 2 commits September 26, 2023 10:28

cleaner up

aab52e2

formatting

2ccc966

owencjones changed the title ~~Ai 1190 implement and test augmentations~~ [AI-1190] Implement and test augmentations Sep 26, 2023

better error handling when albumentations is not installed

81d48fd

almazan approved these changes Sep 27, 2023

View reviewed changes

almazan suggested changes Sep 27, 2023

View reviewed changes

darwin/torch/transforms.py Show resolved Hide resolved

almazan reviewed Sep 27, 2023

View reviewed changes

ChristofferEdlund added 3 commits September 28, 2023 11:15

fixed potential clamp issues

e5a41da

adjusted tests to reflect the xyxy to xywh changes of the instance se…

dd59fca

…g test

added the check for empty boxes

e02474f

almazan approved these changes Sep 28, 2023

View reviewed changes

ChristofferEdlund merged commit e89c93f into master Sep 28, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[AI-1190] Implement and test augmentations #665

[AI-1190] Implement and test augmentations #665

Uh oh!

ChristofferEdlund commented Sep 22, 2023 •

edited

Loading

Uh oh!

linear bot commented Sep 22, 2023

Uh oh!

Uh oh!

owencjones commented Sep 26, 2023

Uh oh!

almazan left a comment •

edited

Loading

Uh oh!

almazan left a comment

Uh oh!

Uh oh!

almazan Sep 27, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

[AI-1190] Implement and test augmentations #665

[AI-1190] Implement and test augmentations #665

Uh oh!

Conversation

ChristofferEdlund commented Sep 22, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Problem

Solution

Changelog

Uh oh!

linear bot commented Sep 22, 2023

Uh oh!

Uh oh!

owencjones commented Sep 26, 2023

Uh oh!

almazan left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

almazan left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

almazan Sep 27, 2023

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

ChristofferEdlund commented Sep 22, 2023 •

edited

Loading

almazan left a comment •

edited

Loading