Jpcbertoldo/mvtec ad loco #538

jpcbertoldo · 2022-09-02T17:05:15Z

Disclaimer

I am creating this just to keep track of the branch, please ignore the PR for the moment.

Description

Create a new dataset: MVTec LOCO Anomaly Detection.

"LOCO" stands for "LOgical COnstraints"

I based myself on anomalib/data/mvtec.py.

Fixes Add support for MVTEC LOCO AD #471

`imread_strategy`

The dataset supports an option imread_strategy which allows the user how to choose when the images are loaded:

onthefly: behaviour I found in mvtec.py, the images are loaded upon demand during the training;
preload: all the images are cached in the memory (RAM, not GPU) when the dataset is being initialized.

`anotype` and `super_anotype`

Besides providing the binary label, I also create the dataset with two other categorical values:

super_anotype: is it a logical or structural anomaly? (or a normal?)
anotype: "what is the problem with the image?", mvtec ad also has different types of anomalies for each category but this is particularly more interesting here because there are many types of logical violations possible.

I specifically included this because I am interested in evaluating separately by those types but I will later create an issue for that feature.

`mask` vs. `masks`

MVTec LOCO's logical anomalies may include several anoamlies in a single image and to properly evaluate them one needs to consider them separately so they are segmented in different mask files in the ground truth.

Since the rest of library expects a tensor mask (SINGULAR), I merge them all into a single binary maks (with loss information because they cannot be separated anymore).

In order to later peform proper evaluation there is a second tensor masks (PLURAL) which encodes each anomalous region with a different value (0 is a normal pixel, and 1, 2, ..., N are anomalous pixels).

things in `MVTecAD` but not in `MVTecLOCO`

1) `self.transform_config_val = self.transform_config_train`

        if self.transform_config_train is not None and self.transform_config_val is None:
            self.transform_config_val = self.transform_config_train

Is there a good reason for assuming this?

For me it could make sense that self.transform_config_val could have light data augmentations (say, tiny brightness changes) but that should not be repeated in the validation set.

2) `split_normal_images_in_train_set(samples, split_ratio, seed)`

MVTec LOCO already defines fixed validation sets so i did not include the option of doing it dinamically like in MVTec AD.

Checklists

Changes

Bug fix (non-breaking change which fixes an issue)
Refactor (non-breaking change which refactors the code base)
New feature (non-breaking change which adds functionality)
Breaking change (fix or feature that would cause existing functionality to not work as expected)
This change requires a documentation update

Checklist

My code follows the pre-commit style and check guidelines of this project.
I have performed a self-review of my code
I have commented my code, particularly in hard-to-understand areas
I have made corresponding changes to the documentation
My changes generate no new warnings
I have added tests that prove my fix is effective or that my feature works
[] New and existing tests pass locally with my changes

jpcbertoldo · 2022-09-03T21:47:59Z

yet something i dont understand in anomalib.data.mvtec

# in MVTecLOCODataset.__getitem__
pre_processed = self.pre_process(image=image, mask=mask)

how this works?

does the transform re-apply the last call when mask is not None?

review-notebook-app · 2022-09-04T12:50:01Z

Check out this pull request on

See visual diffs & provide feedback on Jupyter Notebooks.

Powered by ReviewNB

jpcbertoldo · 2022-09-05T11:41:46Z

Check out this pull request on

See visual diffs & provide feedback on Jupyter Notebooks.

Powered by ReviewNB

@samet-akcay is this some sort of integratin in the project?

samet-akcay · 2022-09-05T13:14:15Z

Check out this pull request on
See visual diffs & provide feedback on Jupyter Notebooks.
Powered by ReviewNB

@samet-akcay is this some sort of integratin in the project?

We use ReviewNB for notebook reviews. It's not so easy to review jupyter notebooks on GitHub. ReviewNB makes it significantly easier.

…envinotoolkit#549) patchcore: Solved nans issues for large discrepancies in anomaly map

fix help description for argument task

you have to reutrn the results of load_state_dict if you wrappered the original one

…#529) Pass pre-trained from config to LightningModule

…omalib into jpcbertoldo/mvtec-ad-loco

jpcbertoldo · 2022-09-11T12:02:26Z

anomalib/data/mvtec_loco.py

+        category: str,
+        task: str = TASK_SEGMENTATION,
+        imread_strategy: str = IMREAD_STRATEGY_PRELOAD,
+        image_size: Optional[Union[int, Tuple[int, int]]] = None,


The images in this dataset are not squared.
The ratio of widh/height can end up too different than the original image when the image size is given as an int.

Maybe we should add a warning here?

jpcbertoldo · 2022-09-11T12:04:39Z

anomalib/data/mvtec_loco.py

+                "mask_paths": str(self.samples.iloc[index]["mask_paths"]),
+                # TODO CHECK IF THE DOUBLE CALL TO PREPROCESS WILL WORK WITH ALBUMENTATIONS
+                "masks": self.pre_process(image=image, mask=mask_dict["masks"])["mask"],
+                "mask": self.pre_process(image=image, mask=mask_dict["mask"])["mask"],


self.pre_process is being called for the 3rd time here, will that create any problems?

I'm thinking that maybe the random transforms will apply the same transform every two times (for the image and for the mask).

jpcbertoldo · 2022-09-11T12:27:56Z

i messed some git commands, i'm closing this one and openning another one

jpcbertoldo added 2 commits September 1, 2022 21:14

copy mvtec and add some config conts in comments

ca35031

Merge branch 'openvinotoolkit:main' into jpcbertoldo/mvtec-ad-loco

f52a448

github-actions bot added the Data label Sep 2, 2022

jpcbertoldo added 2 commits September 3, 2022 22:02

first version building the dataset

5c1b8ad

pass pre-commit hooks

2588338

jpcbertoldo added 3 commits September 3, 2022 23:49

remove todos and correct a const

4a41fc8

remove todos and correct a const

a199e54

create notebook and make small corrections

4944eaf

github-actions bot added the Notebooks label Sep 4, 2022

🐞 Fix linting issues (openvinotoolkit#535)

8e465ee

samet-akcay requested a review from djdameln September 5, 2022 11:39

bsl546 and others added 13 commits September 9, 2022 09:10

🐞 Bug Fix: Solve NaN values of anomaly scores for PatchCore model (op…

9c1c2cc

…envinotoolkit#549) patchcore: Solved nans issues for large discrepancies in anomaly map

🐞 Bug Fix: Help description for argument task (openvinotoolkit#547)

c37764c

fix help description for argument task

🐞 Bug Fix: Return results of load_state_dict func (openvinotoolkit#546)

dff86b1

you have to reutrn the results of load_state_dict if you wrappered the original one

🔨 Pass pre-trained from config to ModelLightning (openvinotoolkit…

baca449

…#529) Pass pre-trained from config to LightningModule

manage multiple masks

12206bb

copy mvtec and add some config conts in comments

434a7ad

first version building the dataset

92ad7e2

pass pre-commit hooks

7a80914

remove todos and correct a const

375ac65

remove todos and correct a const

f63d5a9

create notebook and make small corrections

fd2d917

manage multiple masks

43f154e

Merge branch 'jpcbertoldo/mvtec-ad-loco' of github.com:jpcbertoldo/an…

b7ae215

…omalib into jpcbertoldo/mvtec-ad-loco

github-actions bot added Callbacks labels Sep 11, 2022

github-actions bot added Inference Logger Tests Tools labels Sep 11, 2022

add unit tests for mvtec loco

5d9fbca

jpcbertoldo commented Sep 11, 2022

View reviewed changes

jpcbertoldo closed this Sep 11, 2022

samet-akcay mentioned this pull request Jun 7, 2023

[Task]: Implementation of newer SOTA unsupervised anomaly detection methods #1113

Closed

lemonbuilder mentioned this pull request Sep 12, 2023

[Task]: logical anomaly detection #1341

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Jpcbertoldo/mvtec ad loco #538

Jpcbertoldo/mvtec ad loco #538

jpcbertoldo commented Sep 2, 2022 •

edited

Loading

jpcbertoldo commented Sep 3, 2022

review-notebook-app bot commented Sep 4, 2022

jpcbertoldo commented Sep 5, 2022

samet-akcay commented Sep 5, 2022

jpcbertoldo Sep 11, 2022

jpcbertoldo Sep 11, 2022

jpcbertoldo commented Sep 11, 2022

Jpcbertoldo/mvtec ad loco #538

Jpcbertoldo/mvtec ad loco #538

Conversation

jpcbertoldo commented Sep 2, 2022 • edited Loading

Description

imread_strategy

anotype and super_anotype

mask vs. masks

things in MVTecAD but not in MVTecLOCO

1) self.transform_config_val = self.transform_config_train

2) split_normal_images_in_train_set(samples, split_ratio, seed)

Checklists

Changes

Checklist

jpcbertoldo commented Sep 3, 2022

review-notebook-app bot commented Sep 4, 2022

jpcbertoldo commented Sep 5, 2022

samet-akcay commented Sep 5, 2022

jpcbertoldo Sep 11, 2022

Choose a reason for hiding this comment

jpcbertoldo Sep 11, 2022

Choose a reason for hiding this comment

jpcbertoldo commented Sep 11, 2022

jpcbertoldo commented Sep 2, 2022 •

edited

Loading

`imread_strategy`

`anotype` and `super_anotype`

`mask` vs. `masks`

things in `MVTecAD` but not in `MVTecLOCO`

1) `self.transform_config_val = self.transform_config_train`

2) `split_normal_images_in_train_set(samples, split_ratio, seed)`