Semi-SL Semantic Segmentation. Prototype View. #2156

kprokofi · 2023-05-15T21:52:03Z

Summary

How to test

Checklist

I have added unit tests to cover my changes.
I have added integration tests to cover my changes.
I have added e2e tests for validation.
I have added the description of my changes into CHANGELOG in my target branch (e.g., CHANGELOG in develop).
I have updated the documentation in my target branch accordingly (e.g., documentation in develop).
I have linked related issues.

License

I submit my code changes under the same Apache License that covers the project.
Feel free to contact the maintainers if that's a concern.
I have updated the license header for each file (see an example below).

# Copyright (C) 2023 Intel Corporation
# SPDX-License-Identifier: Apache-2.0

kprokofi · 2023-05-16T21:45:24Z

This PR includes new solution for Semi-SL approach. It is implemented for new models, SegNext. In the next PRs documentation will be updated with validation metrics on some public datasets used for validation. Also, experiments for tuning some hyperparameters are in progress in the background mode.
For now, these changes (Prototype based approach) can achieve the following result:

model	cityscapes 1/16	kitty_54	VOC 1/16	DISK 1/4	city4 1/16	voc_12	Mean Dice gain
ham_segnext_t: SUP	55.93	62.35	73.82	86.87	68.3	68	0
ham_segnext_t: MT	59.2	66.68	76.14	87.4	68.9	69.78	+ 2.14
ham_segnext_t: Proto 0.1	60.2	67.1	77.2	87.6	69.21	70.4	+ 2.74

Experiments with bigger models in progress.
Some results:
SegNext-s:

model	cityscapes 1/16	kitty_54	VOC 1/16	voc_12	Mean Dice gain
ham_segnext_s: MT	67.02	68.11	79.5	75.11	0
ham_segnext_s: Proto 0.1	69.71	68.54	80.51	75	+ 1%

otx/algorithms/segmentation/adapters/mmseg/models/segmentors/mean_teacher_segmentor.py

otx/algorithms/segmentation/adapters/mmseg/task.py

JihwanEom · 2023-05-17T01:25:17Z

Could you please clarify which paper you've implemented, and provide a link to it? Am I correct in understanding that your implementation is based on "Semi-supervised Semantic Segmentation with Prototype-based Consistency Regularization" (https://arxiv.org/pdf/2210.04388.pdf)?

If this is the case, would you consider adopting a step-by-step approach before fully integrating it?

As you might be aware, the Mean Teacher model and CutMix-seg are commonly used baselines in academia. It might be beneficial to first explore the performance and training time trade-off for CutMix-seg and then incrementally add Prototype view functionalities.

I would also suggest trying to keep changes to the MeanTeacherSegmentor to a minimum by defining other class for Prototype view, as it could serve as a standard baseline architecture for semi-supervised semantic segmentation.

kprokofi · 2023-05-17T07:11:07Z

@JihwanEom
I use this paper as base idea and this one: https://arxiv.org/abs/2203.15102 for implementation reference

Unfortunately, Cutmix-seg performs poorly in OTX, I conducted experiments with that including different probability as well as Soobee did and saw accuracy degradation. But, CutOut helped and in final solution I changed CutMix to CutOut.

What do you mean step by step? I integrated Prototype network and enhanced a bit MT with CutOut and Filter pixels with high entropy. You can use standard MeanTeacher as always, just remove protohead from config. Also, old models (HrNets) use MT, I didn't change their configs.
It is impossible to use different Class, because the base method it is Mean Teacher.

Many changes in that PR is just "black" some files

JihwanEom · 2023-05-17T07:45:00Z

Thank you for the detailed explanation. I agree that CutOut can bring the promising performance improvements, but I believe that CutMix may work in our situation. Have you investigated the performance tendencies for both the lite-hrnet and SegNext templates? (or include ResNets?)

According to the CutMix-seg paper (https://arxiv.org/pdf/1906.01916.pdf), CutMix brings significant performance gains compared to CutOut even with only 100 labeled images. I don't think there is a significant difference in the environment between our situation and the one described in the paper.

Even if CutMix does not work due to unknown issues and we use CutOut instead, we need to assess the net benefit of using CutOut without combining it with Prototype view. That was my intention for taking a step-by-step approach. Could you share the experiment results if you have already?

You can continue using the standard MeanTeacher as usual, just remove protohead from the configuration. Also, the old models (HrNets) use MeanTeacher, and I haven't changed their configurations.
=> My suggestion was to consider inheriting MeanTeacherSegmentor in the new architecture of PrototypeViewSegmentor. This would ensure maintainability and reproducibility for not only MT and new models. I recommend creating a new file that defines PrototypeViewSegmentor and inheriting from MeanTeacherSegmentor, because it's a base method for many various algorithms for semi-sl on semantic segmentation as you said. But it's also my individual opinion as OTX developer, please share us if other one have good design ideas.

kprokofi · 2023-05-17T15:00:15Z

My suggestion was to consider inheriting MeanTeacherSegmentor in the new architecture of PrototypeViewSegmentor. This would ensure maintainability and reproducibility for not only MT and new models. I recommend creating a new file that defines PrototypeViewSegmentor and inheriting from MeanTeacherSegmentor, because it's a base method for many various algorithms for semi-sl on semantic segmentation as you said.

I consider your proposal and even tried to do that, but I faced a problem with that. If I do that -> I will copy all the code from forward_train method. Why should we do that?
I moved everything related to Prototypes to decode_proto_network. And there are new lines of code in main forward method:

I think it is ambigious to create different Segmentor and copy almost all code there.
We also have second option, not copy, but use parent forward and then compute proto based forward, but in that case I should call self.model_s.extract_feat twice. It is double inference

I would consider leave it in the main MeanTeacher framework as ProtoNetwork is additional method rather than main one.

kprokofi · 2023-05-17T18:25:34Z

I have some experiments comparing CutMix, CutOut and base algo

But I would like to conduct it one more time with different implementation of CutMix, SegNext-s and without early stopping

otx/algorithms/segmentation/configs/configuration.yaml

otx/algorithms/segmentation/adapters/mmseg/models/segmentors/mean_teacher_segmentor.py

otx/algorithms/segmentation/adapters/mmseg/models/utils/proto_utils.py

supersoob

I think it's okay to keep use_prototype_head in original MeanTeacherSegmentor and enable it by parameter because it were eventually based on mean teacher as current @kprokofi's implementation. About Cutmix, even though I checked the transformed image is alright, I couldn't see the gain of it (but only 1-2% drop). In my investigation, at that time there were some doubtable points. First, it was needed to fix any confidence threshold for pseudo label. Second, inference needed to be done in teacher model which might more generalized than student. And I doubted the cross entropy loss using as consistency loss is affecting it, which it uses prediction from the model trained with few labeled model that might not well-generalized as gt. I don't exactly know what caused cutmix make worse but I hope these helped some for you to refer.

otx/algorithms/segmentation/adapters/mmseg/models/heads/proto_head.py

supersoob · 2023-05-18T07:03:57Z

Could you add unit tests for prototype head and for changed mean teacher?

requirements/segmentation.txt

eunwoosh · 2023-05-22T01:42:20Z

LGTM, but as Soobee said, could you add unit test?

kprokofi · 2023-05-22T16:57:08Z

@JihwanEom
Please, find below experiments with augmentations on bigger model. Unfortunately, cutmix always performs worse in our OTX. I tried 2 different implementations, but result is the same - CutOut looks better

Model	Cityscapes	VOC	Kitty_57
SegNext-b: MT	70.88	82.05	70.45
SegNext-b: MT + pixel_filter + cutout	71.87	82.35	73.67
SegNext-b: MT + pixel_filter + cutmix	70.43	81.80	69.44
SegNext-b: MT + pixel_filter + cutout + ProtoNet	72.10	82.74	74.31

kprokofi · 2023-05-23T09:56:59Z

I added unit tests + integration tests for Semi-SL and e2e for new model (we need to start validate it at least one template)
Could you take a look and merge if it looks good to you?

JihwanEom · 2023-05-24T00:43:31Z

@JihwanEom Please, find below experiments with augmentations on bigger model. Unfortunately, cutmix always performs worse in our OTX. I tried 2 different implementations, but result is the same - CutOut looks better

Model Cityscapes VOC Kitty_57
SegNext-b: MT 70.88 82.05 70.45
SegNext-b: MT + pixel_filter + cutout 71.87 82.35 73.67
SegNext-b: MT + pixel_filter + cutmix 70.43 81.80 69.44
SegNext-b: MT + pixel_filter + cutout + ProtoNet 72.10 82.74 74.31

Okay, thank you so much for experiment results and kind explanation. I understood.

jaegukhyun · 2023-05-24T05:14:20Z

Could you set milestone for this PR?

kprokofi · 2023-05-24T07:06:51Z

Could you set milestone for this PR?

Done. Could you approve if it is OK? Let's finally merge this.
In the following PR I will update documentation accordingly

github-actions bot added the ALGO Any changes in OTX Algo Tasks implementation label May 15, 2023

kprokofi added the ENHANCE Enhancement of existing features label May 15, 2023

kprokofi force-pushed the kp/semisl_proto branch from 40d58a0 to 5503ac2 Compare May 16, 2023 21:05

kprokofi marked this pull request as ready for review May 16, 2023 21:24

kprokofi requested a review from a team as a code owner May 16, 2023 21:24

github-actions bot added the TEST Any changes in tests label May 16, 2023

eunwoosh reviewed May 17, 2023

View reviewed changes

github-actions bot added BUILD DEPENDENCY Any changes in any dependencies (new dep or its version) should be produced via Change Request on PM and removed BUILD labels May 17, 2023

supersoob reviewed May 18, 2023

View reviewed changes

otx/algorithms/segmentation/adapters/mmseg/models/heads/proto_head.py Show resolved Hide resolved

supersoob reviewed May 18, 2023

View reviewed changes

requirements/segmentation.txt Show resolved Hide resolved

kprokofi force-pushed the kp/semisl_proto branch from 7aff014 to ee9d337 Compare May 20, 2023 10:49

kprokofi added 7 commits May 23, 2023 16:09

added cutmix, filter loss

3a3472b

added semisl to segnext, added ohem loss

be2ceea

added ProtoNet

1194b09

changed loss handling

ed17c9e

proto_head_debugged

67f508f

changes for experiments

92a48de

added ham-seg based head for proto

48746a7

kprokofi added 14 commits May 23, 2023 16:09

black files back

efd4b44

revert configuration back

cfe9dae

minor EMA change

a99c2ca

revert semisl recipie back

77fab4c

fix configure test

8d5e3a3

fix dual model ema test

3b5141a

fix tests

90bde72

minor rever back

b122ba2

fix classes in aux head

daeb13b

reply to comments

b88e149

added docstrings

e853672

added articles to description

0880a6a

added unit tests, integration and e2e

4cc348b

fix convert via ignore

291f7f3

kprokofi force-pushed the kp/semisl_proto branch from 7baae35 to 291f7f3 Compare May 23, 2023 07:57

github-actions bot added the API Any changes in OTX API label May 23, 2023

kprokofi requested review from supersoob, eunwoosh and JihwanEom May 23, 2023 09:55

eunwoosh approved these changes May 24, 2023

View reviewed changes

kprokofi added this to the 1.4.0 milestone May 24, 2023

supersoob approved these changes May 24, 2023

View reviewed changes

kprokofi merged commit e1004eb into openvinotoolkit:develop May 24, 2023
12 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Semi-SL Semantic Segmentation. Prototype View. #2156

Semi-SL Semantic Segmentation. Prototype View. #2156

kprokofi commented May 15, 2023

kprokofi commented May 16, 2023 •

edited

JihwanEom commented May 17, 2023

kprokofi commented May 17, 2023 •

edited

JihwanEom commented May 17, 2023 •

edited

kprokofi commented May 17, 2023 •

edited

kprokofi commented May 17, 2023

supersoob left a comment

supersoob commented May 18, 2023

eunwoosh commented May 22, 2023

kprokofi commented May 22, 2023

kprokofi commented May 23, 2023

JihwanEom commented May 24, 2023

jaegukhyun commented May 24, 2023

kprokofi commented May 24, 2023

Semi-SL Semantic Segmentation. Prototype View. #2156

Semi-SL Semantic Segmentation. Prototype View. #2156

Conversation

kprokofi commented May 15, 2023

Summary

How to test

Checklist

License

kprokofi commented May 16, 2023 • edited

JihwanEom commented May 17, 2023

kprokofi commented May 17, 2023 • edited

JihwanEom commented May 17, 2023 • edited

kprokofi commented May 17, 2023 • edited

kprokofi commented May 17, 2023

supersoob left a comment

Choose a reason for hiding this comment

supersoob commented May 18, 2023

eunwoosh commented May 22, 2023

kprokofi commented May 22, 2023

kprokofi commented May 23, 2023

JihwanEom commented May 24, 2023

jaegukhyun commented May 24, 2023

kprokofi commented May 24, 2023

kprokofi commented May 16, 2023 •

edited

kprokofi commented May 17, 2023 •

edited

JihwanEom commented May 17, 2023 •

edited

kprokofi commented May 17, 2023 •

edited