
SOC problem #6

Closed · DavidKong96 opened this issue Dec 8, 2020 · 14 comments

DavidKong96 commented Dec 8, 2020

Thanks for sharing. Nice work!
Here is a question about SOC in your paper.
The self-supervised stage is applied to a new-domain dataset, so is the new (target) dataset the one we will test on later?

Another question: when I try to train MODNet, the predicted d_p is just the boundary, which is not the same as in your paper.

ZHKKKe (Owner) commented Dec 9, 2020

Hi, thanks for your attention!

For your questions:
Q1: is the new (target) dataset the one we will test on later?
The new-domain dataset should be split into a training subset S_t and a validation subset S_v. The SOC self-supervised strategy fine-tunes the model on S_t and tests it on S_v.
For example, if a model is fine-tuned on the data from our WebCam (S_t), users can test this model with the data captured by their own WebCam (S_v).
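
(As a concrete illustration, a split like the following sketch would do; the path, glob pattern, and 90/10 ratio are my assumptions, not from the paper.)

import glob

# Hypothetical S_t / S_v split of a new-domain dataset (illustrative only)
frames = sorted(glob.glob('new_domain/*.jpg'))
split = int(0.9 * len(frames))
S_t, S_v = frames[:split], frames[split:]  # fine-tune on S_t, test on S_v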

Q2: the predicted d_p is just the boundary, which is not the same as in your paper
It is difficult for me to point out the problem from these two images alone. Can you share one of your training samples (including I, alpha_g, and m_d from Fig. 2 of the paper) with me? I can try to help you find the problem. Thanks.

DavidKong96 (Author) commented:

Thanks a lot for your reply. I have found the reason for Q2: my dilate/erode kernel size was probably too small, which produced the wrong result. Could you please tell me the size of your dilate/erode kernel?
Another question: when we run the self-supervised stage on video, is it still based on single images? Is any temporal information used?
Thank you very much!

ZHKKKe (Owner) commented Dec 9, 2020

Q1: could you please tell me the size of your dilate/erode kernel?
I use scipy.ndimage.grey_dilation and scipy.ndimage.grey_erosion for dilation and erosion.
For an image with a short side of 512:

  • In the supervised training stage, m_d is generated with a random kernel size in (5, 10).
  • In the SOC self-supervised stage, m_d is generated with a kernel size of 30.

In fact, the m_d in Figure 2 of the paper is a good example; please set your parameters according to it.
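
(For reference, a minimal sketch of how such a boundary mask m_d could be built with these scipy calls; the thresholding rule is my assumption, not the paper's or the released code's.)

import numpy as np
from scipy.ndimage import grey_dilation, grey_erosion

def make_boundary_mask(matte, kernel_size):
    # matte: float alpha map in [0, 1]; kernel_size: random in (5, 10) for
    # supervised training, 30 for SOC, per the numbers above
    dilated = grey_dilation(matte, size=(kernel_size, kernel_size))
    eroded = grey_erosion(matte, size=(kernel_size, kernel_size))
    # assumed: m_d marks the transition band where dilation and erosion
    # disagree; the 0.1 threshold is an illustrative choice
    return ((dilated - eroded) > 0.1).astype(np.float32)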

Q2: is it still based on single images? Is any temporal information used?
It is still based on single images. In this work, we did not consider temporal information.

DavidKong96 (Author) commented:

Thank you very much. It helps me a lot.

ZHKKKe closed this as completed Dec 11, 2020
TsykunovDmitriy commented Dec 23, 2020

Hello! Thanks for your interesting work.
I am trying to reproduce the SOC training. The matte predictions start converging to pred_detail pretty quickly. This behavior seems quite logical. Have you encountered this? If so, how did you deal with it?

(before / after images attached)

ZHKKKe (Owner) commented Dec 24, 2020

@TsykunovDmitriy
Hi, thanks for your attention!
Based on your results, I think you forgot to use \tilde{m}_d (Eq. 7 and 8 in our paper) to apply the detail-consistency losses (Eq. 8 and the second term of Eq. 7) only to the boundary regions. Please check it.

TsykunovDmitriy commented Dec 24, 2020

Thanks for the answer. Below is the pseudocode for my implementation of equations 7 and 8, which I use in my training pipeline. Tell me where I'm wrong.

# predictions from the trainable model and its frozen copy
pred_semantic, pred_detail, pred_matte = model(image)
pred_semantic_fz, pred_detail_fz, pred_matte_fz = model_freeze(image)

# masks derived from the current matte prediction
de_mask = get_dilate_erode_mask(pred_matte.numpy())
seg_mask = get_segmentation_mask(pred_matte.numpy())

# equation 7
Ls = 0.5 * sqrt((pred_semantic - seg_mask) ** 2).mean()
Ld = (abs(pred_detail - pred_matte) * de_mask).sum() / de_mask.sum()  # same as in supervised training
Lcons = Ls + Ld

# equation 8
Ldd = (abs(pred_detail - pred_detail_fz) * de_mask).sum() / de_mask.sum()

loss = Lcons + Ldd
model.update_weights(loss)

If possible, please answer a few more questions. How much data did you use for fine-tuning? How many epochs? What does "simultaneously" mean in the paper?

ZHKKKe (Owner) commented Dec 24, 2020

Q1: How much data did you use for fine-tuning?
We use 400 video clips consisting of 50k frames.

Q2: How many epochs?
About 10 epochs.

Q3: What does "simultaneously" mean in the paper?
"simultaneously" means loss = Lcons + Ldd. You are correct.

However, you should split Ls into two terms:

Ls = 0.5 * sqrt((pred_semantic - seg_mask.detach()) ** 2).mean() + 0.5 * sqrt((pred_semantic.detach() - seg_mask) ** 2).mean()

The gradients should flow back through both branches at the same time (make sure the gradient can be back-propagated through seg_mask).
Of course, you also need to do the same thing for Ld; see the sketch below.
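
(A minimal sketch of that split applied to Ld, following my reading of the advice above rather than the authors' code:)

# both the detail and matte branches receive gradients
Ld = 0.5 * ((abs(pred_detail - pred_matte.detach()) * de_mask).sum() / de_mask.sum()) \
   + 0.5 * ((abs(pred_detail.detach() - pred_matte) * de_mask).sum() / de_mask.sum())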

Besides, could you visualize a sample with de_mask and seg_mask? They are important for finding the problem.

ZHKKKe reopened this Dec 24, 2020
TsykunovDmitriy commented:

Thanks a lot for the answers.
I thought G(*) was a non-differentiable function. I will run a few experiments, and if the problem is not solved, I will share additional visualizations.

TsykunovDmitriy commented Dec 25, 2020

I did some experiments. Unfortunately, the result has not improved: the pred_matte predictions still converge pretty quickly to pred_detail. I have visualized some examples of de_mask and seg_mask.

(two screenshots of de_mask and seg_mask attached)

I have a guess that I have incorrectly formulated equation 5 from the paper. Could you please comment on the Ld from here?

ZHKKKe (Owner) commented Dec 25, 2020

@TsykunovDmitriy
I think your de_mask and seg_mask are correct. I cannot find the problem from your code snippet.

Maybe you can try the following code for calculating Ld:

Ld = (abs(pred_detail - pred_detail_fz.detach()) * de_mask + abs(pred_matte - pred_matte_fz.detach()) * de_mask).sum() / de_mask.sum()

In this way, you no longer need Ldd, i.e., loss = Lcons: with the frozen predictions detached, this single masked term anchors both pred_detail and pred_matte to the frozen model's outputs inside the boundary region, so it covers both boundary terms at once.

This is our old implementation. It also worked well in our case.

TsykunovDmitriy commented:
Thanks for your advice! Unfortunately, I did not obtain satisfactory results. I think this is due to the small amount of data used when training the model; perhaps my model does not generalize well enough. I think you can close the discussion.

ZHKKKe (Owner) commented Jan 13, 2021

@TsykunovDmitriy
Hi. Our main training code will be released within the next two weeks. I hope it helps you once it is released.

ZHKKKe (Owner) commented Jan 28, 2021

Hi all,

Our main code for SOC adaptation is available now.

ZHKKKe closed this as completed Jan 28, 2021