
How to improve image matting accuracy #24

Closed
zzmao opened this issue Dec 15, 2020 · 20 comments
zzmao commented Dec 15, 2020

Hi,

Thanks for this great project. I tried your Colab demo for image matting, and the boundary is not sharp enough for some inputs (including the example in the GitHub README).

Is there any way to improve image matting accuracy?

ZHKKKe (Owner) commented Dec 15, 2020

Hi, thanks for your attention.

Can you share some failure cases with me?
If the foreground color is very similar to the background, blurred boundaries are expected.
For the image in the GitHub README, since we converted it to GIF format, I think its quality is too low to get good results.

mosvlad commented Dec 18, 2020

A failure case: the standard Lena test image (screenshot attached).

ZHKKKe (Owner) commented Dec 19, 2020

@mosvlad
I am sorry, but such a wrong output is expected from our model due to (1) the similar foreground and background colors; (2) our limited training data.

zzmao (Author) commented Dec 19, 2020

Thanks for replying, @ZHKKKe.
It looks like the project targets video matting. Is there anything we can do to optimize image (portrait) matting? (I volunteer to work on this.)

ZHKKKe (Owner) commented Dec 20, 2020

@zzmao
The main problem of our current model is its relatively poor performance in portrait semantic estimation.
I think one possible solution is to improve the performance of the backbone model, i.e., the MobileNetV2, in MODNet.

alan-ai-learner commented:

@ZHKKKe What should be the approach to improve the performance of the backbone model, i.e., the MobileNetV2?
Also, can you tell me what type of data you used for training, and how to train on our own dataset?

ZHKKKe (Owner) commented Dec 22, 2020

@alan-ai-learner
Q1: What should be the approach to improve the performance of the backbone model, i.e., the MobileNetV2?
You can replace the MobileNetV2 with a more powerful model, e.g., DeepLabV3+. Besides, you may need more labeled training data. You may be interested in the large labeled dataset that will be released soon by BackgroundMattingV2.

Q2: can you tell me what type of data you used for training?
Each of our supervised training samples is a pair of (RGB image, labeled matte). The unlabeled samples used in our SOC adaptation are the RGB images.

Q3: Can you please tell me the approach to train on our own data set?
Our training code will be released next month. The code will contain a template for implementing the new dataloader. It allows you to train on your own datasets. We will also provide a guideline on how to do this.
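Until the training code and dataloader template mentioned above are released, a supervised matting dataset along the lines described (each sample a pair of RGB image and labeled matte) might be sketched as follows. The class name, normalization, and layout here are illustrative assumptions, not MODNet's actual API:

```python
import numpy as np

class MattingDataset:
    """Minimal sketch of a supervised matting dataset: each sample is a
    pair (RGB image, alpha matte). Hypothetical, not MODNet's loader."""

    def __init__(self, images, mattes):
        assert len(images) == len(mattes)
        self.images = images  # list of HxWx3 uint8 arrays
        self.mattes = mattes  # list of HxW float arrays in [0, 1]

    def __len__(self):
        return len(self.images)

    def __getitem__(self, idx):
        image = self.images[idx].astype(np.float32) / 127.5 - 1.0  # scale to [-1, 1]
        matte = self.mattes[idx].astype(np.float32)[None, ...]     # add channel dim
        return image.transpose(2, 0, 1), matte                     # CHW layout

# Synthetic example: one 8x8 image with a centered square foreground
rgb = np.zeros((8, 8, 3), dtype=np.uint8)
alpha = np.zeros((8, 8), dtype=np.float32)
alpha[2:6, 2:6] = 1.0
ds = MattingDataset([rgb], [alpha])
img, matte = ds[0]
print(img.shape, matte.shape)  # (3, 8, 8) (1, 8, 8)
```

In practice the in-memory lists would be replaced by on-disk image/matte paths, and the class would subclass the training framework's dataset type.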

alan-ai-learner commented:

Thanks, but BackgroundMattingV2 uses a different approach: it requires two images, one with the subject and one without the subject (background only), while MODNet takes only one image.
So how can we utilize their dataset?

ZHKKKe (Owner) commented Dec 22, 2020

@alan-ai-learner
Yes. I think their dataset consists only of labeled foregrounds. They use images from other datasets, like COCO, to composite the training samples. Therefore, their dataset can be used to train MODNet (you only need to input the composited images for training, i.e., you do not need to input the separate background images).
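The compositing step described here is standard alpha blending, I = αF + (1 − α)B. A minimal numpy sketch (the foreground/background arrays below are synthetic stand-ins for labeled foregrounds and COCO-style backgrounds):

```python
import numpy as np

def composite(foreground, alpha, background):
    """Alpha-composite a labeled foreground over a new background:
    I = alpha * F + (1 - alpha) * B. All arrays are float32 in [0, 1]."""
    alpha = alpha[..., None]  # HxW -> HxWx1, broadcasts over RGB channels
    return alpha * foreground + (1.0 - alpha) * background

# Synthetic stand-ins: white foreground, black background, uniform alpha
fg = np.ones((4, 4, 3), dtype=np.float32)
bg = np.zeros((4, 4, 3), dtype=np.float32)
alpha = np.full((4, 4), 0.25, dtype=np.float32)
img = composite(fg, alpha, bg)
print(img[0, 0])  # [0.25 0.25 0.25]
```

The composited image is the training input; the alpha used for compositing doubles as the ground-truth matte.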

newjavaer commented:

@ZHKKKe Do you think there will be an improvement in accuracy by combining supervised training with the estimation of foreground color?

alan-ai-learner commented:

@ZHKKKe Got it.

ZHKKKe (Owner) commented Dec 25, 2020

@newjavaer
I think that might help, but I'm not sure.
The lack of labeled data is a more crucial problem.
The current version of MODNet mostly fails at portrait semantic estimation rather than at detail prediction.

newjavaer commented:

@ZHKKKe Why do you think there are more errors in semantic estimation, apart from the lack of data?

QuantumLiu commented:

Many errors are caused by recognizing clothing as part of the person. Maybe this can be improved by adding a penalty during training. I have collected some person matting data from several semantic segmentation datasets and the new CelebAMask-HQ dataset; it may improve the results.

newjavaer commented Dec 28, 2020

@QuantumLiu By adding some background images as negative samples to the training?

ZHKKKe (Owner) commented Dec 28, 2020

@newjavaer
Semantic estimation is a high-level vision task. It is much more difficult than detail prediction (low-level vision).

ZHKKKe (Owner) commented Dec 28, 2020

@QuantumLiu
Yes. If we use the data from semantic segmentation datasets to train the Low-Resolution Branch of MODNet, we should get more stable results.

The solution proposed by @newjavaer is also great. We did not consider negative samples during training since that is an engineering problem.
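As a sketch of the negative-sample idea discussed above: a background-only image would simply be paired with an all-zero matte, so any predicted foreground in it is penalized. This is an illustrative assumption about how such samples could be added, not the authors' training recipe:

```python
import numpy as np

def make_negative_sample(background_image):
    """Pair a background-only (person-free) image with an all-zero alpha
    matte. Illustrative assumption, not MODNet's actual training recipe."""
    h, w = background_image.shape[:2]
    matte = np.zeros((h, w), dtype=np.float32)  # no foreground anywhere
    return background_image, matte

# Any person-free photo would work; here a random array stands in for one
bg = np.random.rand(6, 6, 3).astype(np.float32)
img, matte = make_negative_sample(bg)
print(matte.sum())  # 0.0
```

Mixing a fraction of such pairs into each training batch is the usual way negative samples are applied.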

ZHKKKe (Owner) commented Jan 28, 2021

Please feel free to reopen this issue if you have further questions.

ZHKKKe closed this as completed Jan 28, 2021
syfbme commented Nov 3, 2021

Has anyone tried replacing the Low-Resolution Branch backbone? How are the results?
@QuantumLiu @mosvlad @zzmao
cc @ZHKKKe

ZHKKKe (Owner) commented Nov 29, 2021

@syfbme The performance may be further improved. Please refer to https://github.com/PaddlePaddle/PaddleSeg/tree/release/2.3/contrib/Matting
