Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Questions about OFD strategy #68

Closed
Carringbrinks opened this issue Feb 19, 2021 · 23 comments
Closed

Questions about OFD strategy #68

Carringbrinks opened this issue Feb 19, 2021 · 23 comments

Comments

@Carringbrinks
Copy link

Hello
Thanks for contributing to MODNet project.
I want to know if the code about OFD strategy is in the inference file?
How to apply real-time inference about OFD strategy?

@ZHKKKe
Copy link
Owner

ZHKKKe commented Feb 20, 2021

Hi, thanks for your attention.
The code of OFD is not included now, but it will coming soon. :)

@xafha
Copy link

xafha commented Mar 5, 2021

Hi, thanks for your attention.
The code of OFD is not included now, but it will coming soon. :)

请加速啊,谢谢。

@luoww1992
Copy link

+1

@Carringbrinks
Copy link
Author

Carringbrinks commented Apr 1, 2021 via email

@ZHKKKe
Copy link
Owner

ZHKKKe commented Apr 1, 2021

@Carringbrinks
The first step is enlarge your background set... We use more than 500k backgrounds for composition (Place-365 dataset).

@Carringbrinks
Copy link
Author

Carringbrinks commented Apr 1, 2021 via email

@ZHKKKe
Copy link
Owner

ZHKKKe commented Apr 1, 2021

@Carringbrinks
We use only about 3k labeled foreground. We composite 100k training samples. For each sample, we use a background randomly selected from 500k backgrounds.

@JerryDeepl
Copy link

@Carringbrinks
The first step is enlarge your background set... We use more than 500k backgrounds for composition (Place-365 dataset).

Hello, could you share how to pick up the background images from Place-365 dataset ? thanks

@ZHKKKe
Copy link
Owner

ZHKKKe commented Apr 14, 2021

@JerryDeepl
We use a face detection model to remove all images included persons.

@JerryDeepl
Copy link

@ZHKKKe
Got it. Thanks

@Carringbrinks
Copy link
Author

Carringbrinks commented Apr 19, 2021 via email

@ZHKKKe
Copy link
Owner

ZHKKKe commented Apr 19, 2021

@Carringbrinks
For the methods without official code, we reproduced them.

@Carringbrinks
Copy link
Author

Carringbrinks commented Apr 20, 2021 via email

@ZHKKKe
Copy link
Owner

ZHKKKe commented Apr 22, 2021

@Carringbrinks
I am sorry that I do not have permission to share the pre-trained models as these models are trained on the private dataset.
(These models are not constrained by any license, so others may use them for any purposes).

@Carringbrinks
Copy link
Author

Carringbrinks commented Apr 22, 2021 via email

@ZHKKKe
Copy link
Owner

ZHKKKe commented Apr 22, 2021

@Carringbrinks
We reproduced the model structures and trained them ourselves.

@Carringbrinks
Copy link
Author

Carringbrinks commented Apr 22, 2021 via email

@Carringbrinks
Copy link
Author

Carringbrinks commented Apr 29, 2021 via email

@ZHKKKe
Copy link
Owner

ZHKKKe commented Apr 29, 2021

@Carringbrinks
Based on my experience, there are several problems may cause NaN loss in SOC:

  1. The loss is too large, please reduce the soc loss scale.
  2. The target domain is too different from the source domain (If you can share some images of the two domains, I may give you some judgement).
  3. There are some bad samples in the unlabeled dataset, e.g., there are no portrait in the input image.

@Carringbrinks
Copy link
Author

Carringbrinks commented Apr 29, 2021 via email

@ZHKKKe
Copy link
Owner

ZHKKKe commented Apr 29, 2021

@Carringbrinks
Yes... It may will... The domain of the cartoon data is to far from the real-world data.

@Carringbrinks
Copy link
Author

Carringbrinks commented May 9, 2021 via email

@ZHKKKe
Copy link
Owner

ZHKKKe commented May 12, 2021

@Carringbrinks
Hi, for your questions:
Q1: What is wrong with my data set?
There is also an over-fitting problem in our model, which is why we need a SOC strategy.
Besides, have you calculated the foreground color before image composition? This step is vital for getting more realistic training samples. If the foreground color is not be calculated, the gap between the synthetic samples and the natural images will be larger.

Q2: Do you know where the problem is?
What's your input size?
The resolution of the image in OpenImage is usually small. Therefore, you must upsample them for image synthesis, which will result in blurred backgrounds. You can try to replace the background set with some open source image super-resolution data set.

@ZHKKKe ZHKKKe closed this as completed Jun 21, 2021
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

5 participants