
Influence: Amount of training data and data augmentation via Detectron2 #71

Closed
Alexander-Tack opened this issue Dec 22, 2021 · 3 comments


@Alexander-Tack

Hi,

Thank you for sharing LaMa! The inpainting quality is really impressive!

I was wondering:

  1. Comparing "LaMa-Fourier" with "Big LaMa-Fourier": How much did the larger training data (4.5M images from the Places-Challenge dataset) contribute to the improved quality of Big LaMa-Fourier? Do you think that similar results could have also been achieved for Big LaMa-Fourier with less data?

  2. You have proposed a sophisticated approach for data augmentation. How much did the training and the inference benefit from data augmentation using segmentation masks from Detectron2?

Best wishes,
Alex

@windj007
Collaborator

Hi! Sorry for the late reply!

> How much did the larger training data (4.5M images from the Places-Challenge dataset) contribute to the improved quality of Big LaMa-Fourier?

Larger data helps, but significantly less than a larger model and other training tricks (SegmPL, large masks).

> Do you think that similar results could have also been achieved for Big LaMa-Fourier with less data?

Less data means lower quality, but not dramatically so; reducing the model size, removing SegmPL, or using smaller training masks would hurt more.

> How much did the training and the inference benefit from data augmentation using segmentation masks from Detectron2?

We do not use segmentation masks from Detectron2 for training. We tried it at the very beginning of the project, but ran into technical issues (slowness, GPU memory consumption, a CUDA re-initialization limitation), so we could not use segmentation-based mask generation effectively during training. The code is there only because we forgot to remove it when preparing the public release. Please note that `segm_proba: 0` in all the data configs.
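
For reference, here is a minimal sketch (assumed names and logic, not LaMa's actual implementation) of how a `segm_proba`-gated mask sampler behaves. With `segm_proba: 0`, as in all the released data configs, the segmentation branch is unreachable, so the slow, memory-hungry Detectron2 path is never hit during training:

```python
import random
import numpy as np

def sample_mask(img, irregular_gen, segm_gen, segm_proba=0.0):
    """Pick a mask generator; segm_proba gates the segmentation branch."""
    # segm_proba comes from the data config; the released configs set it to 0,
    # so segm_gen is never called and Detectron2 never loads during training.
    if random.random() < segm_proba:
        return segm_gen(img)   # segmentation-based masks (unused in practice)
    return irregular_gen(img)  # random irregular/box masks (the real path)

# Dummy generators for illustration only
irregular = lambda img: (np.random.rand(*img.shape[:2]) > 0.9).astype(np.float32)
segm = lambda img: np.ones(img.shape[:2], dtype=np.float32)

img = np.zeros((256, 256, 3), dtype=np.float32)
mask = sample_mask(img, irregular, segm, segm_proba=0.0)  # always irregular
```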

@windj007
Collaborator

In our experience, mask widths matter much more than mask shapes.
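
To make the width-versus-shape distinction concrete, here is a minimal sketch of a width-controlled random-stroke mask generator. The function name and parameters are illustrative, not LaMa's actual code; sweeping `min_width`/`max_width` is the knob this comment suggests matters most:

```python
import numpy as np
import cv2

def random_stroke_mask(h, w, min_width=20, max_width=70, n_strokes=4):
    """Draw random polyline strokes; min_width/max_width set the brush size."""
    mask = np.zeros((h, w), dtype=np.float32)
    for _ in range(n_strokes):
        x, y = np.random.randint(0, w), np.random.randint(0, h)
        width = np.random.randint(min_width, max_width + 1)
        for _ in range(np.random.randint(1, 6)):  # a few connected segments
            angle = np.random.uniform(0, 2 * np.pi)
            length = np.random.randint(h // 8, h // 2)
            nx = int(np.clip(x + length * np.cos(angle), 0, w - 1))
            ny = int(np.clip(y + length * np.sin(angle), 0, h - 1))
            cv2.line(mask, (x, y), (nx, ny), 1.0, thickness=width)
            x, y = nx, ny
    return mask

# Per the comment above, the width setting should influence final inpainting
# quality more than the particular stroke shapes do.
wide = random_stroke_mask(256, 256, min_width=40, max_width=80)
narrow = random_stroke_mask(256, 256, min_width=5, max_width=15)
```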

@Alexander-Tack
Author

Hi Roman,
awesome!! Thank you for sharing your experience! This really helps a lot and confirms my preliminary findings :)
