
Training a ControlNet to generate furnished room -> empty room (and vice versa). Improvement plateau... #659

Open
whydna opened this issue Mar 15, 2024 · 4 comments

Comments


whydna commented Mar 15, 2024

I'm working on a project to take images of furnished rooms and remove all the furniture. I've got a large dataset of image pairs. I'm not applying any preprocessing to the images, so the model can preserve details of the original image (wall color, floor material, etc.).

After training on a 4090 for about 5 days, I'm no longer seeing any improvement (see examples below).
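For context, this is roughly the kind of launch command I mean, assuming the diffusers `train_controlnet.py` example script is used; the model name, dataset path, column names, and hyperparameters below are placeholders, not my exact settings:

```shell
# Hypothetical launch command for the diffusers ControlNet example script;
# paths, column names, and hyperparameters are placeholders to adapt.
accelerate launch train_controlnet.py \
  --pretrained_model_name_or_path="runwayml/stable-diffusion-v1-5" \
  --output_dir="controlnet-room-emptier" \
  --train_data_dir="data/room_pairs" \
  --image_column="empty_room" \
  --conditioning_image_column="furnished_room" \
  --caption_column="caption" \
  --resolution=512 \
  --learning_rate=1e-5 \
  --train_batch_size=4 \
  --gradient_accumulation_steps=4 \
  --mixed_precision="fp16" \
  --max_train_steps=50000
```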

I'm looking for tips on where to go from here.

  • Does it just need to be trained longer?
  • Do I need to adjust the learning rate?
  • Should I spend more time cleaning the dataset? (A small % of the pairs are probably bad; as you can see in one of the examples below, the target image is dark.)
  • Should I preprocess the images to simplify the task (e.g., MLSD)? It would lose the details of the original, but might at least produce better final output.
  • Perhaps ControlNet isn't the right architecture for this, and I should use pix2pix instead?

Thanks for the help!
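On the dataset-cleaning point, here's a minimal sketch of how I could flag pairs with dark targets before training; the brightness threshold is a guess I'd tune against my own data:

```python
# Sketch: drop pairs whose target image is abnormally dark.
# Assumes images load as RGB uint8 arrays; threshold=40 is a placeholder.
import numpy as np

def is_too_dark(image: np.ndarray, threshold: float = 40.0) -> bool:
    """Return True if the image's mean brightness is below threshold."""
    return float(image.mean()) < threshold

def filter_pairs(pairs):
    """Keep only (source, target) pairs whose target isn't too dark."""
    return [(src, tgt) for src, tgt in pairs if not is_too_dark(tgt)]
```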

Example 1

Source:
[Screenshot 2024-03-15 at 10 18 12 AM]

Target:
[Screenshot 2024-03-15 at 10 21 32 AM]

Model Result:
[Screenshot 2024-03-15 at 10 17 59 AM]

Example 2

Source:
[Screenshot 2024-03-15 at 10 20 30 AM]

Target:
[Screenshot 2024-03-15 at 10 21 54 AM]

Model Result:
[Screenshot 2024-03-15 at 10 20 41 AM]

First Training Run

[Screenshot 2024-03-15 at 10 29 44 AM]

Second Training Run

[Screenshot 2024-03-15 at 10 30 41 AM]
@dereksun105

How large was your dataset?

@innat-asj

ControlNet is not the right architecture for this; instead, play around with inpainting methods.


whydna commented Jul 14, 2024

@innat-asj Can you elaborate a bit? Ty!


innat-asj commented Jul 14, 2024

This is only my understanding of the architecture. A ControlNet isn't required for removal operations, because there is nothing to control. Instead, there are a few options we could try for removing the objects.

I tried MI-GAN out of the box with the provided checkpoint, and it's promising; if it could be trained for this specific task, the results would be even better. I also tried LaMa and MAT, but I found MI-GAN better in terms of simplicity and performance.

Lastly, reversing this process (empty room to furnished room) won't work with inpainting alone; it requires additional conditioning, and in that case a ControlNet would be needed.
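Since these inpainting models need a mask of the region to remove, and the dataset already has furnished/empty pairs, one rough option is to derive masks from the per-pixel difference. This is just a sketch; the threshold is arbitrary, and in practice you'd likely want to dilate or smooth the mask:

```python
# Sketch: derive a furniture mask from a furnished/empty image pair.
# Assumes both images are aligned RGB uint8 arrays of the same shape;
# threshold=30 is a placeholder to tune.
import numpy as np

def furniture_mask(furnished: np.ndarray, empty: np.ndarray,
                   threshold: int = 30) -> np.ndarray:
    """Return a uint8 mask: 255 where the pair differs beyond threshold."""
    diff = np.abs(furnished.astype(np.int16) - empty.astype(np.int16)).max(axis=-1)
    return np.where(diff > threshold, 255, 0).astype(np.uint8)
```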
