Small models for 11GB GPUs #26

Closed
justanhduc opened this issue Nov 30, 2021 · 7 comments

@justanhduc

Hi. Thanks for open-sourcing this amazing project. I am trying to train the network, but I get an OOM error since I don't have a 16GB GPU. Could you please let me know which smaller models I can try on an 11GB GPU? Thanks so much!

@rinongal
Owner

Hey!

If you want to decrease memory use, the following are all viable options:

  1. Disable the layer-freezing module by setting auto_layer_iters to 0. If you're only doing texture-based changes, you probably don't need to freeze layers, and this can save you a good chunk of memory.
  2. Use a lower-resolution model (FFHQ 256, LSUN Church, etc.).
  3. Only use one of the two CLIP models (ViT-B/32 is better for global textures; ViT-B/16 is a bit better for local textures and shapes).
  4. Decrease n_sample (number of output images during training).

If you just want to play with the model and don't want to do things like dogs to cats, I'd start with options (1) and (4) since they might be enough. We managed to train an FFHQ 1024x1024 model on a 1080 Ti, so 11GB should probably be doable.
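
As a concrete starting point, here's a minimal sketch of a launch combining (1), (3) and (4). The flag names are assumptions based on the list above, so verify them against train.py before running:

```python
# Minimal sketch of a low-memory training launch. The flag names below are
# assumptions; check train.py in this repo for the actual argument names.
import subprocess

subprocess.run([
    "python", "train.py",
    "--auto_layer_iters", "0",    # option (1): disable the layer-freezing module
    "--clip_models", "ViT-B/32",  # option (3): keep only the global-texture CLIP model
    "--n_sample", "4",            # option (4): fewer output images during training
], check=True)
```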

@justanhduc
Author

Hi @rinongal. Thanks for your tips. Indeed, (1) alone already saved a lot of memory and made training fit on a single 11GB GPU. However, the output quality seems not as good as the original version, which I checked via Colab. (3) didn't make much difference as far as I observed, and (4) alone couldn't make training fit either. I guess (2) would then be the most suitable option if I want to keep the same translation quality, am I correct?

@rinongal
Owner

rinongal commented Dec 1, 2021

You could try combining (1) with a lower learning rate and more iterations. Some previous issues reported better results from reduced learning rates when training with style-image targets; it might help in your case as well.
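
For example (a rough sketch; the flag names and baseline values here are assumptions, not the repo's defaults):

```python
# Rough sketch, assuming flags along these lines exist in train.py:
# halve the learning rate and double the iterations relative to your baseline.
base_lr = 0.002     # placeholder for your current learning rate
base_iters = 300    # placeholder for your current iteration count

tuned_flags = [
    "--lr", str(base_lr / 2),        # lower lr for more stable style updates
    "--iter", str(base_iters * 2),   # more steps to compensate for smaller updates
    "--auto_layer_iters", "0",       # keep option (1) so training fits in 11GB
]
print(" ".join(tuned_flags))
```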

Other than that, I'm afraid (2) might be your best option for reducing memory requirements.

@rinongal
Owner

rinongal commented Dec 1, 2021

What options did you run in the Colab, btw? The layer freezing isn't enabled there by default (it's only turned on if you click on improve shape).

@justanhduc
Author

> What options did you run in the Colab, btw? The layer freezing isn't enabled there by default (it's only turned on if you click on improve shape).

Oops, the results I used as a reference weren't with improve shape. I thought improve shape would only enable mixing noise. So is the config without improve shape in Colab exactly equivalent to (1)?

@rinongal
Owner

rinongal commented Dec 1, 2021

The config without improve shape in Colab is (1), plus only ViT-B/32 (so (3)), and no mixing.
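
In flag form, that would look roughly like this (names are assumptions again; check train.py):

```python
# Hedged sketch of the Colab default (no "improve shape"): layer freezing
# disabled, a single ViT-B/32 CLIP model, and no mixing. Flag names are
# assumptions based on this thread, not verified against train.py.
colab_default_flags = [
    "--auto_layer_iters", "0",    # (1): layer freezing off
    "--clip_models", "ViT-B/32",  # (3): single CLIP model
    "--mixing", "0.0",            # no style mixing
]
```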

@rinongal
Owner

Closing due to lack of activity. Feel free to re-open if you need additional help.
