training stuck at scale 9:[1999/2000] #19

Closed
phonygene opened this issue Nov 4, 2019 · 8 comments

Comments

@phonygene

phonygene commented Nov 4, 2019

Training consistently gets stuck at [1999/2000],
e.g. "scale 7:[1999/2000]" or "scale 9:[1999/2000]".

I can't interrupt it even with Ctrl+C; the process is completely dead.

I used a mountain picture, resized to the same size as one of your sample images.

I'm using:
Python 3.6.8
torch 1.3.0

GPU: RTX 2080 Ti
NVIDIA driver 419.35
CUDA 10.1

@sno6

sno6 commented Nov 4, 2019

Having the same issue running on Google Colab: seems to stall out at scale 8:[1999/2000]

@tamarott
Owner

tamarott commented Nov 5, 2019

This seems to be a memory problem. When the number of scales is large, there are more model parameters to store.
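
One way to check this (a minimal sketch, not code from this repository) is to print PyTorch's allocator statistics once per scale and watch them approach the card's capacity:

```python
# Minimal GPU-memory probe (not part of SinGAN); call it e.g. right after each
# scale finishes training to see how close you are to the memory limit.
import torch

def report_gpu_memory(tag: str, device: int = 0) -> None:
    if not torch.cuda.is_available():
        return
    mib = 1024 ** 2
    allocated = torch.cuda.memory_allocated(device) / mib      # tensors currently alive
    peak = torch.cuda.max_memory_allocated(device) / mib       # high-water mark so far
    total = torch.cuda.get_device_properties(device).total_memory / mib
    print(f"[{tag}] allocated {allocated:.0f} MiB, peak {peak:.0f} MiB, total {total:.0f} MiB")
```

If the peak climbs toward the card's capacity (11 GB on a 2080 Ti) as more scales are added, that would confirm the diagnosis.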

@phonygene phonygene reopened this Nov 6, 2019
@phonygene
Author

Sorry, I fat-fingered it (clicked the Close button accidentally).

I've checked GPU memory usage during training.
It was indeed nearly full when training got stuck.

@tamarott
Do you have any suggestions?
Is it possible to reduce the batch size or something similar to avoid this?
Or to restart training from the last checkpoint?

I read your paper. There's an example with The Starry Night
that seems to work well at scale 8.
But when I tried random samples at scale 8, it just generated 50 images that are all identical to each other.

@JonathanFly

JonathanFly commented Nov 6, 2019

With 16GB of GPU memory, the highest resolution output I have achieved is 667 x 413 from the main training script. Does that seem right? Would changing the aspect ratio let me squeeze more pixels into the model so I can also get more in the final random samples?
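
For what it's worth, here's the rough scale count I'd expect for different output sizes. This is a back-of-the-envelope sketch assuming the defaults I see in config.py are --min_size 25 and --scale_factor 0.75 (please double-check); the actual code adjusts the scale factor slightly, so treat it as an estimate only.

```python
# Rough estimate of how many scales a given output size implies, and therefore
# how many generator/discriminator pairs must be kept on the GPU.
# Assumes min_size=25 and scale_factor=0.75 (config.py defaults, I believe).
import math

def approx_num_scales(longest_side: int, min_size: int = 25, scale_factor: float = 0.75) -> int:
    # The image pyramid shrinks the longest side by scale_factor per level
    # until it reaches min_size, so the level count grows logarithmically.
    return math.ceil(math.log(min_size / longest_side, scale_factor)) + 1

for side in (250, 413, 667, 1024):
    print(f"longest side {side}px -> roughly {approx_num_scales(side)} scales")
```

Each extra scale means another generator/discriminator pair (plus its activations) resident in memory, so the reachable resolution is bounded by GPU memory rather than by anything in the code itself.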

@phonygene
Author

phonygene commented Nov 6, 2019

Oh, it turns out that scale 0 works just fine.
And as the scale increases, the differences between the output images drop sharply.
The images generated from scale 1 show only a slight shift effect, and the images generated from scale 3 are almost identical.
So, at this rate, there's no need to train over scale 3 at all.

This is pretty amazing.
Thanks for sharing your elegant work.
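
My guess at what's happening (I'm assuming the --gen_start_scale option from the README is what controls this): scales below the chosen start scale reuse the fixed reconstruction noise, so every sample shares the same coarse structure and only the finer scales can vary. A toy illustration, not SinGAN's actual code, with dummy "generators" that just add their noise input:

```python
# Toy sketch of generation from a chosen start scale: scales below
# gen_start_scale reuse a fixed noise map (same coarse structure for every
# sample); only scales >= gen_start_scale receive fresh random noise.
import torch

def toy_generate(generators, fixed_noise, gen_start_scale):
    x = torch.zeros_like(fixed_noise[0])
    for n, g in enumerate(generators):
        z = fixed_noise[n] if n < gen_start_scale else torch.randn_like(fixed_noise[n])
        x = g(x, z)
    return x

# Dummy stand-ins: each "generator" simply adds its noise map to the running image.
generators = [lambda x, z: x + z for _ in range(5)]
fixed_noise = [torch.randn(1, 3, 32, 32) for _ in range(5)]

a0, b0 = (toy_generate(generators, fixed_noise, 0) for _ in range(2))
a4, b4 = (toy_generate(generators, fixed_noise, 4) for _ in range(2))
print((a0 - b0).abs().mean())  # large: noise at every scale varies
print((a4 - b4).abs().mean())  # small: only the finest scale's noise varies
```

That would explain why samples started at a high scale look identical while scale 0 gives full diversity.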

@rickdotta

@phonygene What do you mean by "you don't need to train over scale 3"? Is it possible to generate arbitrarily sized images using just scale 3? How?

Thank you!

@phonygene
Author

phonygene commented Nov 8, 2019

@rickdotta As I said: in my case, anything above scale 3 only generated identical images, so I tried the scale 0 model and found that it worked fine. I don't understand why it behaves so differently from the paper, but at least it saves me a lot of time (troll face).

@xivh

xivh commented May 28, 2020

@phonygene How do you stop training at a smaller scale?
