Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Why I get killed? #41

Open
misi0202 opened this issue Jul 7, 2023 · 8 comments
Open

Why I get killed? #41

misi0202 opened this issue Jul 7, 2023 · 8 comments

Comments

@misi0202
Copy link

misi0202 commented Jul 7, 2023

I use CLI python main.py --train --base configs/stableSRNew/v2-finetune_text_T_512.yaml --gpus GPU_ID, --name NAME --scale_lr False
and i get killed
截屏2023-07-07 02 05 08
At first ,i reduce the batch_size and num_workers to 1,but it cant work
As a possibility,I use ckpthttps://huggingface.co/stabilityai/stable-diffusion-x4-upscaler
I am not sure about that,HELP!!!!!

@misi0202
Copy link
Author

misi0202 commented Jul 7, 2023

I use CLI python main.py --train --base configs/stableSRNew/v2-finetune_text_T_512.yaml --gpus GPU_ID, --name NAME --scale_lr False and i get killed 截屏2023-07-07 02 05 08 At first ,i reduce the batch_size and num_workers to 1,but it cant work As a possibility,I use ckpthttps://huggingface.co/stabilityai/stable-diffusion-x4-upscaler I am not sure about that,HELP!!!!!

I am back,it is not reachable in ckpthttps://huggingface.co/stabilityai/stable-diffusion-2-1-base,this confused about me.

@IceClear
Copy link
Owner

IceClear commented Jul 7, 2023

The most possible reason is that your cpu memory is not enough.

@misi0202
Copy link
Author

misi0202 commented Jul 7, 2023

@IceClear Thanks for your reply!!
maybe is this problem,but if i cant change me cpu memory,is there have alternative choice?

@IceClear
Copy link
Owner

IceClear commented Jul 7, 2023

Sry. Currently I have no ideas about which part leads to high memory cost. So no solution from my side.

@IceClear
Copy link
Owner

IceClear commented Jul 7, 2023

Maybe you can check if the model ckpt is loaded twice since in my current setting, the diffusion model and vqgan are initialized from the same large ckpt and I suspect maybe two copies of this large ckpt exist in cpu ram. Not sure about that.

@misi0202
Copy link
Author

misi0202 commented Jul 7, 2023

Maybe you can check if the model ckpt is loaded twice since in my current setting, the diffusion model and vqgan are initialized from the same large ckpt and I suspect maybe two copies of this large ckpt exist in cpu ram. Not sure about that.

Thanks for you patient reply! I will check about that soon!

@IceClear
Copy link
Owner

IceClear commented Jul 7, 2023

Maybe you can check if the model ckpt is loaded twice since in my current setting, the diffusion model and vqgan are initialized from the same large ckpt and I suspect maybe two copies of this large ckpt exist in cpu ram. Not sure about that.

Thanks for you patient reply! I will check about that soon!

Looking forward to your reply.

@misi0202
Copy link
Author

Maybe you can check if the model ckpt is loaded twice since in my current setting, the diffusion model and vqgan are initialized from the same large ckpt and I suspect maybe two copies of this large ckpt exist in cpu ram. Not sure about that.

Thanks for you patient reply! I will check about that soon!

Looking forward to your reply.

SAD,it doesn't work,but thanks to your contribution for this issue,hope I can use it in the future! Have a nice Day!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants