LORA on 8GB Graphics Cards is broken #755

JasonEpic · 2023-01-11T03:10:57Z

Kindly read the entire form below and fill it out with the requested information.

Please find the following lines in the console and paste them below. If you do not provide this information, your
issue will be automatically closed.

`
Python revision: 3.10.6 (tags/v3.10.6:9c7b4bd, Aug 1 2022, 21:53:49) [MSC v.1932 64 bit (AMD64)]
Dreambooth revision: 5588089
SD-WebUI revision: 9cfd10cdefc7b2966b8e42fbb0e05735967cf87b

Checking Dreambooth requirements...
[+] bitsandbytes version 0.35.0 installed.
[+] diffusers version 0.10.2 installed.
[+] transformers version 4.25.1 installed.
[+] xformers version 0.0.14.dev0 installed.
[+] torch version 1.12.1+cu113 installed.
[+] torchvision version 0.13.1+cu113 installed.
`

Have you read the Readme?
Yes, a couple of times trying to figure out filewords (which I finally was able to use after using "[filewords]" instead of "[Filewords]", anyways that's a tangent)
Have you completely restarted the stable-diffusion-webUI, not just reloaded the UI?
Yes
Have you updated Dreambooth to the latest revision?
Yes
Have you updated the Stable-Diffusion-WebUI to the latest version?
Yes
No, really. Please save us both some trouble and update the SD-WebUI and Extension and restart before posting this.
Reply 'OK' Below to acknowledge that you did this.
I have the "git pull https://github.com/AUTOMATIC1111/stable-diffusion-webui" in my "wenui-user.bat" file
Describe the bug

Some other issues have this same problem, but their solution of reverting to an old version of dreambooth doesn't work for me.

I have a 8gb 3070 graphics card and a bit over a week ago was able to use LORA to train a model on my graphics card, I followed This Video on how to set up the settings for running dreambooth on less than 8gb of vram (using the "LORA DB - Low VRAM" settings at 4:10).

The settings worked and I was able to actually train models without it saying "CUDA out of memory". I didn't update my WebUi or dreambooth for a while until yesterday when I updated to the latest versions of both. I tried training a new model using the same settings and got the error "Exception training model: 'No executable batch size found, reached zero" I fiddled around with the settings a bit and it still wouldn't work.

Then I checked here and saw some people were having the same issue. People were recommending I go back to older versions of the extension so I did, I went back to the version I was using when it was working fine, eb47a0b, I ran it again, same settings, and got a different error "Exception training model: too many values to unpack (expected 2)"

Then I tried going back to the older version of the webui I was using when it worked 4af3ca5 and tried again, same error. Then I tried using commit fab41d8 and got the same "too many values to unpack" error

Finally I made a brand new, fresh install of the webui in a different folder using the version where it was working, manually installed the old version of dreambooth that was working, and made a new model, and manually added in the settings following the video just as I did over a week ago when it worked. And even then I still get the same "too many values to unpack" error.

At this point I don't know if its something wrong with the extension, the webui, or something else like python or torch or whatever, I have no knowledge of code at all so I can't deduce the issue any more than this.

I'll provide the logs for the "Exception training model: too many values to unpack (expected 2)" bug in the comments.
These logs are for the "Exception training model: 'No executable batch size found, reached zero" bug

Provide logs
Steps: 0%| | 0/9200 [00:00<?, ?it/s]OOM Detected, reducing batch/grad size to 0/1.
Traceback (most recent call last):
File "E:\stable-diffusion-webui\extensions\sd_dreambooth_extension\dreambooth\memory.py", line 116, in decorator
return function(batch_size, grad_size, prof, *args, **kwargs)
File "E:\stable-diffusion-webui\extensions\sd_dreambooth_extension\dreambooth\train_dreambooth.py", line 861, in inner_loop
accelerator.backward(loss)
File "E:\stable-diffusion-webui\venv\lib\site-packages\accelerate\accelerator.py", line 1314, in backward
self.scaler.scale(loss).backward(**kwargs)
File "E:\stable-diffusion-webui\venv\lib\site-packages\torch_tensor.py", line 396, in backward
torch.autograd.backward(self, gradient, retain_graph, create_graph, inputs=inputs)
File "E:\stable-diffusion-webui\venv\lib\site-packages\torch\autograd_init_.py", line 173, in backward
Variable.execution_engine.run_backward( # Calls into the C++ engine to run the backward pass
File "E:\stable-diffusion-webui\venv\lib\site-packages\torch\autograd\function.py", line 253, in apply
return user_fn(self, *args)
File "E:\stable-diffusion-webui\venv\lib\site-packages\torch\utils\checkpoint.py", line 146, in backward
torch.autograd.backward(outputs_with_grad, args_with_grad)
File "E:\stable-diffusion-webui\venv\lib\site-packages\torch\autograd_init.py", line 173, in backward
Variable._execution_engine.run_backward( # Calls into the C++ engine to run the backward pass
RuntimeError: CUDA out of memory. Tried to allocate 20.00 MiB (GPU 0; 8.00 GiB total capacity; 7.18 GiB already allocated; 0 bytes free; 7.28 GiB reserved in total by PyTorch) If reserved memory is >> allocated memory try setting max_split_size_mb to avoid fragmentation. See documentation for Memory Management and PYTORCH_CUDA_ALLOC_CONF
Steps: 0%| | 0/9200 [00:03<?, ?it/s]
Traceback (most recent call last):
File "E:\stable-diffusion-webui\extensions\sd_dreambooth_extension\scripts\dreambooth.py", line 561, in start_training
result = main(config, use_txt2img=use_txt2img)
File "E:\stable-diffusion-webui\extensions\sd_dreambooth_extension\dreambooth\train_dreambooth.py", line 973, in main
return inner_loop()
File "E:\stable-diffusion-webui\extensions\sd_dreambooth_extension\dreambooth\memory.py", line 114, in decorator
raise RuntimeError("No executable batch size found, reached zero.")
RuntimeError: No executable batch size found, reached zero.
Training completed, reloading SD Model.
Restored system models.
Returning result: Exception training model: 'No executable batch size found, reached zero.'.

Environment

Windows 10 Home
Version: 21H2

If Windows - WSL or native?

Native

What GPU are you using?

Asus DUAL GeForce RTX 3070 8 GB

Screenshots/Config

db_config.txt

The text was updated successfully, but these errors were encountered:

JasonEpic · 2023-01-11T03:19:23Z

Logs for the "Exception training model: too many values to unpack (expected 2)" bug

Initializing dreambooth training...
Replace CrossAttention.forward to use xformers
Class image dir is not set, defaulting to E:\DreamboothTraining\stable-diffusion-webui\models\dreambooth\MODELNAME\classifiers_0.
Loaded model.
Allocated: 0.0GB
Reserved: 0.0GB

Injecting trainable lora...
CUDA SETUP: Loading binary E:\DreamboothTraining\stable-diffusion-webui\venv\lib\site-packages\bitsandbytes\libbitsandbytes_cudaall.dll...
Exception parsing instance image: cannot identify image file 'E:\DreamboothTraining\stable-diffusion-webui\venv\Lib\site-packages\numpy\core\tests\data\recarray_from_file.fits'
Concept sks has 1 sample prompts.
Traceback (most recent call last):
File "E:\DreamboothTraining\stable-diffusion-webui\extensions\sd_dreambooth_extension\scripts\dreambooth.py", line 470, in start_training
result = main(config, mem_record, use_subdir=use_subdir, lora_model=lora_model_name,
File "E:\DreamboothTraining\stable-diffusion-webui\extensions\sd_dreambooth_extension\dreambooth\train_dreambooth.py", line 503, in main
train_dataset, train_dataloader = cache_latents(enc_vae=vae, orig_dataset=gen_dataset)
File "E:\DreamboothTraining\stable-diffusion-webui\extensions\sd_dreambooth_extension\dreambooth\train_dreambooth.py", line 489, in cache_latents
text_encoder_cache.append(text_encoder(d_batch["input_ids"])[0])
File "E:\DreamboothTraining\stable-diffusion-webui\venv\lib\site-packages\torch\nn\modules\module.py", line 1130, in _call_impl
return forward_call(*input, **kwargs)
File "E:\DreamboothTraining\stable-diffusion-webui\venv\lib\site-packages\transformers\models\clip\modeling_clip.py", line 811, in forward
return self.text_model(
File "E:\DreamboothTraining\stable-diffusion-webui\venv\lib\site-packages\torch\nn\modules\module.py", line 1130, in _call_impl
return forward_call(*input, **kwargs)
File "E:\DreamboothTraining\stable-diffusion-webui\venv\lib\site-packages\transformers\models\clip\modeling_clip.py", line 710, in forward
bsz, seq_len = input_shape
ValueError: too many values to unpack (expected 2)
Training completed, reloading SD Model.
Allocated: 0.0GB
Reserved: 0.0GB

Memory output: {}
Restored system models.
Allocated: 2.2GB
Reserved: 2.2GB

Returning result: Exception training model: too many values to unpack (expected 2)
db_config.txt

ovladuk · 2023-01-12T19:47:08Z

it doesn't look like it will be possible on 8gb for quite a while now

AyoKeito · 2023-01-13T14:55:50Z

Downgrade: https://www.reddit.com/r/StableDiffusion/comments/1062b6s/comment/j3gj1yu/?context=3

Exit from automatic1111 (close the the whole thing down)
Click this link [https://github.com/d8ahazard/sd_dreambooth_extension/tree/c5cb58328c555ac27679422b1da940a9b19de6f2] to go to a revision that works for me. Thats the 22 December dreambooth extension for automatic1111. Then :-
Click on the green link and download the zip of that commit.
You might want to take a backup of C:\stable-diffusion\stable-diffusion-webui\extensions\sd_dreambooth_extension before you take the next step.
Unzip it, and overwrite, to the C:\stable-diffusion\stable-diffusion-webui\extensions\sd_dreambooth_extension directory.
When you next run automatic1111 DON'T UPDATE THE EXTENTIONS. (Hopefully the extension will be updated soon so that it works again for 8Gb cards)

ovladuk · 2023-01-14T00:00:46Z

Downgrade: https://www.reddit.com/r/StableDiffusion/comments/1062b6s/comment/j3gj1yu/?context=3

Exit from automatic1111 (close the the whole thing down)
Click this link [https://github.com/d8ahazard/sd_dreambooth_extension/tree/c5cb58328c555ac27679422b1da940a9b19de6f2] to go to a revision that works for me. Thats the 22 December dreambooth extension for automatic1111. Then :-
Click on the green link and download the zip of that commit.
You might want to take a backup of C:\stable-diffusion\stable-diffusion-webui\extensions\sd_dreambooth_extension before you take the next step.
Unzip it, and overwrite, to the C:\stable-diffusion\stable-diffusion-webui\extensions\sd_dreambooth_extension directory.
When you next run automatic1111 DON'T UPDATE THE EXTENTIONS. (Hopefully the extension will be updated soon so that it works again for 8Gb cards)

i was using that extension but it doesn't seem to work properly now. results are completely wrong.

AyoKeito · 2023-01-14T00:05:29Z

@ovladuk you were using this exact version? How wrong? If it's just noise and nothing coherent then it's a version mismatch (probably xformers). Just make a clean webui install for dreambooth to another folder.

ovladuk · 2023-01-14T00:09:36Z

@ovladuk you were using this exact version? How wrong? If it's just noise and nothing coherent then it's a version mismatch (probably xformers). Just make a clean webui install for dreambooth to another folder.

yeah the prompts weren't giving me results that matched what i trained. what version of auto 11 should i use for clean install?

AyoKeito · 2023-01-14T00:13:53Z

@ovladuk latest worked for me today.

ovladuk · 2023-01-14T00:21:16Z

@ovladuk latest worked for me today.

well i have latest auto 11 with the extension's you linked cause i already found that previous version a few days ago but it recently started going wrong. so i don't know. don't see any point install a fresh copy of auto 11 of the version I'm already using.

AyoKeito · 2023-01-14T00:26:08Z

@ovladuk unless you experience something like what i get in my prompts #763 (like this) it's most likely incorrect configuration or low quality dataset.
You can also try dynamically load your LoRA using this extension and changing it's strength.

GucciFlipFlops1917 · 2023-01-16T16:40:23Z

You can also try dynamically load your LoRA using this extension and changing it's strength.

I really like that extension but d8ahazard results are natively incompatible. A workaround is to merge to a .ckpt first, then take the difference between that and the original base model using extract_lora_from_models.py. Main benefits are of course that dynamic loading capability, low file size, and the ability to easily apply/test LoRA weights with various larger ckpts. An interesting usage of the last part is to apply LoRA to derivative models.

JasonEpic added the new Just added, you should probably sort this. label Jan 11, 2023

d8ahazard mentioned this issue Jan 20, 2023

So many fixes, so much wow. #806

Merged

d8ahazard closed this as completed Jan 21, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

LORA on 8GB Graphics Cards is broken #755

LORA on 8GB Graphics Cards is broken #755

JasonEpic commented Jan 11, 2023

JasonEpic commented Jan 11, 2023

ovladuk commented Jan 12, 2023

AyoKeito commented Jan 13, 2023 •

edited

ovladuk commented Jan 14, 2023

AyoKeito commented Jan 14, 2023

ovladuk commented Jan 14, 2023

AyoKeito commented Jan 14, 2023

ovladuk commented Jan 14, 2023

AyoKeito commented Jan 14, 2023 •

edited

GucciFlipFlops1917 commented Jan 16, 2023

LORA on 8GB Graphics Cards is broken #755

LORA on 8GB Graphics Cards is broken #755

Comments

JasonEpic commented Jan 11, 2023

Kindly read the entire form below and fill it out with the requested information.

JasonEpic commented Jan 11, 2023

ovladuk commented Jan 12, 2023

AyoKeito commented Jan 13, 2023 • edited

ovladuk commented Jan 14, 2023

AyoKeito commented Jan 14, 2023

ovladuk commented Jan 14, 2023

AyoKeito commented Jan 14, 2023

ovladuk commented Jan 14, 2023

AyoKeito commented Jan 14, 2023 • edited

GucciFlipFlops1917 commented Jan 16, 2023

AyoKeito commented Jan 13, 2023 •

edited

AyoKeito commented Jan 14, 2023 •

edited