RuntimeError: CUDA error: invalid argument #50

i2chris · 2022-11-08T23:57:27Z

I'm running this dreambooth extension using all the default settings and only changing these three settings:

Instance prompt: photo of florich girl
Class prompt: photo of girl
Dataset directory: D:\images\flo-output

When I run this I get the error "RuntimeError: CUDA error: invalid argument"

Is there something obvious causing this?

Running on an RTX 3090 (24 GB).

Arguments: ('florich', 'D:\\images\\flo-output', '', 'photo of florich girl', 'photo of girl', '', '', 1.0, 7.5, 40.0, 0, 512, False, True, 1, 1, 1, 1000, 1, True, 5e-06, False, 'constant', 0, False, 0.9, 0.999, 0.01, 1e-08, 1, 500, 500, 'no', True, '', False, True, True) {}
Traceback (most recent call last):
  File "C:\github\stable-diffusion-webui\modules\ui.py", line 185, in f
    res = list(func(*args, **kwargs))
  File "C:\github\stable-diffusion-webui\webui.py", line 54, in f
    res = func(*args, **kwargs)
  File "C:\github\stable-diffusion-webui\extensions\sd_dreambooth_extension\dreambooth\dreambooth.py", line 265, in start_training
    trained_steps = main(config)
  File "C:\github\stable-diffusion-webui\extensions\sd_dreambooth_extension\dreambooth\train_dreambooth.py", line 790, in main
    accelerator.backward(loss)
  File "C:\github\stable-diffusion-webui\venv\lib\site-packages\accelerate\accelerator.py", line 884, in backward
    loss.backward(**kwargs)
  File "C:\github\stable-diffusion-webui\venv\lib\site-packages\torch\_tensor.py", line 396, in backward
    torch.autograd.backward(self, gradient, retain_graph, create_graph, inputs=inputs)
  File "C:\github\stable-diffusion-webui\venv\lib\site-packages\torch\autograd\__init__.py", line 173, in backward
    Variable._execution_engine.run_backward(  # Calls into the C++ engine to run the backward pass
  File "C:\github\stable-diffusion-webui\venv\lib\site-packages\torch\autograd\function.py", line 253, in apply
    return user_fn(self, *args)
  File "C:\github\stable-diffusion-webui\venv\lib\site-packages\torch\utils\checkpoint.py", line 146, in backward
    torch.autograd.backward(outputs_with_grad, args_with_grad)
  File "C:\github\stable-diffusion-webui\venv\lib\site-packages\torch\autograd\__init__.py", line 173, in backward
    Variable._execution_engine.run_backward(  # Calls into the C++ engine to run the backward pass
  File "C:\github\stable-diffusion-webui\venv\lib\site-packages\torch\autograd\function.py", line 253, in apply
    return user_fn(self, *args)
  File "C:\github\stable-diffusion-webui\venv\lib\site-packages\xformers\ops.py", line 369, in backward
    ) = torch.ops.xformers.efficient_attention_backward_cutlass(
  File "C:\github\stable-diffusion-webui\venv\lib\site-packages\torch\_ops.py", line 143, in __call__
    return self._op(*args, **kwargs or {})
RuntimeError: CUDA error: invalid argument
CUDA kernel errors might be asynchronously reported at some other API call,so the stacktrace below might be incorrect.
For debugging consider passing CUDA_LAUNCH_BLOCKING=1.
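
For anyone hitting the same trace: the last line of the message suggests setting CUDA_LAUNCH_BLOCKING=1, which makes CUDA kernel launches synchronous so the traceback should point at the call that actually failed. Below is a minimal sketch of one way to set it from Python, assuming the web UI is started from a wrapper script. The helper name `build_debug_env` and the `launch.py` command in the comment are illustrative assumptions, not part of the extension. The variable has to be set before CUDA is initialized, and training will likely run slower while it is on.

```python
import os


def build_debug_env(base_env=None):
    """Return a copy of the environment with synchronous CUDA launches enabled.

    `build_debug_env` is a hypothetical helper for illustration only.
    """
    env = dict(os.environ if base_env is None else base_env)
    # With this set, CUDA kernels run synchronously, so an error is reported
    # at the failing call rather than at some later, unrelated API call.
    env["CUDA_LAUNCH_BLOCKING"] = "1"
    return env


if __name__ == "__main__":
    env = build_debug_env()
    print("CUDA_LAUNCH_BLOCKING =", env["CUDA_LAUNCH_BLOCKING"])
    # Assumed usage: start the web UI as a child process with this environment.
    # import subprocess, sys
    # subprocess.run([sys.executable, "launch.py", "--xformers"], env=env, check=True)
```

The helper copies the environment instead of mutating `os.environ`, so the setting only applies to the child process it is passed to.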


zark119 · 2022-11-09T05:33:42Z

Did you add --xformers to the command-line args?

Pythonpa · 2022-11-09T05:39:35Z

So, do we need to add --xformers to the command-line args or not?

zark119 · 2022-11-09T05:42:27Z

> So, do we need to add --xformers to the command-line args or not?

Without --xformers, I had this error.

seihoukei · 2022-11-09T05:53:46Z

I get the same error with and without --xformers.

seihoukei · 2022-11-09T08:03:22Z

This is a duplicate of issue #48. I'll repeat my finding here: enabling Adam and fp16 precision made it work, so the default value of one of those two settings is what causes the error.

i2chris · 2022-11-09T08:59:55Z

Yes, --xformers is added to the command line.

d8ahazard · 2022-11-10T16:29:47Z

Closing as a duplicate of #48. I'll leave that one open for now until we establish this is fully resolved.

d8ahazard closed this as completed Nov 10, 2022

d8ahazard mentioned this issue Nov 19, 2022

Exception while training: CUDA error: invalid argument CUDA kernel errors might be asynchronously reported #242

Closed
