
RuntimeError: CUDA error: CUBLAS_STATUS_INVALID_VALUE when calling cublasSgemm( handle, opa, opb, m, n, k, &alpha, a, lda, b, ldb, &beta, c, ldc) #106

Open · aleemsidra opened this issue Jul 28, 2023 · 4 comments


aleemsidra commented Jul 28, 2023

Hi! I am trying to use LoRA for my convolution layers: self.conv = Conv2d(1, 16, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1), bias=False). I used the LoRA counterpart of nn.Conv2d as lora.Conv2d(n_chans_in, n, self.kernel_size, padding=self.padding, bias=False, r=2, lora_alpha=2).

The shapes of the tensors are: x.shape = torch.Size([32, 1, 256, 256]), self.lora_B.shape = torch.Size([48, 6]), self.lora_A.shape = torch.Size([6, 3]).
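
For context, here is a minimal standalone sketch of this setup (not my actual model code): it assumes loralib is imported as lora and plugs in the values 1, 16, and 3 from the Conv2d call above, which reproduces the reported LoRA shapes.

import torch
import loralib as lora

# Standalone repro of the layer described above: in_channels=1, out_channels=16,
# kernel_size=3, padding=1, with LoRA rank r=2 and lora_alpha=2.
layer = lora.Conv2d(1, 16, 3, padding=1, bias=False, r=2, lora_alpha=2)

x = torch.randn(32, 1, 256, 256)
print(layer.lora_B.shape)  # torch.Size([48, 6])
print(layer.lora_A.shape)  # torch.Size([6, 3])
out = layer(x)             # forward uses conv.weight + (lora_B @ lora_A).view(...) * scaling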

The expression (self.lora_B @ self.lora_A).view(self.conv.weight.shape) raises the following error:

/Documents/Domain_Apatation/UDAS/src/LoRA/loralib/layers.py:315, in forward(self, x)
    312 if self.r > 0 and not self.merged:
    313     return self.conv._conv_forward(
--> 314         x, 
    315 
    316         self.conv.weight + (self.lora_B @ self.lora_A).view(self.conv.weight.shape) * self.scaling,
    317         self.conv.bias
    318     )
    319 return self.conv(x)

RuntimeError: CUDA error: CUBLAS_STATUS_INVALID_VALUE when calling `cublasSgemm( handle, opa, opb, m, n, k, &alpha, a, lda, b, ldb, &beta, c, ldc)`

The number of columns in self.lora_B is 6 and the number of rows in self.lora_A is 6, which makes the matrix multiplication valid. But I still face this issue. Can you please help me resolve this bug?
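
As a sanity check on the shapes alone, here is a toy snippet with random tensors of the reported sizes (separate from my model; the names lora_B, lora_A, and delta are only illustrative):

import torch

lora_B = torch.randn(48, 6)
lora_A = torch.randn(6, 3)
delta = lora_B @ lora_A                 # shape (48, 3), i.e. 48 * 3 = 144 elements
print(delta.shape)                      # torch.Size([48, 3])
print(delta.view(16, 1, 3, 3).shape)    # torch.Size([16, 1, 3, 3]); 16*1*3*3 = 144, so view is valid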

@edwardjhu (Collaborator)

Can you try this operation on CPU to exclude GPU-related issues?

aleemsidra (Author) commented Jul 31, 2023

@edwardjhu I did that as:

lora_b = self.lora_B.detach().cpu()
lora_b.shape
(48, 6)
lora_a = self.lora_A.detach().cpu()
lora_a.shape
(6, 3)

Given these dimensions, lora_b @ lora_a is compatible for matrix multiplication.

self.conv.weight.shape
torch.Size([16, 1, 3, 3])

Then I tested the following, replacing view with reshape, and it worked:

a = self.conv._conv_forward(
    x.detach().cpu(),
    self.conv.weight.detach().cpu() + (self.lora_B.detach().cpu() @ self.lora_A.detach().cpu()).reshape(self.conv.weight.detach().cpu().shape) * self.scaling,
    self.conv.bias
)

I want to understand why this did not work on CUDA, since the inputs are all the same. I would like to run my computation on the GPU.
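
One way to narrow this down (a debugging sketch, not something from my training run): run just the product from the traceback on the GPU with synchronous kernel launches, since CUDA errors can be reported at a later call than the one that actually failed. CUDA_LAUNCH_BLOCKING only takes effect if set before CUDA is first used.

import os
os.environ["CUDA_LAUNCH_BLOCKING"] = "1"   # must be set before CUDA is initialized

import torch

lora_B = torch.randn(48, 6, device="cuda")
lora_A = torch.randn(6, 3, device="cuda")
delta = lora_B @ lora_A                    # the cublasSgemm call from the error
print(delta.view(16, 1, 3, 3).shape)       # torch.Size([16, 1, 3, 3])
torch.cuda.synchronize()                   # flush any deferred error so it surfaces here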

@edwardjhu (Collaborator)

Does reshape resolve the issue on GPU as well?
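
Something like this, keeping everything on the GPU (a sketch assuming it is run inside the layer's forward, with x and the parameters already on CUDA):

a = self.conv._conv_forward(
    x,
    self.conv.weight + (self.lora_B @ self.lora_A).reshape(self.conv.weight.shape) * self.scaling,
    self.conv.bias
)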

@aleemsidra (Author)

No.
