
AttributeError: module 'torch.nn.functional' has no attribute 'scaled_dot_product_attention' #2260

Closed
leszekhanusz opened this issue May 21, 2023 · 1 comment
Labels: bug (Something isn't working)

Describe the bug

I would like to use the TheBloke/Wizard-Vicuna-13B-Uncensored-GPTQ model on my RTX3080 with 10GB of VRAM.

It works great at first, but once the input gets too long it runs out of VRAM and stops working.

So I'm trying to use the model with pre_layer set to 20 to offload part of the model to the CPU, but when I do that I get the following error:

Traceback (most recent call last):
  File "/mnt/tera/git-repos/text-generation-webui/modules/callbacks.py", line 73, in gentask
    ret = self.mfunc(callback=_callback, **self.kwargs)
  File "/mnt/tera/git-repos/text-generation-webui/modules/text_generation.py", line 267, in generate_with_callback
    shared.model.generate(**kwargs)
  File "/mnt/tera/miniconda3/envs/textgen/lib/python3.10/site-packages/torch/autograd/grad_mode.py", line 27, in decorate_context
    return func(*args, **kwargs)
  File "/mnt/tera/miniconda3/envs/textgen/lib/python3.10/site-packages/transformers/generation/utils.py", line 1565, in generate
    return self.sample(
  File "/mnt/tera/miniconda3/envs/textgen/lib/python3.10/site-packages/transformers/generation/utils.py", line 2612, in sample
    outputs = self(
  File "/mnt/tera/miniconda3/envs/textgen/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1194, in _call_impl
    return forward_call(*input, **kwargs)
  File "/mnt/tera/miniconda3/envs/textgen/lib/python3.10/site-packages/transformers/models/llama/modeling_llama.py", line 688, in forward
    outputs = self.model(
  File "/mnt/tera/miniconda3/envs/textgen/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1194, in _call_impl
    return forward_call(*input, **kwargs)
  File "/mnt/tera/git-repos/text-generation-webui/repositories/GPTQ-for-LLaMa/llama_inference_offload.py", line 154, in forward
    layer_outputs = decoder_layer(
  File "/mnt/tera/miniconda3/envs/textgen/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1194, in _call_impl
    return forward_call(*input, **kwargs)
  File "/mnt/tera/miniconda3/envs/textgen/lib/python3.10/site-packages/transformers/models/llama/modeling_llama.py", line 293, in forward
    hidden_states, self_attn_weights, present_key_value = self.self_attn(
  File "/mnt/tera/miniconda3/envs/textgen/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1194, in _call_impl
    return forward_call(*input, **kwargs)
  File "/mnt/tera/git-repos/text-generation-webui/repositories/GPTQ-for-LLaMa/quant/fused_attn.py", line 154, in forward
    attn_output = F.scaled_dot_product_attention(query_states, key_states, value_states, is_causal=is_causal)
AttributeError: module 'torch.nn.functional' has no attribute 'scaled_dot_product_attention'

The PyTorch version in that Python environment is 1.13.1, and I believe F.scaled_dot_product_attention is only available in newer versions (2.0 and later) of PyTorch.
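For context, here is roughly what the missing function computes; on torch < 2.0 the same result can be obtained with plain matmul/softmax ops. This is only a minimal sketch with my own wrapper names, not code from GPTQ-for-LLaMa:

```python
import math
import torch
import torch.nn.functional as F

def sdpa_fallback(query, key, value, is_causal=False):
    """Plain scaled dot-product attention for torch < 2.0 (no fused kernel)."""
    scale = 1.0 / math.sqrt(query.size(-1))
    attn_weights = torch.matmul(query, key.transpose(-2, -1)) * scale
    if is_causal:
        # Mask out attention to future positions before the softmax.
        q_len, k_len = query.size(-2), key.size(-2)
        causal_mask = torch.ones(q_len, k_len, device=query.device).tril().bool()
        attn_weights = attn_weights.masked_fill(~causal_mask, float("-inf"))
    attn_weights = torch.softmax(attn_weights, dim=-1)
    return torch.matmul(attn_weights, value)

def scaled_dot_product_attention(query, key, value, is_causal=False):
    # Dispatch to the fused kernel when it exists (PyTorch >= 2.0), else fall back.
    if hasattr(F, "scaled_dot_product_attention"):
        return F.scaled_dot_product_attention(query, key, value, is_causal=is_causal)
    return sdpa_fallback(query, key, value, is_causal=is_causal)

# Quick check with dummy tensors shaped (batch, heads, seq_len, head_dim).
q = k = v = torch.randn(1, 2, 8, 16)
out = scaled_dot_product_attention(q, k, v, is_causal=True)
print(out.shape)  # torch.Size([1, 2, 8, 16])
```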

Note that I had to switch the GPTQ-for-LLaMa repository in repositories/GPTQ-for-LLaMa to a recent commit of the qwopqwop200 repo to make this model work.

What should I do? Is it safe to upgrade to PyTorch 2.0, or will that cause other issues?

Is there an existing issue for this?

  • I have searched the existing issues

Reproduction

Try to use the above model with pre_layer set to 20 and the GPTQ-for-LLaMa repo set to the commit mentioned above.

Screenshot

No response

Logs

N/A

System Info

Linux, RTX3080 10GB

leszekhanusz (Author):

A simple pip install --upgrade torch to move to torch 2.x solved the problem.
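For anyone else hitting this, a quick way to confirm the upgraded environment is fine (a plain torch sanity check, nothing specific to the webui):

```python
import torch
import torch.nn.functional as F

print(torch.__version__)                           # should now report 2.x
print(hasattr(F, "scaled_dot_product_attention"))  # True on PyTorch >= 2.0
print(torch.cuda.is_available())                   # make sure the new wheel still sees the GPU
```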
