
Conversation

kyujin-cho
Contributor

This PR resolves #767. As described in #767 (comment), referencing the model_type instance variable results in the model variant being determined incorrectly. Until there is some movement to fix this behavior on the Hugging Face side, this patch is a simple workaround to get rid of the CUDA error.
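
A minimal sketch of the kind of check the workaround relies on, judging from the line quoted in the comment below: the model type is read from the config's serialized dict rather than from the model_type attribute. The helper name, the accepted values, and the example checkpoint are illustrative, not the exact patch.

    # Illustrative sketch only -- not the exact vLLM patch. The idea is to read
    # "model_type" from the config's dict form instead of the instance attribute.
    from transformers import AutoConfig

    def looks_like_falcon(hf_config) -> bool:
        # to_dict() exposes the plain config values; the accepted names below
        # cover the old RefinedWeb checkpoints as well as the newer "falcon"
        # model_type (assumption: vLLM's actual condition may differ).
        model_type = hf_config.to_dict().get("model_type", "")
        return model_type in ("falcon", "RefinedWeb", "RefinedWebModel")

    config = AutoConfig.from_pretrained("tiiuae/falcon-40b", trust_remote_code=True)
    print(looks_like_falcon(config))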

@justinphan3110cais

I got

    self.hf_config.to_dict().["model_type"] == "falcon"
                             ^
SyntaxError: invalid syntax
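
The stray "." before the subscript is what triggers the SyntaxError; presumably the small change the maintainer mentions below is just dropping it, e.g.:

    # corrected form of the quoted line (assumption: the extra "." is the only problem)
    self.hf_config.to_dict()["model_type"] == "falcon"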

@WoosukKwon self-requested a review September 10, 2023 08:38
Collaborator

@WoosukKwon left a comment

Hi @kyujin-cho, thanks for submitting the PR! I've made a small change before merging it.

@justinphan3110cais Thanks for reporting the bug!

@WoosukKwon merged commit 898285c into vllm-project:main Sep 10, 2023
@kyujin-cho deleted the fix/falcon-40b-cuda-error branch September 10, 2023 08:39
hongxiayang pushed a commit to hongxiayang/vllm that referenced this pull request Feb 13, 2024

Successfully merging this pull request may close these issues.

CUDA error: an illegal memory access with Falcon 40B