
[Bug]: Error message when calling the vLLM API, after which vLLM shuts down (vLLM 0.9.1, GPU: RTX 5060 Ti) #19671

Open
@l137295

Description


Your current environment

The output of `python collect_env.py`
Your output of `python collect_env.py` here

🐛 Describe the bug

```
RuntimeError: CUDA error: no kernel image is available for execution on the device
2025-06-12 20:49:44 CUDA kernel errors might be asynchronously reported at some other API call, so the stacktrace below might be incorrect.
2025-06-12 20:49:44 For debugging consider passing CUDA_LAUNCH_BLOCKING=1
2025-06-12 20:49:44 Compile with TORCH_USE_CUDA_DSA to enable device-side assertions.
```
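
This error typically means the installed PyTorch/vLLM wheels were not compiled for the GPU's architecture: the RTX 5060 Ti is a Blackwell-generation card (compute capability likely sm_120), which is only covered by newer CUDA builds of PyTorch. Below is a minimal diagnostic sketch, assuming PyTorch is importable in the same environment as vLLM, that checks whether the current torch build ships kernels for this GPU:

```python
# Minimal diagnostic sketch: checks whether the installed torch build
# ships compiled kernels for this GPU's compute capability.
# Assumption: PyTorch is installed in the same environment as vLLM.
import torch

print("torch version:", torch.__version__)
print("CUDA available:", torch.cuda.is_available())
if torch.cuda.is_available():
    major, minor = torch.cuda.get_device_capability(0)
    print(f"GPU: {torch.cuda.get_device_name(0)} (sm_{major}{minor})")
    # Architectures this torch build was compiled for, e.g. ['sm_80', 'sm_90'].
    print("Compiled arch list:", torch.cuda.get_arch_list())
    # If the GPU's sm_XY (likely sm_120 for an RTX 5060 Ti) is absent from
    # the list, "no kernel image is available" is the expected failure.
```

If the GPU's architecture is missing from the compiled arch list, reinstalling PyTorch (and vLLM) from a wheel built for a CUDA version that supports Blackwell is the usual next step; the exact wheel to use is not confirmed in this report.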

Before submitting a new issue...

  • Make sure you already searched for relevant issues, and asked the chatbot living at the bottom right corner of the documentation page, which can answer lots of frequently asked questions.
