Your current environment
Latest PyTorch commit and latest vLLM commit.
🐛 Describe the bug
On a cache hit, more things are marked dynamic than needed. This looks very similar to what happens in
#27899 (read my last comment).
This is risky because marking more things dynamic with duck shapes can cause silent specializations that are not wanted.
To repro, run any model with +dynamic logs enabled.
For example, for "Qwen/Qwen2-7B-Instruct", a cold run shows only three things marked dynamic.
A warm run shows many more things marked dynamic.
This is BAD!
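To make the cold vs. warm comparison concrete, here is a minimal sketch that diffs the two logs and lists the sources that only became dynamic on the warm run. It assumes the +dynamic logs contain lines of the form `create_symbol <name> = <hint> for <source>`; the exact format may differ between PyTorch versions, and the sample log lines and input names below are made up for illustration, not taken from a real run.

```python
from __future__ import annotations

import re

# Assumption: each newly created symbol is logged as
#   "create_symbol s0 = 8 for L['x'].size()[0] ..."
# Adjust the pattern if your PyTorch version logs a different shape.
_SYMBOL_RE = re.compile(r"create_symbol\s+(\w+)\s*=\s*\S+\s+for\s+(\S+)")


def dynamic_sources(log_text: str) -> set[str]:
    """Return the sources (e.g. L['x'].size()[0]) that were given a symbol."""
    return {match.group(2) for match in _SYMBOL_RE.finditer(log_text)}


def extra_dynamic(cold_log: str, warm_log: str) -> set[str]:
    """Return the sources that are dynamic in the warm run but not the cold run."""
    return dynamic_sources(warm_log) - dynamic_sources(cold_log)


# Made-up sample logs, only to show the shape of the comparison.
cold = "create_symbol s0 = 8 for L['input_ids'].size()[0] [2, int_oo]\n"
warm = cold + "create_symbol s1 = 4096 for L['weight'].size()[1] [2, int_oo]\n"

extra = extra_dynamic(cold, warm)
print(sorted(extra))
```

Feeding it the captured logs of a real cold run and a real warm run should list exactly the dimensions that were marked dynamic only on the cache hit.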
We need to:
- Disable automatic dynamic.
- On a warm run, know which things should be marked dynamic and mark them explicitly.
OR
Serialize fake tensors for the warm run!
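A rough sketch of the first option, under stated assumptions: the cold run records which input dimensions were dynamic, stores them next to the cache artifact, and the warm run marks exactly those dimensions. The helper names, the JSON file, and the input names (`input_ids`, `positions`, `weight`) are all hypothetical; this is not how vLLM or PyTorch currently store this information.

```python
import json
import os
import tempfile
from typing import Any, Callable, Dict, List, Tuple


def save_dynamic_dims(path: str, dims: Dict[str, List[int]]) -> None:
    """Cold run: persist which dims of which inputs were dynamic."""
    with open(path, "w") as f:
        json.dump(dims, f)


def load_dynamic_dims(path: str) -> Dict[str, List[int]]:
    """Warm run: read back the dims recorded by the cold run."""
    with open(path) as f:
        return json.load(f)


def mark_recorded_dims(
    inputs: Dict[str, Any],
    dims: Dict[str, List[int]],
    mark_fn: Callable[[Any, int], None],
) -> List[Tuple[str, int]]:
    """Call mark_fn(tensor, dim) only for the recorded dims; return what was marked."""
    marked = []
    for name, dim_list in dims.items():
        if name not in inputs:
            continue  # input not present in this call; nothing to mark
        for dim in dim_list:
            mark_fn(inputs[name], dim)
            marked.append((name, dim))
    return marked


# Demo with placeholder objects instead of tensors and a recording mark_fn.
calls: List[int] = []
with tempfile.TemporaryDirectory() as tmp:
    path = os.path.join(tmp, "dynamic_dims.json")
    save_dynamic_dims(path, {"input_ids": [0], "positions": [0]})
    recorded = load_dynamic_dims(path)

inputs = {"input_ids": object(), "positions": object(), "weight": object()}
marked = mark_recorded_dims(inputs, recorded, lambda tensor, dim: calls.append(dim))
print(marked)
```

With PyTorch, `mark_fn` could be `torch._dynamo.mark_dynamic`, combined with `torch._dynamo.config.automatic_dynamic_shapes = False` so that nothing else is promoted to dynamic. Whether this fits the warm-start cache path is untested here.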