Replies: 1 comment
-
|
I am also seeing this error in logs, but my performance seems suboptimal. However I am running dual RTX4000 Ada, not 3090s. |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
Now that I can do MTP and mmproj together with split mode tensor all at the same time, I see a nice performance boost on my 3090s. (qwen3.6-27B-q8 now starting out at 60 t/s)
But I do notice:
Is split mode tensor fundamentally incompatible with backend sampling, or is it "just" an issue of the code not having been written yet?
Beta Was this translation helpful? Give feedback.
All reactions