@Qubitium
Collaborator

@Qubitium Qubitium commented Oct 1, 2025

@avtc This fixes the stack trace you reported on save.

Signed-off-by: Qubitium <Qubitium@modelcloud.ai>
@Qubitium Qubitium merged commit 8db921b into main Oct 1, 2025
5 checks passed
@Qubitium Qubitium deleted the thread-ctx-3 branch October 1, 2025 13:29
@avtc
Contributor

avtc commented Oct 1, 2025

Works!

@Qubitium
Collaborator Author

Qubitium commented Oct 1, 2025

> Works!

I still need to find out why your Q.to() is failing. I even set up a CI test and it couldn't replicate the bug. It doesn't make sense. The Q.to() bug, I mean.

@avtc
Contributor

avtc commented Oct 1, 2025

Have you tried with GLM-4.5-Air, mock_quantization=True, samples: 1, 5+ GPUs? On Python 3.13.7t?
