Qubitium (Collaborator) commented Sep 10, 2025

@avtc Please test this code, which should prevent the CUDA async assert errors in the 10-GPU + MoE + mock-quant setup. I'm pretty sure CUDA has a limit that allows only one sync operation per device at a time.

Oof, I merged this PR by mistake. Please test it: if it works, I will leave it in; if not, I will revert.

Qubitium merged commit 37f2f6b into main Sep 10, 2025
5 checks passed
Qubitium (Collaborator, Author)

@LRL-ModelCloud Please also test this PR ASAP; I will revert if there are issues.

avtc (Contributor) commented Sep 10, 2025

@Qubitium it works OK

Qubitium deleted the Qubitium-patch-7 branch September 11, 2025 22:37