Skip to content

Conversation

@Qubitium
Copy link
Collaborator

@Qubitium Qubitium commented Sep 21, 2025

@nbasyl We just did a major refractor of internals to add AWQ. Just a little cleanup in the EoRA paths. The biggest change is doing all diff in fp32. It appears if the diff in fp16 is very small, it can actually result in 0.0 if we don't upcast to fp32.

We have not tested awq and eora execution paths but I believe eora may also work with awq?

Signed-off-by: Qubitium <Qubitium@modelcloud.ai>
@Qubitium Qubitium changed the title always diff in fp32 + cleanup eora: always diff in fp32 + cleanup Sep 21, 2025
Signed-off-by: Qubitium <Qubitium@modelcloud.ai>
@Qubitium Qubitium merged commit 1df75b0 into main Sep 21, 2025
5 checks passed
@Qubitium Qubitium deleted the clean-eora branch September 21, 2025 13:43
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants