You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Fused modules were moved from the original TinyChat into the core of AWQ. Work on generalizing the code better with the parameters in the fused modules.
Fused modules were moved from the original TinyChat into the core of AWQ. Work on generalizing the code better with the parameters in the fused modules.
A few things that need cleaning:
https://github.com/casper-hansen/AutoAWQ/blob/main/awq/modules/fused_attn.py#L12
https://github.com/casper-hansen/AutoAWQ/blob/main/awq/modules/fused_attn.py#L107
https://github.com/casper-hansen/AutoAWQ/blob/main/awq/modules/fused_attn.py#L139
https://github.com/casper-hansen/AutoAWQ/blob/main/awq/modules/fused_mlp.py#L79C8-L79C31
https://github.com/casper-hansen/AutoAWQ/blob/main/awq/modules/fused_norm.py#L27
Additionally, move the code for
isinstance(m, LlamaMLP)
into the actual model class instead.The text was updated successfully, but these errors were encountered: