Skip to content

Conversation

@ochougul
Copy link
Contributor

No description provided.

Signed-off-by: Onkar Chougule <quic_ochougul@quicinc.com>
@ochougul ochougul changed the title Enabled repeat KV heads for AWQ/GPTQ models Enabled repeat KV heads for AWQ models Nov 24, 2024
Signed-off-by: Onkar Chougule <quic_ochougul@quicinc.com>
Signed-off-by: Onkar Chougule <quic_ochougul@quicinc.com>
Signed-off-by: Onkar Chougule <quic_ochougul@quicinc.com>
@ochougul ochougul self-assigned this Nov 25, 2024
@ochougul ochougul added enhancement New feature or request quantization labels Nov 25, 2024
@ochougul ochougul merged commit d6f2a1a into quic:main Nov 25, 2024
4 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

enhancement New feature or request quantization

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants