[Bugfix] Fine-tune gptq_marlin configs to be more similar to marlin #4626

alexm-neuralmagic · 2024-05-06T16:05:41Z

test_gptq_marlin.py compares "gptq" outputs against "gptq_marlin" outputs, but the two can occasionally diverge slightly. This PR makes gptq_marlin use K/N breakdown configs that are closer to those of the original marlin, so that its output stays more similar to gptq.
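For readers unfamiliar with this kind of comparison test, here is a minimal sketch of how divergence between two backends' greedy outputs might be measured. It is not the code in test_gptq_marlin.py; `first_divergence`, `outputs_match`, and the `min_prefix` threshold are hypothetical names introduced only for illustration, and the token IDs are made up.

```python
from typing import Optional, Sequence


def first_divergence(a: Sequence[int], b: Sequence[int]) -> Optional[int]:
    """Return the index of the first position where a and b differ.

    Returns None when the two token sequences are identical.
    (Hypothetical helper, not part of vLLM.)
    """
    for i, (x, y) in enumerate(zip(a, b)):
        if x != y:
            return i
    # One sequence is a strict prefix of the other.
    if len(a) != len(b):
        return min(len(a), len(b))
    return None


def outputs_match(a: Sequence[int], b: Sequence[int], min_prefix: int) -> bool:
    """Accept two outputs if they agree on at least min_prefix leading tokens.

    min_prefix is an assumed tolerance knob for this sketch only.
    """
    idx = first_divergence(a, b)
    return idx is None or idx >= min_prefix


# Made-up token IDs standing in for "gptq" and "gptq_marlin" generations.
gptq_tokens = [101, 7, 42, 13, 9]
gptq_marlin_tokens = [101, 7, 42, 13, 8]
print(first_divergence(gptq_tokens, gptq_marlin_tokens))
print(outputs_match(gptq_tokens, gptq_marlin_tokens, min_prefix=4))
```

A check of this shape tolerates late divergence caused by small numerical differences between kernels, while still failing if the outputs disagree early.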

mgoin

Nice job, LGTM!

…llm-project#4626)

fine-tune gptq_marlin configs to be more similar to marlin

10b7d36

mgoin approved these changes May 7, 2024


robertgshaw2-neuralmagic enabled auto-merge (squash) May 9, 2024 00:12

ywang96 approved these changes May 9, 2024


robertgshaw2-neuralmagic merged commit e288df0 into vllm-project:main May 9, 2024
59 checks passed

robertgshaw2-neuralmagic deleted the marlin_8bit_fix branch May 9, 2024 00:14

z103cb pushed a commit to z103cb/opendatahub_vllm that referenced this pull request May 9, 2024

[Bugfix] Fine-tune gptq_marlin configs to be more similar to marlin (v…

626f3f4

…llm-project#4626)

robertgshaw2-neuralmagic pushed a commit to neuralmagic/nm-vllm that referenced this pull request May 19, 2024

[Bugfix] Fine-tune gptq_marlin configs to be more similar to marlin (v…

32314e5

…llm-project#4626)

dtrifiro pushed a commit to dtrifiro/vllm that referenced this pull request May 21, 2024

[Bugfix] Fine-tune gptq_marlin configs to be more similar to marlin (v…

53a9503

…llm-project#4626)

Temirulan pushed a commit to Temirulan/vllm-whisper that referenced this pull request Sep 6, 2024

[Bugfix] Fine-tune gptq_marlin configs to be more similar to marlin (v…

571e94d

…llm-project#4626)
