add simple demo ppl test with wikitext2 #17

Merged
merged 2 commits into from
Apr 26, 2023
Conversation

@qwopqwop200 (Collaborator) commented Apr 26, 2023

This is demo code that uses perplexity (ppl) to test the performance of AutoGPTQ.

opt-125m on wikitext2:

| Method | PPL |
| --- | --- |
| RTN baseline | 37.27 |
| GPTQ-for-LLaMa | 29.21 |
| AutoGPTQ | 29.87 |

Some performance degradation is observed relative to GPTQ-for-LLaMa, but the gap seems to be a minor error.
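For context on the metric being compared: perplexity is the exponential of the average per-token negative log-likelihood over the evaluation text. A minimal sketch of that calculation (the function name and the synthetic inputs are illustrative, not taken from the demo code in this PR):

```python
import math

def perplexity(token_nlls):
    """Perplexity = exp of the mean per-token negative log-likelihood.

    token_nlls: a list of per-token NLL values (natural log), as a language
    model would produce over an evaluation corpus such as wikitext2.
    """
    return math.exp(sum(token_nlls) / len(token_nlls))

# A model that is uniform over a 4-symbol vocabulary assigns each token
# probability 1/4, so every per-token NLL is ln(4) and perplexity is 4.
uniform_nlls = [math.log(4)] * 10
print(perplexity(uniform_nlls))  # prints 4.0 (up to float rounding)
```

In practice the demo slides a fixed-length window over the tokenized wikitext2 test set, accumulates the model's NLL on each window, and exponentiates the average; lower perplexity means the quantized model predicts the text better.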

@PanQiWei (Collaborator) commented:
Yeah, I can also confirm that there is a small gap between AutoGPTQ and GPTQ-for-LLaMa by running this example.

I'm not quite sure whether it's because the logic AutoGPTQ uses in the model.quant() function, and the way the attention_mask is processed, differ slightly from GPTQ-for-LLaMa.

This finding is important; maybe there's room for improvement here.

I think this pr can be merged.

@PanQiWei PanQiWei merged commit 5a70052 into main Apr 26, 2023
@qwopqwop200 qwopqwop200 deleted the simple-demo branch April 26, 2023 12:00