How long does it take to quantize? #32

fahadh4ilyas · 2024-03-01T07:17:22Z

I'm been using quantization tools like GPTQ, Exllama, or QUIP#. Those tools is quite fast to do quantization in a single A6000 gpu. But, this tool takes a really long time even though I'm using two A6000 gpu. How long does it take for quantizing Mistral 7B using two A6000 gpu and this parameters:

python main.py ../models/my-mistral-7B wikitext2 --nsamples=1024 --num_codebooks=1 --nbits_per_codebook=16 --in_group_size=8 --relative_mse_tolerance=0.01 --finetune_relative_mse_tolerance=0.001 --finetune_batch_size=32 --local_batch_size=1 --save ../models/my-mistral-7B-AQLM --model_seqlen 8192 --offload_activations

iamwavecut · 2024-03-02T22:46:40Z

See #28

github-actions · 2024-04-02T01:45:49Z

This issue is stale because it has been open for 30 days with no activity.

github-actions · 2024-04-17T01:45:17Z

This issue was closed because it has been inactive for 14 days since being marked as stale.

github-actions bot added the stale label Apr 2, 2024

github-actions bot closed this as completed Apr 17, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

How long does it take to quantize? #32

How long does it take to quantize? #32

fahadh4ilyas commented Mar 1, 2024

iamwavecut commented Mar 2, 2024

github-actions bot commented Apr 2, 2024

github-actions bot commented Apr 17, 2024

How long does it take to quantize? #32

How long does it take to quantize? #32

Comments

fahadh4ilyas commented Mar 1, 2024

iamwavecut commented Mar 2, 2024

github-actions bot commented Apr 2, 2024

github-actions bot commented Apr 17, 2024