-
Notifications
You must be signed in to change notification settings - Fork 414
Pull requests: AutoGPTQ/AutoGPTQ
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Supporting uint4 inference of pre-quantized models in HPU
#689
opened Jun 17, 2024 by
HolyFalafel
Loading…
[BUG]Replace
"python"
with sys.executable
in setup.py
#686
opened Jun 14, 2024 by
AnirudhRahul
Loading…
[PERFORMANCE] Fix Packing thread regression in code
#642
opened Apr 16, 2024 by
Qubitium
Loading…
2 tasks done
[BUG/FEATURE] Fix Sym=False, new checkpoint_format = gptq_v2
#640
opened Apr 12, 2024 by
Qubitium
Loading…
29 tasks done
[Minor] peft bug fix: HF peft version and tokenizer path in peft scripts
#493
opened Dec 24, 2023 by
realAsma
Loading…
Allow specifying GPU used for quantisation, overriding hardcoded cuda:0
#405
opened Nov 5, 2023 by
TheBloke
Loading…
Fix dtype mismatch using triton kernels for training Llama2 LoRA
#268
opened Aug 19, 2023 by
briansemrau
Loading…
Previous Next
ProTip!
Mix and match filters to narrow down what you’re looking for.