-
-
Notifications
You must be signed in to change notification settings - Fork 282
Issues: turboderp/exllamav2
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
[BUG] Out of memory from a 2.4bpw 70B parameter model
bug
Something isn't working
#677
opened Nov 19, 2024 by
cmunna0052
3 tasks done
[BUG] Async with Paged Attention Reduces accuracy
bug
Something isn't working
#676
opened Nov 18, 2024 by
rjmehta1993
3 tasks done
[BUG] [Qwen] Draft model produce garbage output
bug
Something isn't working
#674
opened Nov 14, 2024 by
Nepherpitou
3 tasks done
[REQUEST] Convert.py: Option to skip measurement when setting 8.0/8.0
#673
opened Nov 13, 2024 by
Originalimoc
3 tasks done
[PAPER] New quant method with SOTA quality and speed: QTIP
#668
opened Nov 1, 2024 by
TyraVex
3 tasks done
[REQUEST] Alternative way to the Pytorch environment variables on Windows to set Pytorch memory management parameters
#664
opened Oct 29, 2024 by
Nexesenex
3 tasks done
[BUG] AMD - Out of memory errors despite having plenty of VRAM
bug
Something isn't working
#662
opened Oct 27, 2024 by
RSAStudioGames
3 tasks done
[REQUEST] Llama 3.2 Vision Support (or already exists?)
#658
opened Oct 18, 2024 by
grimulkan
3 tasks done
[BUG] Appending-Runtime-LoRA-weights
bug
Something isn't working
#656
opened Oct 16, 2024 by
royallavanya140
3 tasks done
[BUG] Convert script fails to run on Something isn't working
master
branch as of v0.2.3
bug
#655
opened Oct 15, 2024 by
iamwavecut
3 tasks done
[BUG] RAM UTILISATION IS INCREASING RAPIDLY
bug
Something isn't working
#639
opened Sep 25, 2024 by
UTSAV-44
[REQUEST] Is it possible and a lot of trouble to support flux?
#631
opened Sep 22, 2024 by
Ph0rk0z
3 tasks done
[BUG] Random slowdowns in tensor parallel.
bug
Something isn't working
#630
opened Sep 21, 2024 by
Ph0rk0z
3 tasks done
[BUG] Failed to quantize Qwen2.5-Math-72B-Instruct: Measurement/inference error (3): hidden_states
bug
Something isn't working
#627
opened Sep 19, 2024 by
Orion-zhen
3 tasks done
[BUG] Quantization of Qwen return garbage
bug
Something isn't working
#621
opened Sep 10, 2024 by
fahadh4ilyas
3 tasks done
Previous Next
ProTip!
Mix and match filters to narrow down what you’re looking for.