-
Notifications
You must be signed in to change notification settings - Fork 61
Issues: huggingface/optimum-quanto
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
Only random noise is generated with Flux + LoRA with optimum-quanto >= 0.2.5
#343
opened Oct 30, 2024 by
nelapetrzelkova
Will module output not be quantized when the model is directly trained after Calibration?
#336
opened Oct 11, 2024 by
tusiqi1
Corrupted outputs with Marlin int4 kernels as parallelization increases
bug
Something isn't working
help wanted
Extra attention is needed
#332
opened Oct 6, 2024 by
dacorvo
qin4 inference fails with RuntimeError: Cannot set version_counter for inference tensor
#304
opened Sep 3, 2024 by
BenjaminBossan
Potential Gradient Error when Reloading Frozen Weights in
qmodule.py
_load_from_state_dict
Stale
#293
opened Aug 24, 2024 by
cjfghk5697
Packages created on the CI are missing cpp and cuda extension files
#254
opened Jul 23, 2024 by
dacorvo
Inference from a reload quantized open clip model (by .load_state_dict) resulted in IndexError
Stale
#217
opened Jun 24, 2024 by
kechan
Switch to ruff native formatter
good first issue
Good for newcomers
help wanted
Extra attention is needed
Stale
#186
opened Apr 22, 2024 by
dacorvo
ProTip!
Type g i on any issue or pull request to go back to the issue listing page.