Tags · vllm-project/llm-compressor

0.4.1

Update gemma2 examples with a note about sample generation (#1176)

SUMMARY:
- Add a note advising users to either downgrade transformers from 4.49
or use vLLM for generation
- We should revisit why this is only happening on generation with this
new release but can be revisited down the road

Feb 19, 2025
6a1ba3c
zip
tar.gz
Notes
Downloads

0.4.0

bump; set ct version (#1076)

SUMMARY:
"please provide a brief summary"


TEST PLAN:
"please outline how the changes were tested"

Jan 15, 2025
829af5b
zip
tar.gz
Notes
Downloads

0.3.1

update version (#969)

* update version

* pin ct version

Dec 11, 2024
c3608a0
zip
tar.gz
Notes
Downloads

0.3.0

bump version (#907)

Signed-off-by: Dipika <dipikasikka1@gmail.com>

Nov 12, 2024
93832a6
zip
tar.gz
Notes
Downloads

0.2.0

Update MoE examples (#192)

* Update MoE examples

* Add top-level link

* Fix deepseek_moe_w8a8_int8.py

* Add deepseek_moe_w8a8_fp8.py

* Quality

* Quality

Sep 23, 2024
2e0035f
zip
tar.gz
Notes
Downloads

0.1.0

Offloading Bug Fix (#58)

* fix fstring

* fix offloaded sparsity calculation

Aug 6, 2024
066d1e4
zip
tar.gz
Notes
Downloads

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

0.4.1

0.4.0

0.3.1

0.3.0

0.2.0

0.1.0

Tags: vllm-project/llm-compressor

0.4.1

0.4.0

0.3.1

0.3.0

0.2.0

0.1.0