Neural Magic
Neural Magic helps developers accelerate machine learning performance using automated model sparsification techniques and inference technologies.
Repositories
Showing 10 of 38 repositories
- compressed-tensors Public
A safetensors extension to efficiently store sparse quantized tensors on disk
- nm-AutoGPTQ Public Forked from AutoGPTQ/AutoGPTQ
An easy-to-use LLM quantization package with user-friendly APIs, based on the GPTQ algorithm.
- alpaca_eval Public Forked from tatsu-lab/alpaca_eval
An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.