Optimum

The Optimum library supports quantization for Intel, Furiosa, ONNX Runtime, and GPTQ, and it also exposes lower-level PyTorch quantization functions. Consider using Optimum for quantization if you are targeting specific, optimized hardware such as Intel CPUs or Furiosa NPUs, or a model accelerator such as ONNX Runtime.
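As an illustration of the ONNX Runtime path, here is a minimal sketch of dynamic INT8 quantization with Optimum. It assumes `optimum` is installed with its ONNX Runtime extra, and that the target CPU supports AVX512-VNNI (other `AutoQuantizationConfig` presets may suit other hardware better). The function name `quantize_dynamic_int8`, the default checkpoint, and the output directory are assumptions chosen for the example, not part of the original text.

```python
from pathlib import Path

# Hypothetical defaults for illustration only; swap in your own checkpoint and path.
DEFAULT_MODEL_ID = "distilbert-base-uncased-finetuned-sst-2-english"
DEFAULT_SAVE_DIR = "distilbert_onnx_int8"


def quantize_dynamic_int8(model_id: str = DEFAULT_MODEL_ID,
                          save_dir: str = DEFAULT_SAVE_DIR) -> Path:
    """Export a sequence-classification checkpoint to ONNX and quantize it dynamically."""
    # Imported lazily so this module can be loaded even where Optimum is not installed.
    from optimum.onnxruntime import ORTModelForSequenceClassification, ORTQuantizer
    from optimum.onnxruntime.configuration import AutoQuantizationConfig

    # Export the Transformers checkpoint to ONNX on the fly.
    onnx_model = ORTModelForSequenceClassification.from_pretrained(model_id, export=True)

    # Build a quantizer around the exported model.
    quantizer = ORTQuantizer.from_pretrained(onnx_model)

    # Dynamic quantization (is_static=False) needs no calibration dataset.
    # The avx512_vnni preset is an assumption about the target CPU.
    qconfig = AutoQuantizationConfig.avx512_vnni(is_static=False, per_channel=False)

    # Write the quantized ONNX model to save_dir.
    quantizer.quantize(save_dir=save_dir, quantization_config=qconfig)
    return Path(save_dir)
```

Calling `quantize_dynamic_int8()` downloads the checkpoint, exports it, and writes the quantized model to the chosen directory. Dynamic quantization is used here because it avoids a calibration step; static quantization would need a calibration dataset as well.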