Skip to content

How to choose the quantization method #2592

Answered by alexsu52
camhpj asked this question in Q&A
Discussion options

You must be logged in to vote

Hi @camhpj,

nncf.quantize_with_accuracy_control does not yet support the PyTorch model directly, instead you can export a PyTorch model to OpenVINO or ONNX and run nncf.quantize_with_accuracy_control on the exported model.

OpenVINO example: https://github.com/openvinotoolkit/nncf/tree/develop/examples/post_training_quantization/openvino/yolov8_quantize_with_accuracy_control
ONNX example: https://github.com/openvinotoolkit/nncf/tree/develop/examples/post_training_quantization/onnx/yolov8_quantize_with_accuracy_control

nncf.quantize_with_accuracy_control controls of accuracy metric by keeping the most impactful operations within the model in the original precision (OpenVINO documentation). …

Replies: 1 comment 1 reply

Comment options

You must be logged in to vote
1 reply
@camhpj
Comment options

Answer selected by camhpj
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants