How to create a quantized model in order to try out QNNPACK? #12
We haven't released the tooling for converting floating-point models to quantized (8-bit) models yet. You may use QNNPACK with the two pre-trained quantized models that we released: ResNet-50 and MobileNet v2.
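For reference, here is a minimal sketch of running one of those pre-trained quantized models with Caffe2's Predictor API. The file names and input shape below are assumptions, and the real preprocessing (resize, mean/scale) is model-specific:

```python
import numpy as np
from caffe2.python import workspace

# Load the serialized nets (file names are placeholders).
with open("init_net.pb", "rb") as f:
    init_net = f.read()
with open("predict_net.pb", "rb") as f:
    predict_net = f.read()

# Predictor runs init_net once to materialize the weights,
# then runs predict_net on each call.
predictor = workspace.Predictor(init_net, predict_net)

# NCHW float input; replace with real, model-specific preprocessing.
img = np.random.rand(1, 3, 224, 224).astype(np.float32)
results = predictor.run([img])
print(results[0].shape)
```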
I got this error when I tried the ResNet-50 model you mentioned above.
@biaoxiaoduan In quantized ResNet-50 the input blob is called "gpu_0/data_0" rather than "data".
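In case it helps others hitting the same error, a sketch of feeding the input under the model's actual blob name; the paths are placeholders, and the output blob is read from the net definition rather than guessed:

```python
import numpy as np
from caffe2.python import workspace
from caffe2.proto import caffe2_pb2

# Parse the serialized nets (file names are placeholders).
init_net = caffe2_pb2.NetDef()
with open("init_net.pb", "rb") as f:
    init_net.ParseFromString(f.read())
predict_net = caffe2_pb2.NetDef()
with open("predict_net.pb", "rb") as f:
    predict_net.ParseFromString(f.read())

workspace.RunNetOnce(init_net)  # materialize the weights

# The input must be fed under the model's actual blob name;
# feeding it as "data" is what triggers the error above.
img = np.random.rand(1, 3, 224, 224).astype(np.float32)
workspace.FeedBlob("gpu_0/data_0", img)

workspace.RunNetOnce(predict_net)
# Fetch whatever the net declares as its final output.
out = workspace.FetchBlob(predict_net.external_output[-1])
```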
Hi, I have the same problem. Do you have any idea how to solve it?
@xiezheng-cs As of today there's no out-of-the-box solution for Caffe2 or PyTorch, but the team is working on it.
Hi, do you have any updates on a tool, or any suggestions on how to manually convert an ONNX model to use quantized 8-bit integers?
ONNX doesn't support quantization as of today. Any chance you can convert the model to Caffe2 and quantize it to 8-bit using Caffe2?
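If conversion is an option, something along these lines should produce Caffe2 nets from an ONNX file. This is a sketch assuming the onnx package and Caffe2's ONNX backend are installed; "model.onnx" and the output paths are placeholders:

```python
import onnx
from caffe2.python.onnx.backend import Caffe2Backend

# Load the ONNX model and translate it into a pair of Caffe2 nets.
model = onnx.load("model.onnx")
init_net, predict_net = Caffe2Backend.onnx_graph_to_caffe2_net(model)

# Serialize the nets so they can be loaded with the Caffe2 workspace API.
with open("init_net.pb", "wb") as f:
    f.write(init_net.SerializeToString())
with open("predict_net.pb", "wb") as f:
    f.write(predict_net.SerializeToString())
```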
Do you have any estimate of when quantization will be available in ONNX? Our main model was unfortunately not written for Caffe2.
@hardsetting Quantization support in ONNX is unfortunately on hold right now; sorry, we don't have any ETA on it :(
Luckily I just managed to convert my model to Caffe2. What about the conversion tool from a regular Caffe2 model to a quantized Caffe2 model that @Maratyszcza was referring to? Do you have an ETA for that, or any news?
The latest ETA is one to two months :(
Ok, thank you for the answer.
I have the same problem. Have you solved it?
Hi,
I am excited that Facebook has revealed its own mobile inference framework. After reading the article about QNNPACK, I really want to try it out on my own caffemodel. (I know you have posted a quantized MobileNet v2, and it beats the TFLite one by 2x.) But how can I convert my prototxt and caffemodel to the model format that QNNPACK expects?
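As a possible first step only (this does not cover the quantization part, for which the comments above say no tooling had been released), Caffe2 ships a translator for Caffe prototxt/caffemodel pairs. A sketch, with placeholder paths, that produces a float Caffe2 net:

```python
from google.protobuf import text_format
from caffe2.python.caffe_translator import TranslateModel
from caffe2.proto import caffe_pb2

# Load the Caffe network definition (paths are placeholders).
caffenet = caffe_pb2.NetParameter()
with open("deploy.prototxt") as f:
    text_format.Merge(f.read(), caffenet)

# Load the pretrained Caffe weights.
weights = caffe_pb2.NetParameter()
with open("model.caffemodel", "rb") as f:
    weights.ParseFromString(f.read())

# Translate into a Caffe2 predict net plus the pretrained parameters.
predict_net, params = TranslateModel(caffenet, weights, is_test=True)
```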