Why are there so many reorders in a binary network IR trained from nncf? #5817
Comments
@dmitry-gorokhov Could you have a look?
@dmitry-gorokhov Here are the model ONNX files for your reference: https://drive.google.com/drive/folders/1EzkohZOjGg-Cul0IoLok9QJu1NO0GMQ2?usp=sharing
Any updates on this?
Hello @gj-raza. Thank you for your patience. The developer is investigating the case and this might take some time. Ref. 57585 |
Hi @gj-raza, I apologize for the delay in our response. The development team has confirmed that binary model support is currently experimental in OpenVINO. Additional optimizations still need to be implemented in order to speed up such binary models. I will convert this issue into a feature request. Regards,
Hi @gj-raza, thank you for your feedback on this case,
System information (version)
Detailed description
I trained a SqueezeNet classification model using the training pipeline script provided in the nncf examples, with XNOR binarization as the compression method, keeping the first conv layer in FP32 and the rest of the network binary. I converted it to ONNX and then to IR via Model Optimizer, and ran benchmarks with OpenVINO's benchmark tool. I found that, in addition to FakeQuantize, reorders happen before every binary conv layer execution. These reorders are costly and essentially take away the speed-up that binarization could provide; as a result, the FPS is even lower than that of a full FP32 model (benchmark report attached).
As you can see below, before every binary conv a reorder happens from nchw8c to nhwc, even though, per the binary convolution docs here, the binary conv layer can accept NCHW format.
My concern is: why do the Model Optimizer / Inference Engine add reorders before every binary conv, and is there a way to avoid them?
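For context on why these reorders are expensive: nchw8c is an internal blocked CPU layout, and converting activations to another layout means a full permute-and-copy of the tensor before every binary conv. A minimal NumPy sketch of a plain NCHW→NHWC reorder (the tensor shape here is made up for illustration; the real nchw8c format additionally blocks the channel dimension in groups of 8):

```python
import numpy as np

# Hypothetical activation tensor in plain NCHW layout:
# (batch=2, channels=8, height=4, width=4).
x_nchw = np.arange(2 * 8 * 4 * 4, dtype=np.float32).reshape(2, 8, 4, 4)

# A reorder to NHWC is a permutation of axes followed by a physical
# copy to make the data contiguous in the new layout -- this copy is
# the per-layer cost that shows up as "Reorder" in the benchmark report.
x_nhwc = np.ascontiguousarray(x_nchw.transpose(0, 2, 3, 1))

# Same elements, different memory layout.
assert x_nhwc.shape == (2, 4, 4, 8)
assert x_nhwc[0, 1, 2, 3] == x_nchw[0, 3, 1, 2]
```

Because this copy touches every element of every activation tensor, inserting one before each binary conv can easily outweigh the arithmetic savings of binarized weights.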
Detailed benchmark counters for full FP32 model and binary model are attached for reference.
benchmark_detailed_counters_report_squeezenet1_1_imagenet_binary_xnor.csv
benchmark_detailed_counters_report_squeezenet_1_1_fp32.csv
benchmark_report_squeezenet1_1_imagenet_binary_xnor.csv
benchmark_report_squeezenet_1_1_fp32.csv
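To quantify how much of the total inference time the reorders consume, the per-layer times in the detailed counters report can be summed by layer type. A small sketch, assuming a semicolon-separated CSV with `layerType` and `realTime (ms)` columns (these column names and the delimiter are assumptions; adjust them to match the actual header of the attached reports):

```python
import csv
from collections import defaultdict

def time_by_layer_type(csv_path, type_col="layerType", time_col="realTime (ms)"):
    """Sum the reported execution time per layer type.

    Column names and the ';' delimiter are assumptions about the
    benchmark tool's detailed-counters report format; change them
    to match the real header if it differs.
    """
    totals = defaultdict(float)
    with open(csv_path, newline="") as f:
        reader = csv.DictReader(f, delimiter=";")
        for row in reader:
            try:
                totals[row[type_col]] += float(row[time_col])
            except (KeyError, TypeError, ValueError):
                # Skip summary/malformed rows without parsable timing info.
                continue
    return dict(totals)
```

Comparing `totals["Reorder"]` against the convolution time in both reports should make the overhead visible directly.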