
DNN/ONNX: outputs registration regression, feature request for new version of Clip operator #21698

Closed
cesarpgouveia opened this issue Mar 7, 2022 · 8 comments

@cesarpgouveia
Contributor

System information (version)
  • OpenCV => 4.5.5
  • Operating System / Platform => Windows 64 Bit
  • Compiler => Visual Studio 2019
Detailed description

I tried to run inference with a Selfie Segmenter ONNX model (you can find the model here: https://github.com/PINTO0309/PINTO_model_zoo/tree/main/109_Selfie_Segmentation); however, I get NaN for all output values.

Steps to reproduce

You can reproduce this issue by running the following script with OpenCV 4.5.5:

#include <iostream>

#include <opencv2/imgcodecs.hpp>
#include <opencv2/imgproc.hpp>
#include <opencv2/dnn.hpp>
#include <opencv2/core.hpp>

int main()
{
    cv::Size inputSizeNewBarracuda = cv::Size(256, 256);

    std::string imagefilename = "C:/Lixo/SantaNoel.jpg";
    std::string newBarracuda = "C:/Users/cesar.gouveia/Downloads/saved_model_openvino/model_float32.onnx";

    cv::dnn::Net net = cv::dnn::readNetFromONNX(newBarracuda);
    net.setPreferableBackend(cv::dnn::DNN_BACKEND_OPENCV);
    net.setPreferableTarget(cv::dnn::DNN_TARGET_CPU);

    cv::Mat img = cv::imread(imagefilename);
    cv::Mat resized;
    cv::resize(img, resized, inputSizeNewBarracuda);

    std::vector<cv::Mat> imgBatch = { resized };
    bool swapRBChannels = false;
    cv::Mat blob = cv::dnn::blobFromImages(imgBatch, 1.0, cv::Size(), cv::Scalar(), swapRBChannels, false, CV_32F);
    blob = blob.reshape(1, { 1, inputSizeNewBarracuda.height, inputSizeNewBarracuda.width, 3 }); // because the model has input in channels last

    net.setInput(blob);

    std::vector<cv::Mat> outputs;
    outputs.clear();

    std::vector<cv::String> unconnectedOutLayerNames = net.getUnconnectedOutLayersNames();
    net.forward(outputs, unconnectedOutLayerNames);

    const cv::Mat& targetMat = outputs[0];
    const float* targetBuffer = (float*)targetMat.data;

    std::cout << targetMat.size[0] << std::endl; // 1
    std::cout << targetMat.size[1] << std::endl; // 256
    std::cout << targetMat.size[2] << std::endl; // 256
    std::cout << targetMat.size[3] << std::endl; // 1

    // Access the contiguous output buffer
    for (size_t i = 0; i < 256 * 256; i++)
    {
        std::cout << targetBuffer[i] << std::endl;
    }

    // Access the output through a 3D Mat (reshape returns a new header, it does not modify targetMat in place)
    cv::Mat target3d = targetMat.reshape(1, { 1, 256, 256 });
    for (int o = 0; o < target3d.size[0]; o++)
    {
        for (int i = 0; i < target3d.size[1]; i++)
        {
            for (int j = 0; j < target3d.size[2]; j++)
            {
                std::cout << target3d.at<float>(o, i, j);
            }
        }
    }
    
    std::cout << "Finished" << std::endl;
}

The model can be downloaded from the link I provided. This is the first channels-last model I have used with OpenCV DNN; all my other models are channels-first and I have never seen this behavior before. I tried accessing both the Mat and the contiguous memory buffer, but neither worked.
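For reference, rather than eyeballing the printed values, here is a small sketch to count the NaNs in the first output (this is just an illustrative helper, not part of the failing path; it only assumes the output is a continuous CV_32F Mat, which DNN outputs are):

#include <cmath>

// Counts NaN elements in a continuous CV_32F blob (e.g. outputs[0] from the script above).
static size_t countNaNs(const cv::Mat& blob)
{
    const float* p = blob.ptr<float>();
    size_t nans = 0;
    for (size_t i = 0; i < blob.total(); i++)
        if (std::isnan(p[i])) nans++;
    return nans;
}

With the model above this would report all 256 * 256 values as NaN, matching what the printouts show.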

Thanks,
César.

@berak
Contributor

berak commented Mar 8, 2022

"channels last" model

are you sure about this?
the pdf here says so, but it might simply not be accurate;
it also states there should be 2 output layers, but i can find only one.
also, if you look at the IE xml files in the zip, those clearly have BCHW order.

i checked your code using the model_float32.pb and got decent results without reordering the blob channels

however, i could NOT import the model_float32.onnx in 4.5.5-dev:

[ERROR:0@0.693] global C:\p\opencv\modules\dnn\src\onnx\onnx_importer.cpp (909) handleNode DNN/ONNX: ERROR during processing node with 3 inputs and 1 outputs: [Clip]:(Relu6:0) from domain='ai.onnx'
OpenCV: terminate handler is called! The last OpenCV error is:
OpenCV(4.5.5-dev) Error: Unspecified error (> Node [Clip@ai.onnx]:(Relu6:0) parse error: OpenCV(4.5.5-dev) C:\p\opencv\modules\dnn\src\onnx\onnx_importer.cpp:1613: error: (-2:Unspecified error) in function 'void cv::dnn::dnn4_v20211220::ONNXImporter::parseClip(cv::dnn::dnn4_v20211220::LayerParams&, const opencv_onnx::NodeProto&)'
> >  (expected: 'node_proto.input_size() == 1'), where
> >     'node_proto.input_size()' is 3
> > must be equal to
> >     '1' is 1
> ) in handleNode, file C:\p\opencv\modules\dnn\src\onnx\onnx_importer.cpp, line 928

while this works ok with a previous one (pip install opencv-python==4.5.4.60):

>>> import cv2
>>> print(cv2.__version__)
4.5.4
>>> cv2.dnn.readNet("model_float32.onnx")
<dnn_Net 0x7f99b27e99b0>

there seems to be some regression here

@cesarpgouveia, can you check the version again? are you sure it's 4.5.5?

@cesarpgouveia
Contributor Author

are you sure about this?
the pdf here says so, but it might simply not be accurate;
it also states there should be 2 output layers, but i can find only one.
also, if you look at the IE xml files in the zip, those clearly have BCHW order.

Yes, if you load the ONNX model into Netron, for example, you will see that the input is channels-last; you can check it in the image below:
[Netron screenshot showing the model's channels-last input]

i checked your code using the model_float32.pb and got decent results without reordering the blob channels

I haven't tried the TensorFlow model yet, but that might be a good option! I will try it today and get back to you.

@cesarpgouveia, can you check the version again? are you sure it's 4.5.5?

Yes, it's release 4.5.5 with OpenVINO.

@berak
Contributor

berak commented Mar 8, 2022

yea, seems you're right about the onnx; i would not have expected such fundamental differences between different exports of the same model.

however, this seems wrong:

blob = blob.reshape(1, { 1, inputSizeNewBarracuda.height, inputSizeNewBarracuda.width, 3 }); // because the model has input in channels last

you can't simply reshape from BCHW to BHWC; the memory needs to be reshuffled (like a transpose or permute op).
maybe you can just avoid blobFromImages() and set up your blob like:

resized.convertTo( resized, CV_32F );
int sz[] = {1, inputSizeNewBarracuda.height, inputSizeNewBarracuda.width, 3};
Mat blob(sz, 4, CV_32F, resized.ptr<float>(0));
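
an equivalent sketch that skips the raw Mat constructor: since resized is already stored HWC in memory, converting to float and reshaping gives a 4D NHWC view over the same data (whether the model wants raw 0..255 values or a 1.0/255 scale is an assumption to double-check):

cv::Mat f32;
resized.convertTo(f32, CV_32F);                                  // HxWx3, interleaved channels
cv::Mat nhwcBlob = f32.reshape(1, { 1, f32.rows, f32.cols, 3 }); // same memory, NHWC layout
net.setInput(nhwcBlob);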

@cesarpgouveia
Contributor Author

cesarpgouveia commented Mar 8, 2022

you can't simply reshape from BCHW to BHWC; the memory needs to be reshuffled (like a transpose or permute op).

Yes, you are right, the memory needs to be reshuffled, of course. However, the worst that could happen in that case is that the results would not match; instead I get only NaN, which suggests the problem is something else.

I tried using your approach:

Mat blob(sz, 4, CV_32F, resized.ptr(0));

I only swapped the dimensions argument and sz because I think they were in the wrong order.

Unfortunately I get the same output: a Mat filled with NaN.

Do you have any more ideas on why this could be happening? Thank you very much for the help you are providing!

@cesarpgouveia
Contributor Author

I haven't tried the TensorFlow model yet, but that might be a good option! I will try it today and get back to you.

I tried the .pb model and yes, inference runs with no problems.

@berak
Contributor

berak commented Mar 10, 2022

yep, just give it an input scale factor of 1.0/255, then it produces less noise (almost a perfect mask).
i also tried the onnx version on colab -- all fine using 4.5.4 --
it just cannot be loaded into a more recent cv2!
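
for reference, a minimal sketch of that with the .pb export (the 1.0/255 scale is the one mentioned above; the 256x256 input size and swapRB=false are assumptions carried over from the original script):

// blobFromImage produces an NCHW blob; per the comments above the .pb export
// works without reordering channels, so no manual NHWC handling is needed here.
cv::dnn::Net tfNet = cv::dnn::readNetFromTensorflow("model_float32.pb");
cv::Mat blob = cv::dnn::blobFromImage(resized, 1.0 / 255.0, cv::Size(256, 256),
                                      cv::Scalar(), /*swapRB=*/false, /*crop=*/false);
tfNet.setInput(blob);
cv::Mat mask = tfNet.forward();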

@asmorkalov asmorkalov added category: dnn (onnx) ONNX support issues in DNN module category: dnn labels Mar 11, 2022
@asmorkalov
Contributor

@rogday could you take a look at the issue?

@rogday
Member

rogday commented Mar 14, 2022

@cesarpgouveia, this network doesn't work on the current master for the following reasons:

  1. Something happened in the new output handling introduced by (3.4) dnn: support outputs registration under new names #21540; this needs further investigation.
  2. We never supported Clip with 3 inputs; now there is an assert for that, and the min/max attributes are set properly (as per the previous version of the Clip operator; see the sketch below for the operator semantics). It worked before because, somewhere down the road, our min/max defaults happened to be the same as yours.
  3. The assert was added after the fix for the default values, so your version of OpenCV doesn't contain it yet; you effectively have no Clip at all, hence the NaNs.
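
For context, a minimal sketch of the Clip semantics involved (the 0/6 bounds are inferred from the node name Relu6:0, not read from the model itself). Opset <= 6 carries min/max as attributes, while opset >= 11 carries them as optional 2nd/3rd inputs, which is the 3-input form the importer rejected above:

#include <opencv2/core.hpp>
#include <iostream>

int main()
{
    // ONNX Clip: y = min(max(x, minVal), maxVal); the bounds default to
    // -inf/+inf when absent. With no Clip at all, a ReLU6-style activation is
    // simply skipped, so nothing bounds the values flowing downstream.
    cv::Mat x = (cv::Mat_<float>(1, 5) << -3.f, 0.5f, 4.f, 6.f, 10.f);
    cv::Mat y;
    cv::max(x, 0.0, y); // lower bound (the ReLU part)
    cv::min(y, 6.0, y); // upper bound
    std::cout << y << std::endl; // prints [0, 0.5, 4, 6, 6]
    return 0;
}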

@rogday rogday added the confirmed There is stable reproducer / investigation complete label Mar 14, 2022
@rogday rogday changed the title Can't infer using Selfie Segmenter (MobileNetV3-like) ONNX using OpenCV 4.5.5 DNN/ONNX: outputs registration regression, feature request for new version of Clip operator Mar 14, 2022
@rogday rogday linked a pull request Apr 4, 2022 that will close this issue