Support Transpose op in TFlite #25297

CNOCycle · 2024-03-29T14:21:22Z

Pull Request Readiness Checklist

Merge with extra: opencv/opencv_extra#1168

The purpose of this PR is to introduce support for the Transpose op in TFlite format and to add a shape comparison between the output tensors and the references. In some occasional cases, the shape of the output tensor is [1,4,1,1], while the shape of the reference tensor is [1,4]. Consequently, the norm check incorrectly reports that the test has passed, as the residual is zero.

Below is a Python script for generating testing data. The generated data can be integrated into the repo opencv_extra.

import numpy as np
import tensorflow as tf

PREFIX_TFL = '/path/to/opencv_extra/testdata/dnn/tflite/'

def generator(input_tensor, model, saved_name):

    # convert keras model to .tflite format
    converter = tf.lite.TFLiteConverter.from_keras_model(model)
    #converter.optimizations = [tf.lite.Optimize.DEFAULT]
    converter.optimizations = [None]
    tflite_model = converter.convert()
    with open(f'{PREFIX_TFL}/{saved_name}.tflite', 'wb') as f:
        f.write(tflite_model)

    # save the input tensor to .npy
    if input_tensor.ndim == 4:
        opencv_tensor = np.transpose(input_tensor, (0,3,1,2))
    else:
        opencv_tensor = input_tensor
    opencv_tensor = np.copy(opencv_tensor, order='C').astype(np.float32)
    np.save(f'{PREFIX_TFL}/{saved_name}_inp.npy', opencv_tensor)

    # generate output tenosr and save it to .npy
    mat_out = model(input_tensor).numpy()
    mat_out = np.copy(mat_out, order='C').astype(np.float32)
    if mat_out.ndim == 4:
        mat_out = np.transpose(mat_out, (0,3,1,2))
    interpreter = tf.lite.Interpreter(model_content=tflite_model)
    out_name = interpreter.get_output_details()[0]['name']
    np.save(f'{PREFIX_TFL}/{saved_name}_out_{out_name}.npy', mat_out)

def build_transpose():

    model_name = "keras_permute"
    mat_in = np.array([[[1,2,3], [4,5,6]]], dtype=np.float32)

    model = tf.keras.Sequential()
    model.add(tf.keras.Input(shape=(2,3)))
    model.add(tf.keras.layers.Permute((2,1)))
    model.summary()

    generator(mat_in, model, model_name)

if __name__ == '__main__':
    build_transpose()

I agree to contribute to the project under Apache 2 License.
To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV
The PR is proposed to the proper branch
There is a reference to the original bug report and related work
There is accuracy test, performance test and test data in opencv_extra repository, if applicable
Patch to opencv_extra has the same branch name.
The feature is well documented and sample code can be built with the project CMake

dkurt · 2024-03-30T10:41:05Z

@CNOCycle, please open a PR to https://github.com/opencv/opencv_extra/ with the same branch name as here.

modules/dnn/src/tflite/tflite_importer.cpp

modules/dnn/test/test_tflite_importer.cpp

CNOCycle · 2024-04-01T06:28:13Z

Hi, @dkurt,

Thank you for your valuable feedback. After careful consideration, I have decided to split this PR into two separate ones. The first will focus on implementing shape checkers, while the second will address the support for the Transpose op.

Regarding the Transpose op support, I have chosen to postpone it until we resolve the shape issues. There are a couple of reasons for this decision. Firstly, I need additional time to seamlessly integrate my stand-alone script for generating testing data into the existing script in the repository opencv/opencv_extra. Secondly, upon conducting tests in my local environment, I discovered that the layout issue is more intricate than initially anticipated.

Specifically, I added the shape checker as you suggested (ASSERT_EQ(ref.size, outs[i].size);) on the branch 4.x (eba158f) but encountered failures in at least two tests. The details are outlined below:

[ RUN      ] Test_TFLite.face_landmark/0, where GetParam() = OCV/CPU
... ...
Expected equality of these values:
  ref.size
    Which is: 1 x 1 x 1 x 1404
  outs[i].size
    Which is: 1 x 1404 x 1 x 1
[  FAILED  ] Test_TFLite.face_landmark/0, where GetParam() = OCV/CPU (3133 ms)

[ RUN      ] Test_TFLite.selfie_segmentation/0, where GetParam() = OCV/CPU
... ...
Expected equality of these values:
  ref.size
    Which is: 256 x 256
  outs[i].size
    Which is: 1 x 1 x 256 x 256
[  FAILED  ] Test_TFLite.selfie_segmentation/0, where GetParam() = OCV/CPU (5182 ms)

[==========] 8 tests from 1 test case ran. (382801 ms total)
[  PASSED  ] 6 tests.
[  FAILED  ] 2 tests, listed below:
[  FAILED  ] Test_TFLite.face_landmark/0, where GetParam() = OCV/CPU
[  FAILED  ] Test_TFLite.selfie_segmentation/0, where GetParam() = OCV/CPU

 2 FAILED TESTS

These errors are reproducible on RISC-V RVV with Debug mode and x64 with Release mode. The introduced shape checker effectively identifies the inconsistent shape issue. Do you have any insights or suggestions regarding these errors?

dkurt · 2024-04-01T10:39:37Z

@CNOCycle, this is a data layout issue I mentioned before. TFLite/TensorFlow work with NHWC by default, OpenCV with NCHW. So during a layer import you have to change axes order:

std::vector<int> perm = allTensors[op.inputs()->Get(1)];

DataLayout inpLayout = layouts[op.inputs()->Get(0)];
if (inpLayout == DNN_LAYOUT_NHWC && perm.size() == 4) {
    static const int order[] = {0, 2, 3, 1};  // NHWC -> NCHW
    for (int& dim: perm) {
        CV_Assert(dim >= 0 && dim < 4);
        dim = order[dim];
    }
}

If in your test example inpLayout is different, please add a simple Conv2D layer before permutation to TFLite model.

CNOCycle · 2024-04-01T11:45:30Z

Thank you for providing a clear explanation of the data layout issue. It's important to note that the failed tests I mentioned earlier are not new tests related to the Transpose op; they are existing tests in the opencv repo.

Upon direct inspection of the .tflite models, I observed that the shapes of the output tensors from face_landmark.tflite and selfie_segmentation.tflite are [1,1,1,1404] and [1, 256, 256, 1], respectively. Based on my understanding, the expected shapes in OpenCV should be [1, 1404, 1, 1] and [1, 1, 256, 256]. Consequently, it appears that there is a missing axis swap in the first model, and the shape of the reference tensor in the second model seems to be incorrect. Fortunately, these errors can be easily rectified by updating the shapes of the reference tensors. Please correct me if I am mistaken.

dkurt · 2024-04-05T06:52:41Z

@CNOCycle, thanks for the observation about shapes. I verified that test data for both models were saved in native view, without necessary reshaping. We can fix it by updating test data but I prefer to just add a workaround in test engine:

    ASSERT_EQ(outs.size(), outNames.size());
    for (int i = 0; i < outNames.size(); ++i) {
        Mat ref = blobFromNPY(findDataFile(format("dnn/tflite/%s_out_%s.npy", modelName.c_str(), outNames[i].c_str())));
        if (modelName == "face_landmark" || modelName == "selfie_segmentation") {
            ref = ref.reshape(1, 1);
            outs[i] = outs[i].reshape(1, 1);
        }
        normAssert(ref, outs[i], outNames[i].c_str(), l1, lInf);
    }

Note that normAssert will check the shapes.

asmorkalov · 2024-04-09T06:26:53Z

@CNOCycle I merged another patch for TFLite tests and it generates conflict. Could you rebase your PR and fix the conflict.

CNOCycle · 2024-04-09T13:06:29Z

Apologies for the delayed response. I encountered a link error while attempting to build the scalable RVV on Debug mode from the latest 4.x branch. I'm unsure of the origin of this error. Nevertheless, I will verify the correctness of this PR using x64 mode or another mode, and promptly push a new one based on the latest branch.

asmorkalov · 2024-04-11T11:07:30Z

@dkurt is it ready for merge?

dkurt · 2024-04-11T12:48:14Z

modules/dnn/src/tflite/tflite_importer.cpp

+        else if (perm[1] == 3 && perm[2] == 2 && perm[3] == 1) {
+            std::vector<int> orderLP = {0, 2, 1, 3};
+            layerParams.set("order", DictValue::arrayInt<int*>(orderLP.data(), orderLP.size()));
+        }


Also, change layout of output:

opencv/modules/dnn/src/tensorflow/tf_importer.cpp

Line 1339 in 197626a

void TFImporter::parseTranspose(tensorflow::GraphDef& net, const tensorflow::NodeDef& layer, LayerParams& layerParams)

dkurt · 2024-04-11T12:49:11Z

modules/dnn/src/tflite/tflite_importer.cpp

+        }
+        if (perm[1] == 1 && perm[2] == 2 && perm[3] == 3) {
+            std::vector<int> orderLP = {0, 1, 2, 3};
+            layerParams.set("order", DictValue::arrayInt<int*>(orderLP.data(), orderLP.size()));


reduce code duplications

CNOCycle · 2024-04-11T14:00:31Z

Hi @dkurt,

Thank you for the valuable feedback. Concerning the code complexity issues, it's essential to note that the transpose operation in TF implementation also encompasses six cases to handle data layout.

opencv/modules/dnn/src/tensorflow/tf_importer.cpp

Lines 1354 to 1396 in 197626a

    
           if (inpLayout == DNN_LAYOUT_NHWC) 
        
           { 
        
               if (permData[0] == 0 && permData[1] == 3 && permData[2] == 1 && permData[3] == 2) 
        
               { 
        
                   // in TensorFlow: NHWC->NCHW 
        
                   // in OpenCV: NCHW->NCHW 
        
                   data_layouts[name] = DNN_LAYOUT_NCHW; 
        
               } 
        
               else if (permData[0] == 0 && permData[1] == 1 && permData[2] == 2 && permData[3] == 3) 
        
               { 
        
                   // in TensorFlow: NHWC->NHWC 
        
                   // in OpenCV: NCHW->NCHW 
        
                   data_layouts[name] = DNN_LAYOUT_NHWC; 
        
               } 
        
               else if (permData[0] == 0 && permData[1] == 3 && permData[2] == 2 && permData[3] == 1) 
        
               { 
        
                   // in TensorFlow: NHWC->NCWH 
        
                   // in OpenCV: NCHW->NCWH 
        
                   int permData[] = {0, 1, 3, 2}; 
        
                   layerParams.set("order", DictValue::arrayInt<int*>(permData, perm.total())); 
        
                   data_layouts[name] = DNN_LAYOUT_NCHW;  // we keep track NCHW because channels position only matters 
        
                   type = "Permute"; 
        
               } 
        
               else 
        
                   CV_Error(Error::StsParseError, "Only NHWC <-> NCHW permutations are allowed."); 
        
           } 
        
           else if (inpLayout == DNN_LAYOUT_NCHW) 
        
           { 
        
               if (permData[0] == 0 && permData[1] == 2 && permData[2] == 3 && permData[3] == 1) 
        
               { 
        
                   // in TensorFlow: NCHW->NHWC 
        
                   // in OpenCV: NCHW->NCHW 
        
                   data_layouts[name] = DNN_LAYOUT_NHWC; 
        
               } 
        
               else if (permData[0] == 0 && permData[1] == 1 && permData[2] == 2 && permData[3] == 3) 
        
               { 
        
                   // in TensorFlow: NCHW->NCHW 
        
                   // in OpenCV: NCHW->NCHW 
        
                   data_layouts[name] = DNN_LAYOUT_NCHW; 
        
               } 
        
               else 
        
                   CV_Error(Error::StsParseError, "Only NHWC <-> NCHW permutations are allowed."); 
        
           }

The rationale behind specifying these six cases is elucidated in the comment:

Since applying the NCHW permutation to a NCHW tensor mirrors the NHWC permutation applied to an NHWC tensor, n additional NHWC -> NCHW conversion is requred to match the data layout.

To ensure alignment with the NCHW layout, merely applying a NHWC -> NCHW conversion to permutation vector is insufficient. The sequence of the order vector varies based on the given permutation vector.

Allow me to illustrate what I mean through the following demonstration. Suppose we have a vector A = [N, H, W, C] and a permutation vector P = [0, 1, 2, 3]. After the transpose operation, the output should be [N, H, W, C], while the expected format for OpenCV is [N, C, H, W]. Once the input is represented in a channel-first format, denoted as At = [N, C, H, W], the permutation vector should be adjusted accordingly. Applying NHWC -> NCHW conversion to the permutation vector, it becomes Pt = [0, 3, 1, 2]. However, applying Pt on At results in [N, W, C, H], which is an incorrect outcome.

As evidenced, when providing At = [N, C, H, W] with P = [0, 1, 2, 3], the expected output is [N, C, H, W], resulting in the order vector being set to [0, 1, 2, 3]. The mapping of the order vector for six cases is shown below:

A = [N, H, W, C], At = [N, C, H, W]
case1: P = [0, 1, 2, 3] ->Ap = [N, H, W, C] -> Acv = [N, C, H, W] -> order = [0, 1, 2, 3]
case2: P = [0, 1, 3, 2] ->Ap = [N, H, C, W] -> Acv = [N, W, H, C] -> order = [0, 3, 2, 1]
case3: P = [0, 2, 1, 3] ->Ap = [N, W, H, C] -> Acv = [N, C, W, H] -> order = [0, 1, 3, 2]
case4: P = [0, 2, 3, 1] ->Ap = [N, W, C, H] -> Acv = [N, H, W, C] -> order = [0, 2, 3, 1]
case5: P = [0, 3, 1, 2] ->Ap = [N, C, H, W] -> Acv = [N, W, C, H] -> order = [0, 3, 1, 2]
case6: P = [0, 3, 2, 1] ->Ap = [N, C, W, H] -> Acv = [N, H, C, W] -> order = [0, 2, 1, 3]

modules/dnn/src/tflite/tflite_importer.cpp

asmorkalov · 2024-04-16T08:37:03Z

modules/dnn/src/tflite/tflite_importer.cpp:734: tab in indent.
+	// For implementation details, please refer to the disscusion:

CNOCycle · 2024-04-24T02:25:24Z

Hi @dkurt

Is there any progress on this PR?

asmorkalov · 2024-04-24T08:48:36Z

Run cd /home/ci/opencv
modules/dnn/src/tflite/tflite_importer.cpp:734: tab in indent.
+	// For implementation details, please refer to the disscusion:
modules/dnn/src/tflite/tflite_importer.cpp:735: tab in indent.
+	// https://github.com/opencv/opencv/pull/25297#issuecomment-2049762298

dkurt · 2024-04-24T08:49:48Z

@CNOCycle, please consider review comments

CNOCycle · 2024-04-24T12:56:51Z

@dkurt

I have fixed the tab issue and provided 5 test cases to verify correctness. If still have any concerns about this PR, please let me know. Thanks.

asmorkalov · 2024-05-06T05:47:45Z

@dkurt could you take a look again?

CNOCycle · 2024-05-06T05:55:14Z

sorry for late reply. I will re-submit a revised one today.

asmorkalov requested a review from dkurt March 29, 2024 14:35

asmorkalov added category: dnn feature labels Mar 29, 2024

asmorkalov added this to the 4.10.0 milestone Mar 29, 2024

dkurt reviewed Mar 30, 2024

View reviewed changes

modules/dnn/src/tflite/tflite_importer.cpp Show resolved Hide resolved

dkurt reviewed Mar 30, 2024

View reviewed changes

modules/dnn/test/test_tflite_importer.cpp Outdated Show resolved Hide resolved

dkurt mentioned this pull request Mar 31, 2024

Support more ops in .tflite format #25296

Open

dkurt self-assigned this Apr 5, 2024

CNOCycle mentioned this pull request Apr 8, 2024

Add a shape checker for tflite models #25372

Merged

6 tasks

CNOCycle force-pushed the tflite/transpose branch from f920e1e to 6e262ff Compare April 9, 2024 16:08

CNOCycle mentioned this pull request Apr 9, 2024

Support Transpose op in TFlite opencv/opencv_extra#1168

Open

dkurt reviewed Apr 11, 2024

View reviewed changes

dkurt reviewed Apr 14, 2024

View reviewed changes

modules/dnn/src/tflite/tflite_importer.cpp Show resolved Hide resolved

CNOCycle added 7 commits May 6, 2024 14:16

Support Transpose op in .tflite model

f61eb43

Add unittest for TRANSPOSE op in .tflite models

18f8eac

Fix for incorrect permuations

f55bcc6

Disable a 4d permutation test

33f44c1

Add the missing reference

735feab

Fix for the tab issue

cf65653

Correct output layout

24259f2

CNOCycle force-pushed the tflite/transpose branch from 9eb19ba to 24259f2 Compare May 6, 2024 14:20

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Support Transpose op in TFlite #25297

Support Transpose op in TFlite #25297

CNOCycle commented Mar 29, 2024 •

edited by dkurt

dkurt commented Mar 30, 2024

CNOCycle commented Apr 1, 2024

dkurt commented Apr 1, 2024 •

edited

CNOCycle commented Apr 1, 2024

dkurt commented Apr 5, 2024

asmorkalov commented Apr 9, 2024

CNOCycle commented Apr 9, 2024

asmorkalov commented Apr 11, 2024

dkurt Apr 11, 2024

dkurt Apr 11, 2024

dkurt Apr 24, 2024

CNOCycle commented Apr 11, 2024

asmorkalov commented Apr 16, 2024

CNOCycle commented Apr 24, 2024

asmorkalov commented Apr 24, 2024

dkurt commented Apr 24, 2024

CNOCycle commented Apr 24, 2024

asmorkalov commented May 6, 2024

CNOCycle commented May 6, 2024

Support Transpose op in TFlite #25297

Are you sure you want to change the base?

Support Transpose op in TFlite #25297

Conversation

CNOCycle commented Mar 29, 2024 • edited by dkurt

Pull Request Readiness Checklist

dkurt commented Mar 30, 2024

CNOCycle commented Apr 1, 2024

dkurt commented Apr 1, 2024 • edited

CNOCycle commented Apr 1, 2024

dkurt commented Apr 5, 2024

asmorkalov commented Apr 9, 2024

CNOCycle commented Apr 9, 2024

asmorkalov commented Apr 11, 2024

dkurt Apr 11, 2024

Choose a reason for hiding this comment

dkurt Apr 11, 2024

Choose a reason for hiding this comment

dkurt Apr 24, 2024

Choose a reason for hiding this comment

CNOCycle commented Apr 11, 2024

asmorkalov commented Apr 16, 2024

CNOCycle commented Apr 24, 2024

asmorkalov commented Apr 24, 2024

dkurt commented Apr 24, 2024

CNOCycle commented Apr 24, 2024

asmorkalov commented May 6, 2024

CNOCycle commented May 6, 2024

CNOCycle commented Mar 29, 2024 •

edited by dkurt

dkurt commented Apr 1, 2024 •

edited