(5.x) Merge 4.x #24254

asmorkalov · 2023-09-11T10:38:10Z

OpenCV Contrib: #3559
OpenCV Extra: #1093

#23607 from alexander-varjo:alexander-varjo-patch-1
#23734 from seanm:unaligned-copy
#23904 from kai-waang:removing-unreachable
#23965 from fengyuentau:broadcast_to
#23980 from hanliutong:rewrite-core
#24012 from cudawarped:videocapture_raw_read
#24086 from Kumataro:fix24081
#24089 from cudawarped:cuda_gpumat_fix_convertTo_copyTo_bindings
#24098 from 0xMihir:4.x
#24116 from chaebkimm/update-samples-python-tst_scene_render
#24120 from dkurt:actualize_dnn_links
#24122 from fengyuentau:remove_tengine
#24128 from CSBVision:CSBVision-patch-1
#24133 from alexlyulkov:al/fixed-msmf-webcam
#24138 from mshabunin:fix-gst-plugin-camera
#24139 from AleksandrPanov:fix_refineDetectedMarkers
#24140 from sthibaul:4.x
#24142 from beanjoy:4.x
#24143 from seanm:sprintf4
#24150 from DeePingXian:4.x
#24153 from Ginkgo-Biloba:ipp-warp-affine
#24156 from zihaomu:fix_24041
#24157 from dkurt:gapi_ov_optional
#24160 from mshabunin:update-ade
#24167 from autoantwort:missing-include
#24172 from CSBVision:CSBVision-patch-1-1
#24176 from dkurt:correct_perf_test
#24178 from dmatveev:dm/streaming_queue
#24179 from Kumataro:fix24145
#24180 from MambaWong:4.x
#24186 from dkurt:ts_fixture_constructor_skip
#24189 from dkurt:skip_ov_max_pool_ov
#24194 from vrabaud:compilation_fix
#24196 from dkurt:ov_backend_cleanups
#24199 from Kumataro:fixlibTiffSite
#24203 from thesamesam:arm64-fp16
#24204 from georgthegreat:mser-license
#24209 from alexlyulkov:al/fixed-mjpeg
#24211 from philsc:fix-asan-crash
#24214 from dkurt:distanceTransform_big_step
#24215 from Kumataro:fix24213
#24216 from dkurt:inter_lines_less_compute
#24218 from CSBVision:patch-5
#24221 from WanliZhong:issue_24016
#24223 from asmorkalov:as/24186_revert
#24227 from georgthegreat:missing-includes
#24228 from AleksandrPanov:fix_extendDictionary
#24232 from georgthegreat:missing-qualifiers
#24244 from alexlyulkov:al/update-dnn-js-face-recognition-sample
#24245 from alexlyulkov/al/update-fast-neural-style-dnn-sample
#24246 from asmorkalov:as/merge_input_check2
#24248 from opencv-pushbot:gitee/alalek/issue_22751
#24251 from dkurt:ov_build_debug
#24252 from opencv-pushbot:gitee/alalek/refactor_24218

Previous "Merge 4.x": #24119

force_builders=Linux AVX2,Custom
build_image:Docs=docs-js:18.04
build_image:Custom=javascript
buildworker:Custom=linux-1,linux-4,linux-f1

Although acceptible to Intel CPUs, it's still undefined behaviour according to the C++ standard. It can be replaced with memcpy, which makes the code simpler, and it generates the same assembly code with gcc and clang with -O2 (verified with godbolt). Also expanded the test to include other little endian CPUs by testing for __LITTLE_ENDIAN__.

… (rawMode == true)

dnn: cleanup of tengine backend opencv#24122 🚀 Cleanup for OpenCV 5.0. Tengine backend is added for convolution layer speedup on ARM CPUs, but it is not maintained and the convolution layer on our default backend has reached similar performance to that of Tengine. Tengine backend related PRs: - opencv#16724 - opencv#18323 ### Pull Request Readiness Checklist See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request - [x] I agree to contribute to the project under Apache 2 License. - [x] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV - [x] The PR is proposed to the proper branch - [x] There is a reference to the original bug report and related work - [x] There is accuracy test, performance test and test data in opencv_extra repository, if applicable Patch to opencv_extra has the same branch name. - [x] The feature is well documented and sample code can be built with the project CMake

videoio: doc: add odd width or height limitation for FFMPEG

…rtTo_copyTo_bindings `cuda`: Fix `GpuMat::copyTo` and `GpuMat::converTo` python bindings

…LUGIN_ALL

…tst_scene_render Fix python sample code (tst_scene_render) opencv#24116 Fix bug of python sample code (samples/python/tst_scene_render.py) when backGr or fgr is None (opencv#24114) 1) pass shape tuple to np.zeros arguments instead of integers 2) change np.int to int ### Pull Request Readiness Checklist See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request - [o] I agree to contribute to the project under Apache 2 License. - [o] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV - [o] The PR is proposed to the proper branch - [o] There is a reference to the original bug report and related work - [o] There is accuracy test, performance test and test data in opencv_extra repository, if applicable Patch to opencv_extra has the same branch name. - [o] The feature is well documented and sample code can be built with the project CMake

Fixed bug when MSMF webcamera doesn't start when build with VIDEOIO_PLUGIN_ALL

It has the usual Unix filesystem operations.

@brief

Rewrite Universal Intrinsic code by using new API: Core module. opencv#23980 The goal of this PR is to match and modify all SIMD code blocks guarded by `CV_SIMD` macro in the `opencv/modules/core` folder and rewrite them by using the new Universal Intrinsic API. The patch is almost auto-generated by using the [rewriter](https://github.com/hanliutong/rewriter), related PR opencv#23885. Most of the files have been rewritten, but I marked this PR as draft because, the `CV_SIMD` macro also exists in the following files, and the reasons why they are not rewrited are: 1. ~~code design for fixed-size SIMD (v_int16x8, v_float32x4, etc.), need to manually rewrite.~~ Rewrited - ./modules/core/src/stat.simd.hpp - ./modules/core/src/matrix_transform.cpp - ./modules/core/src/matmul.simd.hpp 2. Vector types are wrapped in other class/struct, that are not supported by the compiler in variable-length backends. Can not be rewrited directly. - ./modules/core/src/mathfuncs_core.simd.hpp ```cpp struct v_atan_f32 { explicit v_atan_f32(const float& scale) { ... } v_float32 compute(const v_float32& y, const v_float32& x) { ... } ... v_float32 val90; // sizeless type can not used in a class v_float32 val180; v_float32 val360; v_float32 s; }; ``` 3. The API interface does not support/does not match - ./modules/core/src/norm.cpp Use `v_popcount`, ~~waiting for opencv#23966~~ Fixed - ./modules/core/src/has_non_zero.simd.hpp Use illegal Universal Intrinsic API: For float type, there is no logical operation `|`. Further discussion needed ```cpp /** @brief Bitwise OR Only for integer types. */ template<typename _Tp, int n> CV_INLINE v_reg<_Tp, n> operator|(const v_reg<_Tp, n>& a, const v_reg<_Tp, n>& b); template<typename _Tp, int n> CV_INLINE v_reg<_Tp, n>& operator|=(v_reg<_Tp, n>& a, const v_reg<_Tp, n>& b); ``` ```cpp #if CV_SIMD typedef v_float32 v_type; const v_type v_zero = vx_setzero_f32(); constexpr const int unrollCount = 8; int step = v_type::nlanes * unrollCount; int len0 = len & -step; const float* srcSimdEnd = src+len0; int countSIMD = static_cast<int>((srcSimdEnd-src)/step); while(!res && countSIMD--) { v_type v0 = vx_load(src); src += v_type::nlanes; v_type v1 = vx_load(src); src += v_type::nlanes; .... src += v_type::nlanes; v0 |= v1; //Illegal ? .... //res = v_check_any(((v0 | v4) != v_zero));//beware : (NaN != 0) returns "false" since != is mapped to _CMP_NEQ_OQ and not _CMP_NEQ_UQ res = !v_check_all(((v0 | v4) == v_zero)); } v_cleanup(); #endif ``` ### Pull Request Readiness Checklist See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request - [ ] I agree to contribute to the project under Apache 2 License. - [ ] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV - [ ] The PR is proposed to the proper branch - [ ] There is a reference to the original bug report and related work - [ ] There is accuracy test, performance test and test data in opencv_extra repository, if applicable Patch to opencv_extra has the same branch name. - [ ] The feature is well documented and sample code can be built with the project CMake

`VideoCapture`: remove decoder initialization when demuxing

Fix GNU/Hurd build

Fixed invalid cast and unaligned memory access

Streamlabs Desktop has the same issue in opencv#19746. This fixes it using opencv#23460 method.

OCL_FP16 MatMul with large batch * Workaround FP16 MatMul with large batch * Fix OCL reinitialization * Higher thresholds for INT8 quantization * Try fix gemm_buffer_NT for half (columns) * Fix GEMM by rows * Add batch dimension to InnerProduct layer test * Fix Test_ONNX_conformance.Layer_Test/test_basic_conv_with_padding * Batch 16 * Replace all vload4 * Version suffix for MobileNetSSD_deploy Caffe model

asmorkalov · 2023-09-11T11:36:01Z

/cc @vpisarev @hanliutong Could you check, if all things merged correctly?

opencv-alalek

/cc @vpisarev To review merged changes with #23865
/cc @mshabunin To review merged changes with #23980

modules/imgproc/CMakeLists.txt

opencv-alalek · 2023-09-11T19:27:56Z

OpenCV Contrib: https://github.com/opencv/opencv/pull/3559
OpenCV Extra: https://github.com/opencv/opencv/pull/1093

Wrong cross-repo references.

mshabunin · 2023-09-12T13:01:36Z

@asmorkalov , 5.x branch does not compile for RISC-V at this moment, so I can not check whether recent patches have been applied correctly. I'll try to fix the build first, please give me some time.

mshabunin · 2023-09-12T21:06:59Z

modules/core/src/convert.hpp

@@ -11,7 +11,7 @@
 namespace cv
 {

-#if CV_SIMD
+#if (CV_SIMD || CV_SIMD_SCALABLE)


I've fixed the compilation, but this merge still can not be built because of incompatible code being added in 5.x (uses operators with intrinsics, for example vx_load_as in this #if block). It means either separate pass with refactoring tool is needed or manual adaptation of the code.

modules/core/src/convert.hpp

asmorkalov · 2023-09-14T05:34:33Z

modules/core/src/convert.hpp

-    const int nlanes = v_uint64::nlanes;
-    double buf[v_uint64::nlanes*2];


@vpisarev Are you sure in the type and buffer size here?

asmorkalov · 2023-09-14T05:35:08Z

@opencv-alalek The PR is ready for review.

opencv-alalek · 2023-09-14T06:39:50Z

@asmorkalov Need to trigger GHA for contrib PR. There are no build results at all.

asmorkalov · 2023-09-14T09:55:44Z

Done.

seanm and others added 30 commits June 9, 2023 18:56

removing unreachable codes in gbackend

d25d441

videoio: doc: add odd width or height limitation for FFMPEG

68968ed

cuda: Fix GpuMat::copyTo and GpuMat::converTo python bindings

bea0c1b

highgui(cocoa): fix fullscreen behavior

e1d0f07

VideoCapture: remove decoder initialization when CAP_PROP_FORMAT== -1…

e4ad7e3

… (rawMode == true)

style: remove trailing whitespace

afb406f

Merge pull request opencv#24086 from Kumataro:fix24081

9b5b254

videoio: doc: add odd width or height limitation for FFMPEG

Merge pull request opencv#24089 from cudawarped:cuda_gpumat_fix_conve…

eccfd98

…rtTo_copyTo_bindings `cuda`: Fix `GpuMat::copyTo` and `GpuMat::converTo` python bindings

Fixed bug when MSMF webcamera doesn't start when build with VIDEOIO_P…

4a12707

…LUGIN_ALL

videoio: fix camera opening with GStreamer plugin

53dfd95

Merge pull request opencv#24133 from alexlyulkov:al/fixed-msmf-webcam

3421b95

Fixed bug when MSMF webcamera doesn't start when build with VIDEOIO_PLUGIN_ALL

Fix GNU/Hurd build

82de5b3

It has the usual Unix filesystem operations.

Merge pull request opencv#24012 from cudawarpedЖvideocapture_raw_read

5b41134

`VideoCapture`: remove decoder initialization when demuxing

Merge pull request opencv#24140 from sthibaul:4.x

232c67b

Fix GNU/Hurd build

Merge pull request opencv#23734 from seanm:unaligned-copy

747b7ca

Fixed invalid cast and unaligned memory access

Adding support for Streamlabs Desktop Virtual Webcam

a300e7e

Streamlabs Desktop has the same issue in opencv#19746. This fixes it using opencv#23460 method.

Merge pull request opencv#24138 from mshabunin:fix-gst-plugin-camera

27d718b

fix ipp_warpAffine return value error

a301d1c

style: remove extraneous std::cout

fb34f36

Mark OpenVINO models for G-API tests optional

ad7ecf1

Merge pull request opencv#24153 from Ginkgo-Biloba:ipp-warp-affine

ace7817

gapi: update ADE library to 0.1.2b

8e52c01

fix the issue in layer fused

16681d1

Merge pull request opencv#24156 from zihaomu:fix_24041

8d1c73a

Merge pull request opencv#24150 from DeePingXian:4.x

abda763

asmorkalov requested review from mshabunin and opencv-alalek September 11, 2023 10:38

asmorkalov force-pushed the 5.x-merge-4.x branch from 13289e8 to 6af4de6 Compare September 11, 2023 11:48

asmorkalov added this to the 4.9.0 milestone Sep 11, 2023

opencv-alalek reviewed Sep 11, 2023

View reviewed changes

modules/imgproc/CMakeLists.txt Show resolved Hide resolved

opencv-alalek requested a review from vpisarev September 11, 2023 18:43

asmorkalov changed the title ~~(5.x) Merge 4.x~~ WIP: (5.x) Merge 4.x Sep 12, 2023

asmorkalov force-pushed the 5.x-merge-4.x branch 2 times, most recently from 70c9e4f to 0b0fb90 Compare September 12, 2023 16:22

mshabunin reviewed Sep 12, 2023

View reviewed changes

asmorkalov force-pushed the 5.x-merge-4.x branch 5 times, most recently from 75b55a4 to 538bd5c Compare September 13, 2023 08:52

asmorkalov assigned opencv-alalek Sep 13, 2023

asmorkalov commented Sep 13, 2023

View reviewed changes

modules/core/src/convert.hpp Outdated Show resolved Hide resolved

modules/core/src/convert.hpp Outdated Show resolved Hide resolved

Merge branch 4.x

fdab565

asmorkalov force-pushed the 5.x-merge-4.x branch from 538bd5c to fdab565 Compare September 13, 2023 11:51

asmorkalov changed the title ~~WIP: (5.x) Merge 4.x~~ (5.x) Merge 4.x Sep 13, 2023

asmorkalov commented Sep 14, 2023

View reviewed changes

opencv-alalek approved these changes Sep 14, 2023

View reviewed changes

asmorkalov merged commit fdab565 into opencv:5.x Sep 14, 2023

asmorkalov mentioned this pull request Sep 28, 2023

(5.x) Merge 4.x #24338

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

(5.x) Merge 4.x #24254

(5.x) Merge 4.x #24254

Uh oh!

asmorkalov commented Sep 11, 2023 •

edited

Loading

Uh oh!

asmorkalov commented Sep 11, 2023

Uh oh!

opencv-alalek left a comment

Uh oh!

Uh oh!

opencv-alalek commented Sep 11, 2023

Uh oh!

mshabunin commented Sep 12, 2023

Uh oh!

mshabunin Sep 12, 2023

Uh oh!

Uh oh!

Uh oh!

asmorkalov Sep 14, 2023

Uh oh!

asmorkalov commented Sep 14, 2023

Uh oh!

opencv-alalek commented Sep 14, 2023

Uh oh!

asmorkalov commented Sep 14, 2023 •

edited

Loading

Uh oh!

Uh oh!

		const int nlanes = v_uint64::nlanes;
		double buf[v_uint64::nlanes*2];

Uh oh!

(5.x) Merge 4.x #24254

(5.x) Merge 4.x #24254

Uh oh!

Conversation

asmorkalov commented Sep 11, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

asmorkalov commented Sep 11, 2023

Uh oh!

opencv-alalek left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

opencv-alalek commented Sep 11, 2023

Uh oh!

mshabunin commented Sep 12, 2023

Uh oh!

mshabunin Sep 12, 2023

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

asmorkalov Sep 14, 2023

Choose a reason for hiding this comment

Uh oh!

asmorkalov commented Sep 14, 2023

Uh oh!

opencv-alalek commented Sep 14, 2023

Uh oh!

asmorkalov commented Sep 14, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

asmorkalov commented Sep 11, 2023 •

edited

Loading

asmorkalov commented Sep 14, 2023 •

edited

Loading