cv::cuda::GpuMat.convertTo() seems not to support in-place, while cv::Mat does #13092

chacha21 · 2018-11-09T13:10:52Z

Visual Studio 2017, Windows 7 64 bits, OpenCV 3.4.3
Cuda 10

the following code raises an exception :

  cv::cuda::GpuMat m(cv::Size(1280, 1024), CV_32FC1);
  cv::cuda::Stream stream;
  m.convertTo(m, CV_8UC1, 1, 0, stream);
  cv::cuda::threshold(m, m, 128, 255, cv::THRESH_BINARY, stream);

OpenCV(3.4.3) Error: Gpu API call (an illegal memory access was encountered) in
cv::cudev::grid_transform_detail::TransformDispatcher<true, Policy>::call, file
e:\opencv-3.4.3\opencv\sources\modules\cudev\include\opencv2\cudev\grid\detail/t
ransform.hpp, line 318

Using a second matrix as a destination for the convertTo() gets rid of the problem.

So it seems that convertTo() is no safe to use in-place with cuda::GpuMat, while the same code with cv::Mat would be ok.
Is it a bug or a missing documentation ?

The text was updated successfully, but these errors were encountered:

chacha21 · 2018-11-30T22:42:55Z

Maybe a shared problem with #13149 ?

harsv · 2019-05-15T00:01:47Z

I am having similar problem with opencv 3.4, Cuda10, Ubuntu18.04
This code works:
` cv::cuda::GpuMat gpu_image;

gpu_image_init.convertTo(gpu_image, CV_32FC3, 1.0, 0);`

However, this does not works:
gpu_image_init.convertTo(gpu_image_init, CV_32FC3, 1.0, 0);

In my case, the code runs but the resultant matrix has garbage values.

nglee · 2020-07-08T18:31:29Z

I am not able to reproduce this error on the latest 3.4 branch. (win10, vs2019, cuda10.2)
Although we are allowed to call GpuMat::convertTo() with one GpuMat for both input and output, it is not strictly an in-place conversion. Inside GpuMat::convertTo(), it allocates device memory for result data.

chacha21 · 2020-07-08T18:54:14Z

As long as it does not crash, it is not a problem that it is not a real "in-place". I will check with my 4.3.0 build tomorrow

chacha21 · 2020-07-09T07:51:22Z

The bug (as mentionned in initial post) is still present in 4.3.0

nglee · 2020-07-09T18:23:37Z

This is getting confusing. Even with Ubuntu 18.04(Jetson TX2) and the latest master branch build(4.4.0-pre), I wasn't able to reproduce the error. I've added the following googletest code to cudaarithm test codes and it seems to be working.

TEST(Issue13092, Issue13092)
{
    cv::cuda::GpuMat m(cv::Size(1280, 1024), CV_32FC1);
    cv::cuda::Stream stream;
    m.convertTo(m, CV_8UC1, 1, 0, stream);
    cv::cuda::threshold(m, m, 128, 255, cv::THRESH_BINARY, stream);
}

$ ./opencv_test_cudaarithm --gtest_filter=*13092*

...

deviceQuery, CUDA Driver = CUDART, CUDA Driver Version  = 10.0, CUDA Runtime Version = 10.0, NumDevs = 1

CTEST_FULL_OUTPUT
OpenCV version: 4.4.0-pre
OpenCV VCS version: 4.3.0-586-gd0e6d2438c
Build type: Release
Compiler: /usr/bin/c++  (ver 7.5.0)
Parallel framework: pthreads (nthreads=6)
CPU features: NEON? FP16?
OpenCL is disabled
TEST: Skip tests with tags: 'mem_6gb', 'verylong'
Note: Google Test filter = *13092*
[==========] Running 1 test from 1 test case.
[----------] Global test environment set-up.
[----------] 1 test from Issue13092
[ RUN      ] Issue13092.Issue13092
[       OK ] Issue13092.Issue13092 (550 ms)
[----------] 1 test from Issue13092 (550 ms total)

[----------] Global test environment tear-down
[==========] 1 test from 1 test case ran. (551 ms total)
[  PASSED  ] 1 test.

I'll try again with x64 machine tomorrow.

chacha21 · 2020-07-09T19:14:22Z

I forgot to mention that I now use VS2019 and CUDA 10.2 but it is of less importance; apart form that this is still under W7 64b

nglee · 2020-07-17T04:28:17Z

After increasing the GpuMat size, I see the same issue. The GPU model is RTX 2080 Ti.

mshabunin · 2020-07-23T11:12:19Z

Perhaps this issue is similar to #17840 which have been resolved by using CUDA function designed for in-place processing.

nglee · 2020-07-29T16:08:02Z

@chacha21 @mshabunin
I believe this can fix this issue.
Could you please take a look?

chacha21 · 2020-07-30T07:23:42Z

I confirm that it seems to fix the problem.

nglee mentioned this issue Jul 29, 2020

cuda::GpuMat::convertTo - fix for in-place arguments #17982

Merged

6 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

cv::cuda::GpuMat.convertTo() seems not to support in-place, while cv::Mat does #13092

cv::cuda::GpuMat.convertTo() seems not to support in-place, while cv::Mat does #13092

chacha21 commented Nov 9, 2018 •

edited

chacha21 commented Nov 30, 2018

harsv commented May 15, 2019 •

edited

nglee commented Jul 8, 2020 •

edited

chacha21 commented Jul 8, 2020

chacha21 commented Jul 9, 2020

nglee commented Jul 9, 2020

chacha21 commented Jul 9, 2020 •

edited

nglee commented Jul 17, 2020

mshabunin commented Jul 23, 2020

nglee commented Jul 29, 2020

chacha21 commented Jul 30, 2020

cv::cuda::GpuMat.convertTo() seems not to support in-place, while cv::Mat does #13092

cv::cuda::GpuMat.convertTo() seems not to support in-place, while cv::Mat does #13092

Comments

chacha21 commented Nov 9, 2018 • edited

chacha21 commented Nov 30, 2018

harsv commented May 15, 2019 • edited

nglee commented Jul 8, 2020 • edited

chacha21 commented Jul 8, 2020

chacha21 commented Jul 9, 2020

nglee commented Jul 9, 2020

chacha21 commented Jul 9, 2020 • edited

nglee commented Jul 17, 2020

mshabunin commented Jul 23, 2020

nglee commented Jul 29, 2020

chacha21 commented Jul 30, 2020

chacha21 commented Nov 9, 2018 •

edited

harsv commented May 15, 2019 •

edited

nglee commented Jul 8, 2020 •

edited

chacha21 commented Jul 9, 2020 •

edited