cuda::flip - Use in-place npp function for inplace arguments #17863

nglee · 2020-07-16T13:43:31Z

Resolves #17840

~~cv::cuda::GpuMat::create does not allocate a new GpuMat if required size and type equals existing size and type.~~

Lines 160 to 161 in bf8136e

    if (rows == _rows && cols == _cols && type() == _type && data)
        return;
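To make the consequence of that early return concrete, here is a minimal host-only sketch. `Header`, its `buf` member and `sharesStorageWith` are hypothetical stand-ins, not OpenCV types, and the `type()` comparison from the quoted lines is left out for brevity. The sketch assumes a header that shares its buffer when copied, which is how `GpuMat` copies behave.

```cpp
#include <cassert>
#include <cstddef>
#include <memory>
#include <vector>

// Hypothetical, host-only stand-in for a GpuMat-like header (not an OpenCV type).
struct Header {
    int rows = 0;
    int cols = 0;
    std::shared_ptr<std::vector<unsigned char>> buf;  // shared on copy

    // Mirrors the quoted check: same size and existing data means "do nothing".
    // The element type comparison is omitted in this sketch.
    void create(int _rows, int _cols) {
        if (rows == _rows && cols == _cols && buf)
            return;
        rows = _rows;
        cols = _cols;
        buf = std::make_shared<std::vector<unsigned char>>(
            static_cast<std::size_t>(_rows) * static_cast<std::size_t>(_cols));
    }

    // True when both headers point at the same underlying buffer.
    bool sharesStorageWith(const Header& other) const {
        return buf && buf == other.buf;
    }
};
```

If `dst` is a copy of `src`, calling `dst.create(src.rows, src.cols)` leaves the two headers pointing at the same buffer, so any out-of-place routine that runs afterwards reads and writes the same memory.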

~~This PR implements cv::cuda::detachOutput to allocate a new GpuMat in such cases. Unfortunately, _dst is const so assignment is not possible. Instead, this implementation clones src.~~

Pull Request Readiness Checklist

See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request

I agree to contribute to the project under OpenCV (BSD) License.
To the best of my knowledge, the proposed patch is not based on a code under GPL or other license that is incompatible with OpenCV
The PR is proposed to proper branch
There is reference to original bug report and related work
There is accuracy test, performance test and test data in opencv_extra repository, if applicable
Patch to opencv_extra has the same branch name.
The feature is well documented and sample code can be built with the project CMake

alalek

It is not a CUDA-only issue for this particular function.

It is related to many CPU and OpenCL implementations (almost zero tests for this mode).
So some robust generic approach should be applied.

Possible fixes degrade performance, so we want to forbid such usages (to keep OpenCV library and applications optimized).
We don't want to try fixing that and adding the corresponding tests.

see also #13570 (it is closed without raising a general issue)

modules/core/src/cuda_gpu_mat.cpp

modules/core/include/opencv2/core/private.cuda.hpp

modules/core/src/cuda_gpu_mat.cpp

nglee · 2020-07-18T14:25:38Z

@alalek
The updated code is modeled on the following lines.
https://github.com/opencv/opencv/blob/3.4/modules/imgproc/src/filter.dispatch.cpp#L918-L924

Consulting this issue comment, should we raise an exception instead?

nglee · 2020-07-21T01:32:23Z

#17840 (comment)

Instead of copying the source matrix, this code now calls the in-place NPP function when it is given in-place arguments.
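The dispatch idea can be sketched on the host without CUDA or NPP. Everything below is an illustrative assumption: `Image`, `flipRowsOutOfPlace`, `flipRowsInPlace` and `flipRows` are hypothetical names, and the aliasing test compares object addresses where a `GpuMat` version would more likely compare data pointers. It shows one plausible way an out-of-place routine corrupts an aliased buffer, and how routing aliased calls to an in-place routine avoids that without a copy.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical host-side stand-in for an image; not an OpenCV type.
struct Image {
    int rows;
    int cols;
    std::vector<unsigned char> data;  // rows * cols bytes, row-major
};

// Out-of-place vertical flip: dst row r receives src row (rows - 1 - r).
// If src and dst alias, rows overwritten early are later read back as source.
void flipRowsOutOfPlace(const unsigned char* src, unsigned char* dst,
                        int rows, int cols) {
    for (int r = 0; r < rows; ++r)
        for (int c = 0; c < cols; ++c)
            dst[r * cols + c] = src[(rows - 1 - r) * cols + c];
}

// In-place vertical flip: swap row r with row (rows - 1 - r).
void flipRowsInPlace(unsigned char* buf, int rows, int cols) {
    for (int r = 0; r < rows / 2; ++r)
        std::swap_ranges(buf + r * cols, buf + (r + 1) * cols,
                         buf + (rows - 1 - r) * cols);
}

// Dispatch: aliased arguments take the in-place path, others the out-of-place path.
void flipRows(const Image& src, Image& dst) {
    assert(src.data.size() ==
           static_cast<std::size_t>(src.rows) * static_cast<std::size_t>(src.cols));
    if (&src == &dst) {  // assumption: a GpuMat version would compare data pointers
        flipRowsInPlace(dst.data.data(), dst.rows, dst.cols);
        return;
    }
    dst.rows = src.rows;
    dst.cols = src.cols;
    dst.data.resize(src.data.size());
    flipRowsOutOfPlace(src.data.data(), dst.data.data(), src.rows, src.cols);
}
```

Calling `flipRows(img, img)` takes the swap-based path, while `flipRows(img, out)` with a separate `out` takes the copying path. Neither path allocates a temporary copy of the source.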

asmorkalov · 2020-07-21T09:41:49Z

Looks good to me. Passed test on Ubuntu 18.04 with CUDA 10.2 and GPU 1080Ti.

asmorkalov

👍

asmorkalov · 2020-07-22T06:29:50Z

@nglee The same issue is reproducible with master branch too, but the code for master is moved to opencv_contrib repo. Could you propose yet another PR to contrib repository too?

nglee · 2020-07-22T07:40:07Z

@asmorkalov
I've opened one: opencv/opencv_contrib#2612

tomoaki0705 · 2020-07-28T04:00:17Z

> @nglee The same issue is reproducible with master branch too, but the code for master is moved to opencv_contrib repo. Could you propose yet another PR to contrib repository too?

@asmorkalov I was once told by @alalek that porting a PR from opencv to opencv_contrib is not necessary.

#12666 (review)

> once this modification is approved, I'll open a PR for opencv_contrib, too.

> It is not necessary. I will merge these changes during next regular "Merge 3.4" PRs.

And alalek also did the same recently
original
ported

Was there any special reason for this PR to be ported to opencv_contrib ?
I'm just curious.

asmorkalov · 2020-07-28T08:10:02Z

It's a good idea to keep the CUDA implementations aligned between branches, especially if the patch can be ported as is, without serious modifications.

tomoaki0705 · 2020-07-29T07:11:53Z

@asmorkalov, my point is not "why", but "who".
If I make a modification to CUDA code in the 3.4 branch, should "I" port it to contrib, or can I leave it untouched so "alalek" can port it?

asmorkalov · 2020-07-29T08:55:47Z

The general approach is to merge 3.4 into master weekly or so. Alalek does it, and you usually should not worry about it. CUDA changes cannot simply be merged into contrib with git; some manual work is required. I usually ask the PR author to do the porting, since the author is in context and can do it faster.

nglee force-pushed the dev_cudaDetachOutput branch from 3900414 to bfefe79 Compare July 16, 2020 17:42

nglee marked this pull request as ready for review July 16, 2020 17:57

alalek reviewed Jul 17, 2020

nglee force-pushed the dev_cudaDetachOutput branch from bfefe79 to bb34398 Compare July 18, 2020 14:12

nglee force-pushed the dev_cudaDetachOutput branch from bb34398 to 58dcd52 Compare July 18, 2020 21:10

asmorkalov added the "category: gpu/cuda (contrib)" label and removed the "RFC" label Jul 20, 2020

Use in-place npp function for inplace arguments

9411cd6

nglee force-pushed the dev_cudaDetachOutput branch from 58dcd52 to 9411cd6 Compare July 21, 2020 01:28

nglee changed the title ~~Implement cv::cuda::detachOutput~~ cuda::flip - Use in-place npp function for inplace arguments Jul 21, 2020

nglee marked this pull request as draft July 21, 2020 02:37

nglee marked this pull request as ready for review July 21, 2020 12:30

mshabunin requested a review from asmorkalov July 21, 2020 16:41

mshabunin assigned asmorkalov Jul 21, 2020

asmorkalov approved these changes Jul 22, 2020

asmorkalov added the "backport is needed" label Jul 22, 2020

nglee mentioned this pull request Jul 22, 2020

cuda::flip - Use in-place npp function for inplace arguments opencv/opencv_contrib#2612

Merged

opencv-pushbot merged commit 0fa06b1 into opencv:3.4 Jul 22, 2020

nglee deleted the dev_cudaDetachOutput branch July 22, 2020 10:38

mshabunin added the "port/backport done" label and removed the "backport is needed" label Jul 22, 2020

asmorkalov mentioned this pull request Jul 23, 2020

In-place flip of GpuMat produces image artifacs #17840

Closed

alalek mentioned this pull request Jul 28, 2020

Merge 3.4 #17970

Merged

tomoaki0705 mentioned this pull request Sep 16, 2020

cudaarithm: inplace version of NPP flip fails with odd number ROI #18347

Closed
