
add support for device selection and multiple GPUs #121

Merged: 2 commits into alicevision:develop on Jan 5, 2021

Conversation

@mitjap (Contributor) commented Nov 20, 2020

Description

This pull request enables the user to select which GPU the algorithm runs on, and makes it possible to run on multiple GPUs.

Feature list

  • Select CUDA device
  • Ability to run on multiple CUDA devices

Implementation remarks

To run on multiple GPUs, this implementation requires multiple PopSift instances. The main issue was that the algorithm uses global state held in extern variables. I made those thread_local, which lets each thread keep its own value for its specific device (see the sketch below).
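
To make the pattern concrete, here is a minimal sketch of the thread_local change; DeviceState, g_state, and initForDevice are hypothetical stand-ins, not the actual PopSift code:

#include <cuda_runtime.h>

// Hypothetical stand-in for globals such as hct/hbuf discussed below.
struct DeviceState {
    int   device  = -1;
    void* scratch = nullptr;
};

// header:   extern thread_local DeviceState g_state;
// one file: thread_local DeviceState g_state;
thread_local DeviceState g_state;   // one copy per host thread

void initForDevice( int device )
{
    cudaSetDevice( device );                  // bind this host thread to its GPU
    g_state.device = device;                  // per-thread value, not process-wide
    cudaMalloc( &g_state.scratch, 1 << 20 );  // allocated on the bound device
}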

I have not tested matching (ProcessingMode == MatchingMode).

@mitjap (Contributor, Author) commented Nov 20, 2020

In case you decide not to accept this PR, you should at least fix this very minor memory leak:

cudaFree( _d_extrema_num_blocks );
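
For context, a sketch of the pairing this fix restores; the allocation site and num_blocks are hypothetical, only the _d_extrema_num_blocks name comes from the code:

// allocation path: every cudaMalloc needs a matching cudaFree
cudaMalloc( &_d_extrema_num_blocks, num_blocks * sizeof(int) );

// teardown path: the free that was missing
cudaFree( _d_extrema_num_blocks );
_d_extrema_num_blocks = nullptr;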

@griwodz (Member) left a comment:

I am not certain whether the thread_local can fail for anybody. It probably cannot fail for hct, hbuf and so on, because those are only used in a thread spawned by PopSift.

I understand that the thread_local forces you to init the filter and the configuration in the extraction/match threads. That leads to more frequent configuration calls; is that problematic?

@@ -313,14 +339,21 @@ void PopSift::extractDownloadLoop( )

job->setFeatures( features );
}

private_unit();
@griwodz (Member):

Do I understand correctly that you want to delete the Pyramid every time you have downloaded the features? That would crash if several images had been queued for feature extraction, wouldn't it?

@mitjap (Contributor, Author) Nov 20, 2020:

The Pyramid is deleted only after the pipeline is stopped via the PopSift::uninit function. Note that private_unit (this is actually a typo and should be private_uninit) is called outside the while loop.

@griwodz (Member):

Shouldn't you also add private_uninit to the matchPrepareLoop?

@mitjap (Contributor, Author):

Yes, you are right.

{
cudaSetDevice(_device);
@griwodz (Member):

This looks like a good thing to do. Perhaps with an error check for when a user chooses a non-existent device?

@mitjap (Contributor, Author):

I agree. Do you think using device_prop_t::set(int, bool) would be good? Another solution would be a manual check with

POP_CUDA_FATAL_TEST( cudaSetDevice( currentDevice ), "Cannot set device" );

or maybe just POP_CHK.
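
For illustration, a minimal sketch of such a manual check using only plain CUDA runtime calls; the fprintf/exit error handling merely stands in for PopSift's POP_* macros, whose exact behavior is not shown in this thread:

#include <cuda_runtime.h>
#include <cstdio>
#include <cstdlib>

static void setDeviceChecked( int device )
{
    int count = 0;
    cudaError_t err = cudaGetDeviceCount( &count );
    if( err != cudaSuccess || device < 0 || device >= count ) {
        fprintf( stderr, "Cannot set device %d (found %d devices)\n", device, count );
        exit( EXIT_FAILURE );
    }
    err = cudaSetDevice( device );
    if( err != cudaSuccess ) {
        fprintf( stderr, "cudaSetDevice(%d) failed: %s\n", device, cudaGetErrorString( err ) );
        exit( EXIT_FAILURE );
    }
}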

@mitjap (Contributor, Author) commented Nov 20, 2020

> I understand that the thread_local forces you to init the filter and the configuration in the extraction/match threads. That leads to more frequent configuration calls; is that problematic?

The filter function checks whether the configuration differs, so I don't expect it to call any CUDA functions.

The second applyConfiguration(), which is inside the while loop, is there to support the following usage:

PopSift sift(PopSift::ByteImages, device); // initializes with default configuration in spawned thread
sift.configure(config); // this configuration is applied when first image is enqueued.
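
Extending that usage, a sketch of how two instances would target two GPUs under this PR; the constructor form follows the snippet above, everything else is assumed:

PopSift sift0( PopSift::ByteImages, /*device=*/0 );
PopSift sift1( PopSift::ByteImages, /*device=*/1 );

sift0.configure( config );  // applied when the first image is enqueued
sift1.configure( config );

// Each instance spawns its own pipeline thread; the thread_local
// globals give that thread its own state, bound to its own device.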

@mitjap (Contributor, Author) commented Nov 20, 2020

> I am not certain whether the thread_local can fail for anybody. It probably cannot fail for hct, hbuf and so on, because those are only used in a thread spawned by PopSift.

I'm sorry, I don't quite understand what you are trying to say here.

@mitjap (Contributor, Author) commented Nov 20, 2020

For a more robust interface, maybe there should be another call to cudaSetDevice(_device) in the PopSift::uninit function, so that we make sure device images are properly deleted.
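
A hypothetical sketch of that suggestion (the real body and signature of uninit are not shown in this thread):

void PopSift::uninit( )
{
    cudaSetDevice( _device );  // re-bind so frees target the GPU that owns the memory
    // ... existing teardown that releases device images ...
}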

@mitjap requested a review from griwodz, November 24, 2020 10:42
@griwodz (Member) left a comment:

I'm sorry that it took me so long before I could review the code. I hope that your improvements have been working well for you.

I have two remaining change requests before approving:
(1) I think that private_unit() should also be called at the end of matchPrepareLoop()
(2) private_uninit would be a better name than private_unit

@simogasp (Member):

@mitjap, if you can, please also update CHANGES.md under v1.0.0 and add one line describing the content of this PR.

@griwodz (Member) commented Jan 4, 2021

@simogasp Should I merge this PR and follow up with the additional fixes? I can't push to the original branch.

@mitjap (Contributor, Author) commented Jan 4, 2021

If by "pushing to the original branch" you mean my branch, I think I have enabled pushing for PopSift maintainers. I can make the code changes as requested, but at the moment I don't have the time to test them properly.

@griwodz (Member) commented Jan 4, 2021

@mitjap Thanks, I'll give it another try tomorrow. I had cloned your repo and tried to push the uninit fix for the match loop, but it didn't work; I may have made some other mistake.

@mitjap (Contributor, Author) commented Jan 4, 2021

I made the requested changes to the code.

@griwodz (Member) left a comment:

Thank you for the fixes!

@griwodz merged commit 5bbd332 into alicevision:develop, Jan 5, 2021
@mitjap deleted the multi_gpu branch, January 5, 2021 13:49
@simogasp modified the milestones: v1.0.0, v0.9.1, Mar 25, 2023
Labels: cuda (issues related to cuda versions), type:enhancement