fix(cuda): fix multi-GPU implementation when there are few inputs #149

agnesLeroy · 2022-08-31T09:07:50Z

Resolves: https://github.com/zama-ai/concrete-core-internal/issues/382 and https://github.com/zama-ai/concrete-core-internal/issues/389

Description

Fix multi GPU when there are few inputs: in case the number of GPUs is lower than the number of inputs, restrict the number of GPUs used to the number of inputs.
Also, change some types so that it feels more natural to use them.

Checklist

(Use '[x]' to check the checkboxes, or submit the PR and then click the checkboxes)

Tests for the changes have been added (for bug fixes / features)
Docs have been added / updated (for bug fixes / features)
The PR description links to the related issue (to link an issue, use '#XXX'.)
The tests on AWS have been launched and are successful (comment with @slab-ci cpu_test and/or @slab-ci gpu_test to trigger the tests)
The draft release description has been updated
Check for breaking changes (including serialization changes) and add them to commit message following the conventional commit specification

agnesLeroy · 2022-08-31T09:08:27Z

@slab-ci gpu_test

agnesLeroy · 2022-08-31T09:08:38Z

@slab-ci cpu_test

agnesLeroy · 2022-08-31T10:53:55Z

@slab-ci cpu_test

concrete-core/src/backends/cuda/implementation/engines/mod.rs

concrete-core/src/backends/cuda/implementation/engines/cuda_engine/mod.rs

...ckends/cuda/implementation/engines/cuda_engine/lwe_ciphertext_vector_discarding_bootstrap.rs

agnesLeroy · 2022-09-01T11:35:39Z

@slab-ci gpu_test

agnesLeroy · 2022-09-01T11:35:46Z

@slab-ci cpu_test

agnesLeroy · 2022-09-01T15:34:06Z

@slab-ci gpu_test

IceTDrinker · 2022-09-01T15:42:47Z

...te-core/src/backends/cuda/implementation/engines/cuda_engine/lwe_bootstrap_key_conversion.rs

+        let data_per_gpu = input.glwe_dimension().to_glwe_size().0
+            * input.glwe_dimension().to_glwe_size().0
+            * input.input_lwe_dimension().0
+            * input.decomposition_level_count().0
+            * input.polynomial_size().0;
+        let size = data_per_gpu as u64 * std::mem::size_of::<u32>() as u64;


there already is the need for the compiler in practice https://github.com/zama-ai/concrete-core-internal/issues/188

pdroalves · 2022-09-01T16:07:31Z

Impressive work refactoring the CUDA backend, @agnesLeroy !

IceTDrinker

Generally looks good 🙂 great effort !

concrete-core/src/backends/cuda/private/crypto/bootstrap/mod.rs

concrete-core/src/backends/cuda/private/mod.rs

concrete-core/src/backends/cuda/private/device.rs

concrete-core/src/backends/cuda/private/crypto/lwe/list.rs

agnesLeroy · 2022-09-02T08:30:32Z

@slab-ci gpu_test

agnesLeroy · 2022-09-02T08:44:50Z

@slab-ci gpu_test

IceTDrinker

A few things still (sorry)

...ore/src/backends/cuda/implementation/engines/cuda_engine/lwe_ciphertext_vector_conversion.rs

concrete-core/src/backends/cuda/private/crypto/bootstrap/mod.rs

concrete-core/src/backends/cuda/private/crypto/lwe/list.rs

- also, change some types so that it feels more natural to use them - refactor the whole backend to reduce the probability of bugs

agnesLeroy · 2022-09-02T09:15:44Z

@slab-ci gpu_test

agnesLeroy · 2022-09-02T11:26:38Z

@IceTDrinker normally I took all the comments into account, GPU tests are green! 🙂

IceTDrinker

Looks good as far as I can tell !

For the twiddle thing can it be something that the fft does by itself ?

agnesLeroy · 2022-09-02T13:33:21Z

Looks good as far as I can tell !

For the twiddle thing can it be something that the fft does by itself ?

Maybe, we need to do some benchmarking to check the effect on performance.

cla-bot bot added the cla-signed label Aug 31, 2022

agnesLeroy requested a review from pdroalves August 31, 2022 09:19

agnesLeroy mentioned this pull request Aug 31, 2022

Add views on LWE ciphertext vectors #108

Merged

6 tasks

agnesLeroy requested a review from IceTDrinker August 31, 2022 10:55

IceTDrinker reviewed Aug 31, 2022

View reviewed changes

agnesLeroy force-pushed the fix/multi_gpu branch from 918f2e5 to f8ef830 Compare September 1, 2022 11:11

agnesLeroy force-pushed the fix/multi_gpu branch from f8ef830 to c7da817 Compare September 1, 2022 15:33

agnesLeroy requested a review from IceTDrinker September 1, 2022 15:34

IceTDrinker reviewed Sep 1, 2022

View reviewed changes

agnesLeroy force-pushed the fix/multi_gpu branch from c7da817 to 5744513 Compare September 2, 2022 08:30

agnesLeroy mentioned this pull request Sep 2, 2022

Fix/cuda memory bug #158

Merged

6 tasks

agnesLeroy force-pushed the fix/multi_gpu branch from 5744513 to a03f4be Compare September 2, 2022 08:44

IceTDrinker reviewed Sep 2, 2022

View reviewed changes

...ore/src/backends/cuda/implementation/engines/cuda_engine/lwe_ciphertext_vector_conversion.rs Outdated Show resolved Hide resolved

concrete-core/src/backends/cuda/private/crypto/bootstrap/mod.rs Outdated Show resolved Hide resolved

IceTDrinker reviewed Sep 2, 2022

View reviewed changes

concrete-core/src/backends/cuda/private/crypto/lwe/list.rs Outdated Show resolved Hide resolved

agnesLeroy force-pushed the fix/multi_gpu branch from a03f4be to 42315ac Compare September 2, 2022 09:04

fix(cuda): fix multi-GPU implementation when there are few inputs

dfeaeba

- also, change some types so that it feels more natural to use them - refactor the whole backend to reduce the probability of bugs

agnesLeroy force-pushed the fix/multi_gpu branch from 42315ac to dfeaeba Compare September 2, 2022 09:15

IceTDrinker approved these changes Sep 2, 2022

View reviewed changes

agnesLeroy merged commit fe1f719 into main Sep 2, 2022

agnesLeroy deleted the fix/multi_gpu branch September 2, 2022 13:33

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix(cuda): fix multi-GPU implementation when there are few inputs #149

fix(cuda): fix multi-GPU implementation when there are few inputs #149

agnesLeroy commented Aug 31, 2022 •

edited

agnesLeroy commented Aug 31, 2022

agnesLeroy commented Aug 31, 2022

agnesLeroy commented Aug 31, 2022

agnesLeroy commented Sep 1, 2022

agnesLeroy commented Sep 1, 2022

agnesLeroy commented Sep 1, 2022

IceTDrinker Sep 1, 2022

pdroalves commented Sep 1, 2022

IceTDrinker left a comment

agnesLeroy commented Sep 2, 2022

agnesLeroy commented Sep 2, 2022

IceTDrinker left a comment

agnesLeroy commented Sep 2, 2022

agnesLeroy commented Sep 2, 2022

IceTDrinker left a comment

agnesLeroy commented Sep 2, 2022

fix(cuda): fix multi-GPU implementation when there are few inputs #149

fix(cuda): fix multi-GPU implementation when there are few inputs #149

Conversation

agnesLeroy commented Aug 31, 2022 • edited

Resolves: https://github.com/zama-ai/concrete-core-internal/issues/382 and https://github.com/zama-ai/concrete-core-internal/issues/389

Description

Checklist

agnesLeroy commented Aug 31, 2022

agnesLeroy commented Aug 31, 2022

agnesLeroy commented Aug 31, 2022

agnesLeroy commented Sep 1, 2022

agnesLeroy commented Sep 1, 2022

agnesLeroy commented Sep 1, 2022

IceTDrinker Sep 1, 2022

Choose a reason for hiding this comment

pdroalves commented Sep 1, 2022

IceTDrinker left a comment

Choose a reason for hiding this comment

agnesLeroy commented Sep 2, 2022

agnesLeroy commented Sep 2, 2022

IceTDrinker left a comment

Choose a reason for hiding this comment

agnesLeroy commented Sep 2, 2022

agnesLeroy commented Sep 2, 2022

IceTDrinker left a comment

Choose a reason for hiding this comment

agnesLeroy commented Sep 2, 2022

agnesLeroy commented Aug 31, 2022 •

edited