multi card support #356

vhnatyk · 2024-01-30T07:51:12Z

Describe the changes

This PR enables multi-GPU support.

Linked Issues

Resolves #135

wrappers/rust/icicle-core/Cargo.toml

jeremyfelder · 2024-01-31T12:42:22Z

We should add documentation on how to use multiple devices. It wasn't clear to me at first how a Device slice could be set on anything but the default device.
It's also unclear how set_device() works with multithreading.
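
To make the multithreading question concrete, here is a minimal sketch of the "one host thread per device" pattern. It does not call the real wrapper: `set_device`, `get_device` and `run_on_devices` are hypothetical stand-ins backed by a thread-local, which mimics how the CUDA runtime tracks the current device per host thread.

```rust
use std::cell::Cell;
use std::thread;

thread_local! {
    // Stand-in for the CUDA runtime's per-host-thread "current device".
    static CURRENT_DEVICE: Cell<usize> = Cell::new(0);
}

/// Hypothetical stand-in for `set_device`: affects only the calling thread.
fn set_device(device_id: usize) {
    CURRENT_DEVICE.with(|d| d.set(device_id));
}

/// Hypothetical stand-in for `get_device`: reads the calling thread's device.
fn get_device() -> usize {
    CURRENT_DEVICE.with(|d| d.get())
}

/// Spawns one worker per device. Each worker selects its device once,
/// then reports which device it sees as current.
fn run_on_devices(device_count: usize) -> Vec<usize> {
    let handles: Vec<_> = (0..device_count)
        .map(|id| {
            thread::spawn(move || {
                set_device(id);
                // Real work (allocations, NTT, MSM) would go here.
                get_device()
            })
        })
        .collect();
    handles.into_iter().map(|h| h.join().unwrap()).collect()
}
```

Under these assumptions, a device set in one worker never leaks into another worker or into the spawning thread, which is the behaviour the documentation would need to spell out.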

wrappers/rust/icicle-core/src/ntt/mod.rs

wrappers/rust/icicle-cuda-runtime/src/device_context.rs

wrappers/rust/icicle-cuda-runtime/src/memory.rs

wrappers/rust/icicle-core/src/field.rs

wrappers/rust/icicle-core/src/msm/tests.rs

wrappers/rust/icicle-core/src/ntt/mod.rs

DmytroTym · 2024-02-04T09:11:10Z

Setting a new device in a context automatically invalidates the context, and setting a different stream also invalidates it if that stream was created for a different device. Maybe we should redesign, or at least encapsulate, the context somehow? Though I guess that's technically a breaking change.
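
One way the encapsulation could look, as a rough sketch only: the types below (`Stream`, `Context`, `StreamDeviceMismatch`) are hypothetical and are not the wrapper's actual `DeviceContext`. The device is fixed at construction, there is no device setter, and attaching a stream is checked against the context's device.

```rust
/// Hypothetical stream handle that remembers the device it was created on.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
struct Stream {
    device_id: usize,
}

impl Stream {
    fn create_on(device_id: usize) -> Self {
        Stream { device_id }
    }
}

/// Error returned when a stream belongs to a different device than the context.
#[derive(Debug, PartialEq, Eq)]
struct StreamDeviceMismatch {
    context_device: usize,
    stream_device: usize,
}

/// Hypothetical encapsulated context. Fields are not `pub`, so code outside
/// this module cannot change the device after construction.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
struct Context {
    device_id: usize,
    stream: Option<Stream>,
}

impl Context {
    fn for_device(device_id: usize) -> Self {
        Context { device_id, stream: None }
    }

    fn device_id(&self) -> usize {
        self.device_id
    }

    /// Attaches a stream only if it was created for this context's device.
    fn with_stream(self, stream: Stream) -> Result<Self, StreamDeviceMismatch> {
        if stream.device_id != self.device_id {
            return Err(StreamDeviceMismatch {
                context_device: self.device_id,
                stream_device: stream.device_id,
            });
        }
        Ok(Context { stream: Some(stream), ..self })
    }
}
```

With this shape, the two invalidation cases described above become either impossible (changing the device) or an explicit error (mismatched stream) rather than a silently broken context.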

wrappers/rust/rust-toolchain

DmytroTym

I really like the idea of one CPU thread = one device. We may have some inconveniences integrating into third-party applications that already have their own multithreading, but stand-alone it seems very elegant and natural to me.

The only thing I don't really like is adding new functions for context and config creation that accept device_id. I feel like we can always just infer the device id using cudaGetDevice, especially considering that we want the device id to be set once for the entire duration of the CPU thread. More generally, I feel the need to push back against all non-essential free-floating functions being added to the public API. As discussed with @ChickenLover at some point, once there are too many of them we should create a struct, a trait, etc.
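
A sketch of what inferring the device could look like, under stated assumptions: the thread-local below stands in for the value cudaGetDevice would report, and `DeviceContext`, `default_context` and `context_on_worker` here are simplified, hypothetical versions rather than the wrapper's real code.

```rust
use std::cell::Cell;
use std::thread;

thread_local! {
    // Stand-in for the per-thread current device that cudaGetDevice reports.
    static CURRENT_DEVICE: Cell<usize> = Cell::new(0);
}

/// Hypothetical stand-in for cudaSetDevice.
fn set_device(device_id: usize) {
    CURRENT_DEVICE.with(|d| d.set(device_id));
}

/// Hypothetical stand-in for cudaGetDevice.
fn get_device() -> usize {
    CURRENT_DEVICE.with(|d| d.get())
}

/// Simplified context that only records the device it belongs to.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
struct DeviceContext {
    device_id: usize,
}

impl DeviceContext {
    /// Infers the device from the calling thread, so no device_id parameter
    /// is needed in the public API.
    fn default_context() -> Self {
        DeviceContext { device_id: get_device() }
    }
}

/// Sets the device once at thread start, then builds a context without
/// passing the id explicitly.
fn context_on_worker(device_id: usize) -> DeviceContext {
    thread::spawn(move || {
        set_device(device_id);
        DeviceContext::default_context()
    })
    .join()
    .unwrap()
}
```

The trade-off is that the result depends on hidden per-thread state, which is acceptable only if the device really is set once per thread as proposed.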

icicle/appUtils/ntt/ntt.cu

wrappers/rust/icicle-core/src/field.rs

wrappers/rust/icicle-core/src/ntt/mod.rs

wrappers/rust/icicle-core/src/msm/tests.rs

wrappers/rust/icicle-cuda-runtime/src/device_context.rs

DmytroTym · 2024-02-13T11:03:37Z

@vhnatyk okay, last counter-proposal. Recently we moved get_default_ntt_config into Rust, which allowed us to actually implement it generically. Using this function seems much better to me than the free-floating functions we had before. Can we from now on use NTTConfig::default_config and NTTConfig::default_config_for_device, not add get_default_ntt_config_for_device, and schedule get_default_ntt_config for deprecation? Maybe the same can be done for the DeviceContext struct. I'm not sure why I would put a free-floating function there (for consistency with configs, I guess), but now that we've added another initialiser I would definitely prefer DeviceContext::default_context and DeviceContext::default_context_for_device.
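
Roughly, the proposal could take the following shape. This is a sketch, not the merged code: the function names come from the comment above, while the struct fields (`batch_size`, `is_async`), the `DEFAULT_DEVICE_ID` constant, and the choice to have `default_config` delegate to device 0 are assumptions made for illustration.

```rust
/// Assumed default device id for this sketch.
const DEFAULT_DEVICE_ID: usize = 0;

/// Simplified, non-generic config; the fields other than `device_id`
/// are hypothetical placeholders.
#[derive(Debug, Clone, PartialEq, Eq)]
struct NTTConfig {
    device_id: usize,
    batch_size: usize,
    is_async: bool,
}

impl NTTConfig {
    /// Config for the default device.
    fn default_config() -> Self {
        Self::default_config_for_device(DEFAULT_DEVICE_ID)
    }

    /// Config bound to an explicit device.
    fn default_config_for_device(device_id: usize) -> Self {
        NTTConfig { device_id, batch_size: 1, is_async: false }
    }
}

/// Old free-floating entry point, kept only as a thin wrapper while it is
/// scheduled for deprecation.
#[deprecated(note = "use NTTConfig::default_config instead")]
fn get_default_ntt_config() -> NTTConfig {
    NTTConfig::default_config()
}
```

Keeping both initialisers as associated functions means no new free function is added for the per-device case, and callers of the old function get a compiler warning that points at the replacement.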

examples/rust/msm/src/main.rs

wrappers/rust/icicle-core/src/msm/tests.rs

DmytroTym

lgtm

## Contents of this release

- [FEAT]: support for multi-device execution: #356
- [FEAT]: full support for new mixed-radix NTT: #367, #368 and #371
- [FEAT]: examples for Poseidon hash and tree builder based on it (currently only on C++ side): #375
- [PERF]: MSM performance upgrades & zero point handling: #372

Vitalii and others added 6 commits January 27, 2024 15:10

remove outdated file

80c0237

minor

08e68e0

ntt multicard draft - tested from same host thread

14512b1

working multigpu ntt/msm

a216050

comment

4f344fb

send+sync for field

3f37835

vhnatyk requested review from DmytroTym, ChickenLover, jeremyfelder, ImmanuelSegol, yshekel, bigsky77 and LeonHibnik January 30, 2024 07:51

vhnatyk added 3 commits January 30, 2024 18:14

removed dummy test

6f8e009

device_id bound slice

5f635f8

rust format

aa324e3

vhnatyk marked this pull request as ready for review January 31, 2024 02:47

Merge branch 'dev' into develop/vhnat/Multi-card-support

4614092

jeremyfelder reviewed Jan 31, 2024

wrappers/rust/icicle-core/Cargo.toml (outdated)

ChickenLover requested changes Jan 31, 2024

bigsky77 reviewed Jan 31, 2024

wrappers/rust/icicle-core/src/msm/tests.rs

Vitalii added 3 commits February 1, 2024 12:25

removed set_device_id from slice - pr comments

eb040af

pr comments

7341864

warning - unused import

1efdb3b

DmytroTym reviewed Feb 2, 2024

wrappers/rust/icicle-core/src/ntt/mod.rs (outdated)

Vitalii added 3 commits February 4, 2024 10:56

removed implicit set_device

5d2cd3e

removed todo

74e2bdd

fix - still required

831e45a

ChickenLover reviewed Feb 13, 2024

wrappers/rust/rust-toolchain (outdated)

DmytroTym reviewed Feb 13, 2024

Vitalii added 7 commits February 13, 2024 23:22

restored all way

3da9099

fix for missed reason

fb35b27

domain-per-device as array

f1c04eb

Merge remote-tracking branch 'upstream/dev' into develop/vhnat/Multi-…

effd2c1

…card-support

fix for concurrent domain initialization

c3b8708

cleanup

02a915d

refactor config and domain

4e1e9e4

vhnatyk requested review from DmytroTym, ChickenLover, bigsky77 and yshekel February 14, 2024 13:37

Vitalii added 3 commits February 14, 2024 15:16

fix mutex

032e5a0

removed deprecated method as discussed

80e0e3d

msm config refatoring

dd60929

DmytroTym reviewed Feb 14, 2024

examples/rust/msm/src/main.rs

DmytroTym reviewed Feb 14, 2024

wrappers/rust/icicle-core/src/msm/tests.rs (outdated)

Vitalii added 2 commits February 14, 2024 18:25

pr comment

c4565a9

same for rest config objects

a1619df

DmytroTym approved these changes Feb 14, 2024

yshekel approved these changes Feb 14, 2024

Vitalii added 3 commits February 14, 2024 19:46

Merge commit '0d70a0c003c44755af3c4cbfe5c3cdb04ad33d85' into develop/…

632d558

…vhnat/Multi-card-support

cargo fmt

1c6a40f

possible fix for windows build

eef6876

vhnatyk merged commit 7742509 into ingonyama-zk:dev Feb 14, 2024
14 checks passed

DmytroTym mentioned this pull request Feb 15, 2024

Release v1.4.0 #378

Merged

vhnatyk deleted the develop/vhnat/Multi-card-support branch February 16, 2024 07:11
