Seems full offload works. But it uses only 1 GPU
Tue Jan 7 14:59:47 2025
+-----------------------------------------------------------------------------------------+
| NVIDIA-SMI 560.28.03 Driver Version: 560.28.03 CUDA Version: 12.6 |
|-----------------------------------------+------------------------+----------------------+
| GPU Name Persistence-M | Bus-Id Disp.A | Volatile Uncorr. ECC |
| Fan Temp Perf Pwr:Usage/Cap | Memory-Usage | GPU-Util Compute M. |
| | | MIG M. |
|=========================================+========================+======================|
| 0 NVIDIA GeForce RTX 3090 Off | 00000000:01:00.0 Off | N/A |
| 81% 69C P2 388W / 390W | 22265MiB / 24576MiB | 100% Default |
| | | N/A |
+-----------------------------------------+------------------------+----------------------+
| 1 NVIDIA GeForce RTX 3090 Off | 00000000:45:00.0 Off | N/A |
| 0% 38C P8 25W / 370W | 18MiB / 24576MiB | 0% Default |
| | | N/A |
+-----------------------------------------+------------------------+----------------------+
| 2 NVIDIA GeForce RTX 3090 Off | 00000000:C1:00.0 Off | N/A |
| 0% 57C P8 21W / 350W | 18MiB / 24576MiB | 0% Default |
| | | N/A |
+-----------------------------------------+------------------------+----------------------+
| 3 NVIDIA GeForce RTX 3090 Off | 00000000:C2:00.0 Off | N/A |
| 0% 49C P8 48W / 370W | 133MiB / 24576MiB | 1% Default |
| | | N/A |
+-----------------------------------------+------------------------+----------------------+
Is there any way to launch it on multiple GPUs? (And probably disable some offloads for better performance)
Seems full offload works. But it uses only 1 GPU
Is there any way to launch it on multiple GPUs? (And probably disable some offloads for better performance)