Real hostname: gn30.zhores (StarPU hostname: zhores.ais-gpu.starpu-1.4.4)
Environment variables
    STARPU_HOSTNAME=zhores.ais-gpu.starpu-1.4.4
StarPU has found :
56 CPU workers:
    CPU 0
    CPU 1
    CPU 2
    CPU 3
    CPU 4
    CPU 5
    CPU 6
    CPU 7
    CPU 8
    CPU 9
    CPU 10
    CPU 11
    CPU 12
    CPU 13
    CPU 14
    CPU 15
    CPU 16
    CPU 17
    CPU 18
    CPU 19
    CPU 20
    CPU 21
    CPU 22
    CPU 23
    CPU 24
    CPU 25
    CPU 26
    CPU 27
    CPU 28
    CPU 29
    CPU 30
    CPU 31
    CPU 32
    CPU 33
    CPU 34
    CPU 35
    CPU 36
    CPU 37
    CPU 38
    CPU 39
    CPU 40
    CPU 41
    CPU 42
    CPU 43
    CPU 44
    CPU 45
    CPU 46
    CPU 47
    CPU 48
    CPU 49
    CPU 50
    CPU 51
    CPU 52
    CPU 53
    CPU 54
    CPU 55
8 CUDA workers:
    CUDA 0.0 (NVIDIA A100-SXM4-80GB 71.2 GiB 27:00.0)
    CUDA 1.0 (NVIDIA A100-SXM4-80GB 71.2 GiB 2a:00.0)
    CUDA 2.0 (NVIDIA A100-SXM4-80GB 71.2 GiB 51:00.0)
    CUDA 3.0 (NVIDIA A100-SXM4-80GB 71.2 GiB 57:00.0)
    CUDA 4.0 (NVIDIA A100-SXM4-80GB 71.2 GiB 9e:00.0)
    CUDA 5.0 (NVIDIA A100-SXM4-80GB 71.2 GiB a4:00.0)
    CUDA 6.0 (NVIDIA A100-SXM4-80GB 71.2 GiB c7:00.0)
    CUDA 7.0 (NVIDIA A100-SXM4-80GB 71.2 GiB ca:00.0)
No OpenCL worker
No FPGA worker
No MPI_MS worker
No TCPIP_MS worker
No HIP worker

topology ...
(hwloc logical indexes)
numa 0  pack 0  core 0   PU 0   CUDA 0.0 (NVIDIA A100-SXM4-80GB 71.2 GiB 27:00.0)
                core 1   PU 1   CUDA 1.0 (NVIDIA A100-SXM4-80GB 71.2 GiB 2a:00.0)
                core 2   PU 2   CUDA 2.0 (NVIDIA A100-SXM4-80GB 71.2 GiB 51:00.0)
                core 3   PU 3   CPU 0
                core 4   PU 4   CPU 1
                core 5   PU 5   CPU 2
                core 6   PU 6   CPU 3
                core 7   PU 7   CPU 4
                core 8   PU 8   CPU 5
                core 9   PU 9   CPU 6
                core 10  PU 10  CPU 7
                core 11  PU 11  CPU 8
                core 12  PU 12  CPU 9
                core 13  PU 13  CPU 10
                core 14  PU 14  CPU 11
                core 15  PU 15  CPU 12
                core 16  PU 16  CPU 13
                core 17  PU 17  CPU 14
                core 18  PU 18  CPU 15
                core 19  PU 19  CPU 16
                core 20  PU 20  CPU 17
                core 21  PU 21  CPU 18
                core 22  PU 22  CPU 19
                core 23  PU 23  CPU 20
                core 24  PU 24  CPU 21
                core 25  PU 25  CPU 22
                core 26  PU 26  CPU 23
                core 27  PU 27  CPU 24
                core 28  PU 28  CPU 25
                core 29  PU 29  CPU 26
                core 30  PU 30  CPU 27
                core 31  PU 31  CPU 28
numa 1  pack 1  core 32  PU 32  CUDA 3.0 (NVIDIA A100-SXM4-80GB 71.2 GiB 57:00.0)
                core 33  PU 33  CUDA 4.0 (NVIDIA A100-SXM4-80GB 71.2 GiB 9e:00.0)
                core 34  PU 34  CUDA 5.0 (NVIDIA A100-SXM4-80GB 71.2 GiB a4:00.0)
                core 35  PU 35  CUDA 6.0 (NVIDIA A100-SXM4-80GB 71.2 GiB c7:00.0)
                core 36  PU 36  CUDA 7.0 (NVIDIA A100-SXM4-80GB 71.2 GiB ca:00.0)
                core 37  PU 37  CPU 29
                core 38  PU 38  CPU 30
                core 39  PU 39  CPU 31
                core 40  PU 40  CPU 32
                core 41  PU 41  CPU 33
                core 42  PU 42  CPU 34
                core 43  PU 43  CPU 35
                core 44  PU 44  CPU 36
                core 45  PU 45  CPU 37
                core 46  PU 46  CPU 38
                core 47  PU 47  CPU 39
                core 48  PU 48  CPU 40
                core 49  PU 49  CPU 41
                core 50  PU 50  CPU 42
                core 51  PU 51  CPU 43
                core 52  PU 52  CPU 44
                core 53  PU 53  CPU 45
                core 54  PU 54  CPU 46
                core 55  PU 55  CPU 47
                core 56  PU 56  CPU 48
                core 57  PU 57  CPU 49
                core 58  PU 58  CPU 50
                core 59  PU 59  CPU 51
                core 60  PU 60  CPU 52
                core 61  PU 61  CPU 53
                core 62  PU 62  CPU 54
                core 63  PU 63  CPU 55

bandwidth (MB/s) and latency (us)...
from/to  NUMA 0  CUDA 0  CUDA 1  CUDA 2  CUDA 3  CUDA 4  CUDA 5  CUDA 6  CUDA 7
NUMA 0        0   11508   14626   14675   14632   13534   14574   14580   14588
CUDA 0    11675       0   14754   14624   14614   14611   14678   14709   14705
CUDA 1    15307   13704       0  236542  240994  240604  241484  241585  241048
CUDA 2    15238   14517  243876       0  240905  243430  244061  244120  244174
CUDA 3    15290   13937  244394  246582       0  243745  243325  244119  243105
CUDA 4    15282   14553  240456  241586  243637       0  242883  244317  243701
CUDA 5    13949   15338  242639  244137  244110  247594       0  244177  244396
CUDA 6    13887   15326  242120  242093  244440  244385  248017       0  244753
CUDA 7    13978   15327  241820  242530  244382  244397  244414  247971       0

NUMA 0        0       0      10       9       9      10       9       9       9
CUDA 0        0       0      10       9       9      10       9       9       9
CUDA 1       12      12       0      14      14      14      14      14      14
CUDA 2       12      12      14       0      14      13      13      13      13
CUDA 3       11      12      14      13       0      13      13      13      13
CUDA 4       12      12      14      14      13       0      14      13      13
CUDA 5       12      12      13      13      12      12       0      12      12
CUDA 6       12      11      13      13      13      13      12       0      12
CUDA 7       12      11      13      13      13      13      12      12       0

GPU     NUMA in preference order (logical index), host-to-device, device-to-host
CUDA_0  0 1
CUDA_1  0 1
CUDA_2  0 1
CUDA_3  1 0
CUDA_4  1 0
CUDA_5  1 0
CUDA_6  1 0
CUDA_7  1 0
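
To compare links programmatically, the from/to table can be loaded into a nested dictionary. The following is a minimal sketch, not part of StarPU: the function name parse_matrix and the SAMPLE excerpt (first three nodes of the bandwidth table only) are hypothetical, and it assumes the table has one row per line with node labels of the form "NUMA n" or "CUDA n".

```python
import re

# Hypothetical excerpt: the first three nodes of the bandwidth table (MB/s).
SAMPLE = """\
from/to  NUMA 0  CUDA 0  CUDA 1
NUMA 0        0   11508   14626
CUDA 0    11675       0   14754
CUDA 1    15307   13704       0
"""

# A node label is two tokens, e.g. "NUMA 0" or "CUDA 1".
LABEL = re.compile(r"(?:NUMA|CUDA) \d+")


def parse_matrix(text):
    """Return {source: {destination: value}} for a from/to table."""
    lines = [ln.strip() for ln in text.splitlines() if ln.strip()]
    header = LABEL.findall(lines[0])  # destination labels, in column order
    matrix = {}
    for line in lines[1:]:
        m = LABEL.match(line)
        if m is None:
            continue  # skip anything that is not a data row
        values = [int(v) for v in line[m.end():].split()]
        if len(values) != len(header):
            raise ValueError(f"row {m.group(0)!r} has {len(values)} values, "
                             f"expected {len(header)}")
        matrix[m.group(0)] = dict(zip(header, values))
    return matrix


bw = parse_matrix(SAMPLE)
print(bw["CUDA 1"]["NUMA 0"])  # prints 15307
```

The same function can be applied to the latency block if the header line is prepended to it, since that block reuses the column order of the bandwidth table.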