
cifar10 - multi GPU training #162

Closed
sergeimonakhov opened this issue Sep 12, 2018 · 8 comments

@sergeimonakhov

Hi, I have 5 AMD cards on an X470 system. When I run python3 ./cifar10_multi_gpu_train.py --num_gpus=5, only the one card in the x16 PCIe slot is visible. How can I get the other cards, which sit in x1 PCIe slots, to work?
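A quick way to confirm which GPUs TensorFlow actually enumerates before launching the script (a minimal sketch, assuming the TensorFlow 1.x API that cifar10_multi_gpu_train.py targets):

```python
# Minimal sketch: list the GPU devices TensorFlow can see.
# Only devices that pass the ROCm ISA/PCIe checks will show up here.
from tensorflow.python.client import device_lib

local_devices = device_lib.list_local_devices()
gpus = [d for d in local_devices if d.device_type == "GPU"]
print("TensorFlow sees %d GPU(s)" % len(gpus))
for d in gpus:
    # physical_device_desc includes the PCI bus id and the card name
    print("  %s -> %s" % (d.name, d.physical_device_desc))
```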

@whchung (Collaborator) commented Sep 12, 2018

@D1abloRUS please refer to https://github.com/RadeonOpenCompute/ROCm#supported-cpus. The RX 470 is in the GFX8 family, and we don't support GFX8 cards on x1 PCIe yet.

@sergeimonakhov (Author)

@whchung Hmm, OK. What about gfx7xx? I get:

2018-09-12 17:16:59.854955: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1532] Ignoring visible gpu device (device: 2, name: Hawaii XT [Radeon R9 290X], pci bus id: 0000:03:00.0) with AMDGPU ISA gfx701. The minimum required AMDGPU ISA is gfx803.

How can I run it?
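One way to check which ISA each card reports before launching TensorFlow (a rough sketch; it assumes rocminfo from the ROCm packages is on the PATH, and its output format can differ between ROCm releases):

```python
# Rough sketch: collect the gfx ISA strings that rocminfo reports for the installed GPUs.
# Anything below gfx803 will be ignored by this TensorFlow-ROCm build, as in the log above.
import re
import subprocess

output = subprocess.check_output(["rocminfo"]).decode("utf-8", errors="replace")
isas = sorted(set(re.findall(r"gfx[0-9a-f]+", output)))
print("ISAs reported by rocminfo:", isas)
```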

@whchung (Collaborator) commented Sep 12, 2018

Unfortunately, the Hawaii (GFX7) family is not on the roadmap. Quite a few DNN algorithms in MIOpen are implemented in GFX-specific assembly, so we are focusing only on GFX8/GFX9 and upcoming architectures.

@sergeimonakhov (Author) commented Sep 12, 2018

@whchung

> The RX 470 is in the GFX8 family, and we don't support GFX8 cards on x1 PCIe yet.

What about x8?

@whchung (Collaborator) commented Sep 12, 2018

x8 should work. Please check:
https://rocm.github.io/hardware.html

@dagamayank

/cc @jlgreathouse to confirm supported hw list.

@jlgreathouse

Hi @D1abloRUS

When you say "x8", "x1", etc., the key question is how these GPUs are connected to your CPU. In particular, gfx8 GPUs require PCIe Gen 3 atomics at every step between the CPU and the GPUs. Many people running multiple GPUs through x1 lanes are using PCIe switches to split multiple ports off a single port. One of the major impediments here is that your PCIe switches must know how to properly forward PCIe atomic commands.

Note that this is true for "x1" or "x8". So if your "PCIe x8" solution also has a switch in between your CPU and your GPU(s), you will also need to make sure this switch properly handles atomics.
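One rough way to see whether the kernel driver rejected a GPU over atomics is to scan the kernel log for amdkfd messages (just a sketch; the exact message text varies across kernel and ROCm versions, and reading dmesg may require root):

```python
# Rough sketch: filter the kernel log for amdkfd ("kfd") lines that mention atomics.
# A message along the lines of "skipped device ..., PCI rejects atomics" would suggest
# the path between the CPU and that GPU does not forward PCIe 3.0 atomic operations.
import subprocess

kernel_log = subprocess.check_output(["dmesg"]).decode("utf-8", errors="replace")
for line in kernel_log.splitlines():
    if "kfd" in line and "atomic" in line.lower():
        print(line)
```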

Towards that end, I'll ask:

  • What CPU are you using?
  • What motherboard are you using?
  • How are you connecting your GPUs to that motherboard?
    • In other words, if your motherboard has X PCIe slots, which slot is each of your GPUs connected to, and how?

Thanks.

@sunway513

Closing this ticket as there has been no further feedback.
@D1abloRUS feel free to reopen it if you have further questions.

deven-amd pushed a commit that referenced this issue Oct 11, 2019
This PR is a stepping stone towards supporting generic multi-store
source loop nests in affine loop fusion. It extends the algorithm to
support fusion of multi-store loop nests that:
 1. have only one store that writes to a function-local live out, and
 2. the remaining stores are involved in loop nest self dependences
    or no dependences within the function.

Closes #162

COPYBARA_INTEGRATE_REVIEW=tensorflow/mlir#162 from dcaballe:dcaballe/multi-output-fusion 7fb7dec6fe8b45f5ce176f018bfe37b256420c45
PiperOrigin-RevId: 273773907
deven-amd pushed a commit that referenced this issue Nov 19, 2019