Hi, I was trying to run the dbpedia text-classification example ( https://github.com/NVIDIA/DIGITS/tree/master/examples/text-classification ) on an Ubuntu 14.04 server with 4x Tesla K10s. The job started, but utilization is at 99% on only one GPU; the others are stuck at 0%. I can see luajit processes in nvidia-smi on the other GPUs, though, and they show 6-8% memory utilization, as opposed to the 50%+ on the GPU that actually appears to be in use during the training phase.
Is this expected behaviour?
creepyghost changed the title from "Utilization 0% on all but one GPU using torch via digits" to "Utilization 0% on all but one GPU using torch" on May 23, 2016
Hi @creepyghost, thanks for the feedback. Yes, this is the expected behaviour for now, as I haven't made the couple of changes required for multi-GPU training in the text-classification model. If you wish to try it as an exercise, you would need to encapsulate the model in a DataParallelTable, along the lines of the sketch below.
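A minimal sketch of the kind of change involved, not the exact DIGITS patch: it wraps an existing Torch network (assumed here to be held in a variable named `net`, which is hypothetical) in `nn.DataParallelTable`, so each minibatch is split along the batch dimension and a replica of the network runs on every visible GPU.

```lua
require 'cutorch'
require 'cunn'  -- provides nn.DataParallelTable

local nGPU = cutorch.getDeviceCount()
if nGPU > 1 then
   local gpus = torch.range(1, nGPU):totable()  -- e.g. {1, 2, 3, 4}
   -- Split inputs along dimension 1 (the batch dimension) and
   -- replicate the network on each of the listed GPUs.
   local dpt = nn.DataParallelTable(1)
   dpt:add(net, gpus)
   net = dpt:cuda()
end
```

Outputs and gradients are gathered back on the first GPU, so you would also want the batch size to be a multiple of the GPU count to keep the per-GPU shards balanced.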