Torch error when testing a model trained with multiple GPUs #736

lukeyeager · 2016-05-13T23:23:03Z

digits.inference.errors.InferenceError:
  torch classify one task failed with error message -
  ...6-05-12/install/share/lua/5.1/cunn/DataParallelTable.lua:374:
  Model was serialized on 2 nGPUs, but you are running on 1 please set 
  DataParallelTable.deserializeNGPUs to ignore  serialized tower-GPU assignments

@gheinrich Would #732 fix this?

The text was updated successfully, but these errors were encountered:

Datapoints: MNIST+LeNet (30 epochs) 1 GPU: 56s 2 GPUs: 2m51s (not unexpected due to communication overhead) Upscaled CIFAR + Alexnet (10 epochs): 1 GPU: 13m11s 2 GPUs: 13m7s Upscaled CIFAR + Googlenet (2 epochs): 1 GPU: 16m20s 2 GPUs: 11m13s Fix NVIDIA#736

gheinrich · 2016-05-17T11:22:42Z

Thanks for the bug report, I have updated the commit on #734 to fix this (with the new programming model we also need to set the number of GPUs when we deserialize a model when doing inference or fine-tuning).

lukeyeager · 2016-05-17T16:57:34Z

^ I think you meant #732?

gheinrich · 2016-05-17T19:26:27Z

I think you meant #732?

Whoops. Indeed!

Datapoints: MNIST+LeNet (30 epochs) 1 GPU: 56s 2 GPUs: 2m51s (not unexpected due to communication overhead) Upscaled CIFAR + Alexnet (10 epochs): 1 GPU: 13m11s 2 GPUs: 13m7s Upscaled CIFAR + Googlenet (2 epochs): 1 GPU: 16m20s 2 GPUs: 11m13s Fix NVIDIA#736

lukeyeager added bug torch labels May 13, 2016

lukeyeager closed this as completed in c30c4c8 May 17, 2016

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Torch error when testing a model trained with multiple GPUs #736

Torch error when testing a model trained with multiple GPUs #736

lukeyeager commented May 13, 2016 •

edited

Loading

gheinrich commented May 17, 2016

lukeyeager commented May 17, 2016

gheinrich commented May 17, 2016

Torch error when testing a model trained with multiple GPUs #736

Torch error when testing a model trained with multiple GPUs #736

Comments

lukeyeager commented May 13, 2016 • edited Loading

gheinrich commented May 17, 2016

lukeyeager commented May 17, 2016

gheinrich commented May 17, 2016

lukeyeager commented May 13, 2016 •

edited

Loading