
Model Parallelism #5807

Closed
hondaathma opened this issue Apr 12, 2017 · 3 comments

Comments


hondaathma commented Apr 12, 2017

Does MXNet support model parallelism?

What I mean is: I have 2 GPUs (12 GB each). A single Caffe model (ResNet-152 with some changes and additions) exceeds 12 GB of memory and does not fit on a single GPU for training. How can I solve that problem with 2 GPUs?
Can I split this large model across both GPUs and make sure they communicate gradients with each other?
If so, what should I change in the solver/training files in MXNet?

hondaathma changed the title from "MultiModel Parallelism" to "Model Parallelism" on Apr 12, 2017
eric-haibin-lin (Member) commented Apr 12, 2017

http://mxnet.io/how_to/model_parallel_lstm.html
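
For anyone who can't reach that page: the mechanism the tutorial is built on is MXNet's ctx_group symbol attribute plus the group2ctx mapping supplied at bind time. Here is a minimal sketch of that idea (the layer names, sizes, and shapes are illustrative, not taken from the tutorial):

```python
import mxnet as mx

# Declare the first half of the network inside context group 'dev1'.
with mx.AttrScope(ctx_group='dev1'):
    data = mx.sym.Variable('data')
    fc1 = mx.sym.FullyConnected(data=data, num_hidden=512, name='fc1')
    act1 = mx.sym.Activation(data=fc1, act_type='relu', name='relu1')

# Declare the second half inside context group 'dev2'.
with mx.AttrScope(ctx_group='dev2'):
    fc2 = mx.sym.FullyConnected(data=act1, num_hidden=10, name='fc2')
    net = mx.sym.SoftmaxOutput(data=fc2, name='softmax')

# Map each group to a physical GPU when binding the executor; MXNet inserts
# the cross-device copies for activations and gradients at the group boundary.
group2ctx = {'dev1': mx.gpu(0), 'dev2': mx.gpu(1)}
exe = net.simple_bind(ctx=mx.gpu(0), group2ctx=group2ctx,
                      data=(32, 1024), softmax_label=(32,))
```

The same pattern should apply to a modified ResNet-152: wrap the earlier residual stages in one AttrScope and the later stages plus the classifier in another, so each GPU holds roughly half of the parameters and activations.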

KeyKy (Contributor) commented Aug 18, 2017

Are there any other, easier examples of model parallelism? I want to put ResNet on gpu2 and LeNet on gpu0.
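
One simple way to get that layout is imperative placement: a sketch only, assuming the Gluon API, with the Dense blocks below standing in as placeholders for the real LeNet/ResNet definitions, and assuming autograd records the cross-device copy so gradients flow back through it:

```python
import mxnet as mx
from mxnet import autograd, gluon, nd

gpu0, gpu2 = mx.gpu(0), mx.gpu(2)

# Placeholder blocks; substitute the real LeNet / ResNet networks here.
lenet_part = gluon.nn.Dense(256, activation='relu')
resnet_part = gluon.nn.Dense(10)

lenet_part.initialize(ctx=gpu0)    # parameters of the first part live on gpu0
resnet_part.initialize(ctx=gpu2)   # parameters of the second part live on gpu2

x = nd.random.uniform(shape=(32, 784), ctx=gpu0)
label = nd.zeros((32,), ctx=gpu2)
loss_fn = gluon.loss.SoftmaxCrossEntropyLoss()

with autograd.record():
    h = lenet_part(x)              # computed on gpu0
    h = h.as_in_context(gpu2)      # explicit device-to-device copy
    out = resnet_part(h)           # computed on gpu2
    loss = loss_fn(out, label)
loss.backward()                    # gradients propagate back through the copy
```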

tqchen closed this as completed on Oct 19, 2017
kaonashi-tyc commented

The example in the tutorial is no longer available.
