In model parallelism (not data parallelism!), the different partitions of the model typically execute sequentially, one GPU after another. Approaches like GPipe (Huang et al., 2018) pipeline the execution, keeping each GPU busy instead of idling while waiting for the others. However, the implementation is pretty complicated. Can we do something similar using just the tf.keras API? Turns out, sort of.
In the diagrams below, only the forward pass is annotated with NVTX.
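(In case you want to reproduce the annotations: one option is the nvtx Python package, which is what I'm assuming below; the range name is arbitrary and the layer is just a stand-in.)

```python
import nvtx
import tensorflow as tf

x = tf.random.normal([32, 1024])
dense = tf.keras.layers.Dense(4096, activation="relu")

# The named range shows up on the Nsight Systems timeline,
# making it easy to see where the forward pass starts and ends.
with nvtx.annotate("forward_1", color="green"):
    y = dense(x)
```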
Normal Model Parallel
Here, forward_1 and forward_2 run on different GPUs sequentially.
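For context, here's a minimal sketch of this "normal" setup, assuming a two-GPU box and a toy two-layer model (the class name and layer sizes are made up for illustration):

```python
import tensorflow as tf
from tensorflow import keras

class TwoStageModel(keras.Model):
    """Toy model split across two GPUs; layer sizes are placeholders."""

    def __init__(self):
        super().__init__()
        self.forward_1 = keras.layers.Dense(4096, activation="relu")
        self.forward_2 = keras.layers.Dense(10)

    def call(self, x):
        # forward_1 lives on GPU 0, forward_2 on GPU 1; they run one
        # after the other because forward_2 depends on forward_1's output.
        with tf.device("/GPU:0"):
            x = self.forward_1(x)
        with tf.device("/GPU:1"):
            return self.forward_2(x)

model = TwoStageModel()
model.compile(optimizer="adam", loss="sparse_categorical_crossentropy")
```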
Pipeline Model Parallel
Here, forward_1 and forward_2 also run on different GPUs. However, the data is partitioned, and the forward pass happens as follows:
- data is split into data_1 and data_2
- GPU_0 runs forward_1(data_1)
- GPU_0 runs forward_1(data_2) while GPU_1 runs forward_2(data_1)
- GPU_1 runs forward_2(data_2)
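Here's a rough sketch of how that schedule can be expressed as a subclassed keras.Model (again, the class name and layer sizes are placeholders, not the exact model used here). The overlap relies on TensorFlow noticing that forward_1(data_2) and forward_2(data_1) have no data dependency on each other and scheduling them concurrently once the call is traced into a tf.function, which model.fit does by default:

```python
import tensorflow as tf
from tensorflow import keras

class PipelinedTwoStageModel(keras.Model):
    """Same toy split as above, but the batch is cut into two micro-batches."""

    def __init__(self):
        super().__init__()
        self.forward_1 = keras.layers.Dense(4096, activation="relu")
        self.forward_2 = keras.layers.Dense(10)

    def call(self, data):
        # Split the incoming batch into data_1 and data_2.
        data_1, data_2 = tf.split(data, num_or_size_splits=2, axis=0)

        with tf.device("/GPU:0"):
            h_1 = self.forward_1(data_1)  # step 1
            h_2 = self.forward_1(data_2)  # step 2a: independent of forward_2(h_1)
        with tf.device("/GPU:1"):
            out_1 = self.forward_2(h_1)   # step 2b: can overlap with step 2a
            out_2 = self.forward_2(h_2)   # step 3
        return tf.concat([out_1, out_2], axis=0)
```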
The end result is overlapped execution of the forward pass (and MemCopy too!):
With a simple toy model, the training throughput increase is about 10%.
However, since the backprop part doesn't seem to get any speedup, the overall benefit is limited. Perhaps the next step is to figure out how to do the equivalent for backprop?