Skip to content
/ mesh Public
forked from tensorflow/mesh

Mesh TensorFlow: Model Parallelism Made Easier

License

Notifications You must be signed in to change notification settings

mzj14/mesh

 
 

Repository files navigation

Exploring RNN parallelism with Model Splitting

This repo is based on Mesh-Tensorflow. Please read its instruction before you continue with this readme.

Prerequisites

pip install mesh-tensorflow==0.0.5

Run

python rnn-bench.py --hidden_size=200 --train_epochs=1 --epochs_between_evals=1 --log_steps=1 --eval_steps=2 --mesh_shape="b1:2;b2:2" --layout="hidden_1:b1;hidden_2:b2"

Hint: Please change the mesh_shape and layout parameters based on your own cpu conditions.

Result

RNN training speed in different computation modes Amortized computation load in different computation modes
RNN training speed Amortized computation load

About

Mesh TensorFlow: Model Parallelism Made Easier

Topics

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Languages

  • Python 99.5%
  • Shell 0.5%