This project demonstrates distributed, GPU-accelerated training of a Transformer model on Google Kubernetes Engine (GKE) on Google Cloud Platform (GCP).
The project combines four main technologies to do this:
- TensorFlow 2 on Python 3 for distributed model training and for runtime control of the training process, including integration with Kubernetes and Google Cloud (a minimal sketch of the distributed-training entry point follows this list).
- Docker to package the code as a container image so it can run in a standard way on an execution platform (in this case Kubernetes).
- Kubernetes as a distributed execution platform.
- Google Cloud to provide file storage and to host the Kubernetes cluster.
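
To give a concrete feel for how these pieces fit together, here is a minimal sketch of what a multi-worker TensorFlow 2 entry point typically looks like when run on Kubernetes. It is illustrative only, not this project's actual training code: `build_model` and `make_dataset` are hypothetical placeholders, and the `TF_CONFIG` environment variable is assumed to be injected into each pod by the Kubernetes manifests.

```python
# Minimal sketch of TF2 multi-worker training as typically run on Kubernetes.
# Each pod runs this same script; TF_CONFIG (set per pod, e.g. by a Kubernetes
# manifest) tells it which workers exist and which task index it holds.
# build_model() and make_dataset() are hypothetical placeholders, not the
# project's real Transformer or input pipeline.
import json
import os

import tensorflow as tf


def build_model() -> tf.keras.Model:
    # Placeholder: the real project builds a Transformer here.
    return tf.keras.Sequential([
        tf.keras.layers.Dense(128, activation="relu"),
        tf.keras.layers.Dense(10),
    ])


def make_dataset() -> tf.data.Dataset:
    # Placeholder: the real project would stream training data from
    # cloud storage rather than generate it randomly.
    x = tf.random.uniform((1024, 64))
    y = tf.random.uniform((1024,), maxval=10, dtype=tf.int32)
    return tf.data.Dataset.from_tensor_slices((x, y)).shuffle(1024).batch(32)


def main() -> None:
    # Inspect the cluster description Kubernetes injected (empty when run
    # locally, in which case the strategy falls back to a single worker).
    tf_config = json.loads(os.environ.get("TF_CONFIG", "{}"))
    print("Running with TF_CONFIG:", tf_config)

    # MultiWorkerMirroredStrategy replicates the model on every worker (and
    # every GPU within each worker) and all-reduces gradients between them.
    strategy = tf.distribute.MultiWorkerMirroredStrategy()

    with strategy.scope():
        model = build_model()
        model.compile(
            optimizer="adam",
            loss=tf.keras.losses.SparseCategoricalCrossentropy(from_logits=True),
        )

    model.fit(make_dataset(), epochs=2)


if __name__ == "__main__":
    main()
```

Every pod in the cluster runs this same script; the strategy reads `TF_CONFIG` to discover its peers, so adding workers is a matter of changing the Kubernetes manifest rather than the training code.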
The project assumes a basic understanding of all of the above technologies; it is intended as a demonstration of how to run them together end-to-end, not as a first introduction to any one of them.
To get started with the project, see documentation/running.md.
For feedback, questions, or contributions, see documentation/contributing.md.
Developed in association with ML Collective.