Pipelined Swarm Training

Swarm training "framework" using Haiku + JAX + Ray.

Designed (eventually) for training large language models in a model-parallel fashion across unreliable, heterogeneous nodes.

See swarm_run.py for an example of running a character-level transformer on enwik8.
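For readers unfamiliar with character-level language modeling, here is a minimal sketch of how raw bytes (such as those in enwik8) can be turned into next-token training examples. It is not taken from swarm_run.py or loader.py: the helper names `encode_chars` and `make_examples`, the non-overlapping windowing, and the tiny stand-in sample are all assumptions made for illustration.

```python
# Illustrative sketch only; these helpers are hypothetical and are not part of this repo.
from typing import Iterator, List, Tuple


def encode_chars(text: bytes) -> List[int]:
    """Map each byte to an integer token id in [0, 255]."""
    return list(text)


def make_examples(
    tokens: List[int], seq_len: int
) -> Iterator[Tuple[List[int], List[int]]]:
    """Yield non-overlapping (inputs, targets) pairs; targets are inputs shifted by one."""
    # Each window holds seq_len + 1 tokens so every input position has a next-token target.
    for start in range(0, len(tokens) - seq_len, seq_len):
        window = tokens[start:start + seq_len + 1]
        yield window[:-1], window[1:]


# Short stand-in for the raw enwik8 bytes (assumption: the real file is read as bytes).
sample = b"<mediawiki xml"
examples = list(make_examples(encode_chars(sample), seq_len=4))
```

With a byte-level vocabulary there is no tokenizer to train: the vocabulary size is fixed at 256, and the model simply learns to predict the next byte at every position.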
