Armada

Armada is a multi-Kubernetes cluster batch job scheduler.

Armada is designed to address the following issues:

A single Kubernetes cluster cannot be scaled indefinitely, and managing very large Kubernetes clusters is challenging. Hence, Armada is a multi-cluster scheduler built on top of several Kubernetes clusters.
Achieving very high throughput using the in-cluster storage backend, etcd, is challenging. Hence, queueing and scheduling are performed partly out-of-cluster using a specialized storage layer.

Armada is designed primarily for machine learning, AI, and data analytics workloads.

Armada is a CNCF Sandbox project used in production at G-Research.

For an overview of Armada, see this video.

Documentation

For an overview of the architecture and design of Armada, and instructions for submitting jobs, see:

For instructions on how to set up and develop Armada, see:

For API reference, see:

We expect readers of the documentation to have a basic understanding of Docker and Kubernetes; see, e.g., the following links:

Armada follows the CNCF Code of Conduct.
