Armada

Armada is a multi-Kubernetes cluster batch job scheduler.

Armada is designed to address the following issues:

A single Kubernetes cluster cannot be scaled indefinitely, and managing very large Kubernetes clusters is challenging. Hence, Armada is a multi-cluster scheduler built on top of several Kubernetes clusters.
Achieving very high throughput using the in-cluster storage backend, etcd, is challenging. Hence, queueing and scheduling are performed partly out-of-cluster using a specialized storage layer.

Armada is designed primarily for machine learning, AI, and data analytics workloads, and to:

Manage compute clusters composed of tens of thousands of nodes in total.
Schedule a thousand or more pods per second, on average.
Enqueue tens of thousands of jobs over a few seconds.
Divide resources fairly between users.
Provide visibility for users and admins.
Ensure near-constant uptime.
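
To make the goal of dividing resources fairly a little more concrete, here is a minimal sketch of weighted max-min fair sharing. It is an illustration only and is not taken from Armada's scheduler: the function name `fair_shares`, the queue names, and the single scalar "capacity" are all assumptions made for the example, and weights are assumed to be positive.

```python
def fair_shares(capacity, demands, weights=None):
    """Weighted max-min fair division of `capacity` among queues.

    Hypothetical helper for illustration; not part of Armada's API.
    `demands` maps queue name -> requested amount,
    `weights` maps queue name -> positive weight (defaults to 1 each).
    """
    weights = weights or {q: 1.0 for q in demands}
    alloc = {q: 0.0 for q in demands}
    active = {q for q, d in demands.items() if d > 0}
    remaining = float(capacity)

    while active and remaining > 1e-9:
        total_w = sum(weights[q] for q in active)
        satisfied = set()
        spent = 0.0
        for q in active:
            # Offer each active queue a slice proportional to its weight,
            # but never hand out more than the queue still needs.
            offer = remaining * weights[q] / total_w
            give = min(offer, demands[q] - alloc[q])
            alloc[q] += give
            spent += give
            if alloc[q] >= demands[q] - 1e-9:
                satisfied.add(q)
        remaining -= spent
        if not satisfied:
            break  # every queue took its full offer, so nothing is left over
        # Fully served queues drop out; their unused share is re-divided.
        active -= satisfied
    return alloc


if __name__ == "__main__":
    shares = fair_shares(100, {"alice": 20, "bob": 60, "carol": 60})
    for queue, amount in sorted(shares.items()):
        print(queue, round(amount, 2))
```

In this sketch a queue that asks for less than its equal share receives everything it asked for, and the capacity it leaves unused is split among the queues that still want more.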

Armada is a CNCF Sandbox project used in production at G-Research.

For an overview of Armada, see these videos:

Armada adheres to the CNCF Code of Conduct.

Documentation

For an overview of the architecture and design of Armada, and instructions for submitting jobs, see:

For a full developer guide, see:

Development guide

For API reference, see:

API Documentation

We expect readers of the documentation to have a basic understanding of Docker and Kubernetes; see, e.g., the following links:

Contributions

Thank you for considering contributing to Armada! We want everyone to feel that they can contribute to the Armada project. Every contribution is valuable, whether it fixes a bug, implements a new feature, improves the documentation, or suggests an enhancement. We appreciate the time and effort you put into making this project better for everyone. For more information about contributing to Armada, see CONTRIBUTING.md; before you start, please also read CODE_OF_CONDUCT.md.

Discussion

If you are interested in discussing Armada you can find us on
