Kubernetes on NVIDIA GPUs
Switch branches/tags
Nothing to show
Clone or download
Pramod Ramarao and guptaNswati Update README for running Kubernetes on NVIDIA GPUs
Signed-off-by: Swati Gupta <swatig@nvidia.com>
Latest commit 875873b Jun 19, 2018
Permalink
Failed to load latest commit information.
.github Merge pull request #54114 from xiangpengzhao/fix-pr-template Oct 31, 2017
Godeps Update vendor of google.golang.org/api repo Feb 10, 2018
api Regenerate the core API May 2, 2018
build Bump debian-iptables-amd64 digest for release 1.9 Mar 20, 2018
cluster Update kube-dns to Version 1.14.10. Major changes: - Fix a bug in DNS… Apr 16, 2018
cmd Add admission controller to convert old GPU spec to new GPU spec May 10, 2018
docs Regenerate the core API May 2, 2018
examples Update generated files Nov 9, 2017
hack Add admission controller to convert old GPU spec to new GPU spec May 10, 2018
logo Don't use strokes in the logo SVG Oct 12, 2017
pkg Merge branch 'scheduler-attributes' into 'master' May 10, 2018
plugin Add admission controller to convert old GPU spec to new GPU spec May 10, 2018
staging Updated Event test to take into account the new binding API May 2, 2018
test Update container manager with new device manager interface May 10, 2018
third_party Update generated files Nov 9, 2017
translations Merge pull request #47584 from zjj2wry/translate Nov 16, 2017
vendor Update vendor of google.golang.org/api repo Feb 10, 2018
.bazelrc move build related files out of the root directory May 15, 2017
.generated_files Move .generated_docs to docs/ so docs OWNERS can review / approve Feb 16, 2017
.gitattributes Hide openapi-spec in diffs Nov 5, 2017
.gitignore make clean will remove all gitignored files Sep 4, 2017
.kazelcfg.json Switch from gazel to kazel, and move kazelcfg into build/root Jul 18, 2017
BUILD.bazel move build related files out of the root directory May 15, 2017
CHANGELOG-1.9.md Add/Update CHANGELOG-1.9.md for v1.9.6. Mar 21, 2018
CHANGELOG.md Restore CHANGELOG.md Dec 1, 2017
CONTRIBUTING.md Close kubernetes/community#420 Mar 8, 2017
LICENSE LICENSE: revert modifications to Apache license Nov 22, 2016
Makefile move build related files out of the root directory May 15, 2017
Makefile.generated_files move build related files out of the root directory May 15, 2017
OWNERS Fix my incorrect username in #46649 Aug 10, 2017
OWNERS_ALIASES Add bsalamat to milestone maintainers Oct 26, 2017
README.md Update README for running Kubernetes on NVIDIA GPUs Jun 19, 2018
SUPPORT.md Add a SUPPORT.md file for github Aug 11, 2017
Vagrantfile Customizable vagrant rsync args and excludes Nov 14, 2016
WORKSPACE move build related files out of the root directory May 15, 2017
code-of-conduct.md Change code of conduct to call CNCF CoC by reference Oct 19, 2016
labels.yaml Merge pull request #51848 from xiangpengzhao/milestone-label Sep 5, 2017

README.md

Kubernetes on NVIDIA GPUs

Submit Queue Widget GoDoc Widget CII Best Practices


Kubernetes is an open source system for managing containerized applications across multiple hosts, providing basic mechanisms for deployment, maintenance, and scaling of applications.

Kubernetes builds upon a decade and a half of experience at Google running production workloads at scale using a system called Borg, combined with best-of-breed ideas and practices from the community.

Kubernetes on NVIDIA GPUs includes support for GPUs and enhancements to Kubernetes, so users can easily configure and use GPU resources for accelerating deep learning workloads.


To start using Kubernetes

Get started with Kubernetes on NVIDIA GPUs by reviewing the installation guide.

The general Kubernetes documentation is available at kubernetes.io.

Support

General troubleshooting guidelines are available in the documentation. Feel free to also open an issue on GitHub or post questions on the NVIDIA Developer Forums.

For general Kubernetes issues, start with the troubleshooting guide.

Release Highlights

Supported Platforms

This release of Kubernetes is supported on the following platforms.

On-Prem

Cloud

NVIDIA GPU Cloud virtual machine images available on Amazon EC2 and Google Cloud Platform.

New Features

  • Support for NVIDIA GPUs in Kubernetes using the NVIDIA device plugin
  • Support for GPU attributes such as GPU type and memory requirements via the Kubernetes PodSpec
  • Visualize and monitor GPU metrics and health with an integrated GPU monitoring stack of NVIDIA DCGM, Prometheus and Grafana
  • Support for Docker and CRI-O using the NVIDIA Container Runtime

Analytics