support for running containers on nodes with GPU #194

Open
anaderi opened this Issue Mar 31, 2017 · 0 comments

anaderi commented Mar 31, 2017

To run code inside a container that uses an NVIDIA GPU, one has to pass certain options to docker that bind the devices and drivers into the container (of course, the image being run should be built accordingly -- TODO: describe on wiki).
There is a helper project that takes care of everything: https://github.com/NVIDIA/nvidia-docker. It creates a special service (see https://github.com/NVIDIA/nvidia-docker/wiki/nvidia-docker-plugin) that listens on http://localhost:3476 and reports the additional arguments docker needs. For example, http://localhost:3476/docker/cli/json returns JSON with all the additional arguments that enable running GPU-specific tools, e.g. nvidia-smi, inside a container.
In particular, the response includes device arguments that docker-py version 2.0+ can understand.
So the task would be to

  • upgrade docker-py to 2.2.1
  • if running on a single docker host (not swarm): check whether nvidia-docker-plugin is available and, if it is, get the arguments and pass them to docker.client.run
  • if running on docker swarm, there is a blocker: NVIDIA/nvidia-docker#141
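
A minimal sketch of the single-host case, to make the idea concrete. It assumes the plugin's JSON response has `Devices`, `Volumes` and `VolumeDriver` fields (my reading of the nvidia-docker-plugin wiki, not verified against a live plugin), and it maps them onto the `devices`, `volumes` and `volume_driver` keyword arguments of `containers.run` in docker-py 2.x. The helper names `fetch_plugin_args` and `plugin_args_to_run_kwargs` are hypothetical.

```python
import json
import urllib.error
import urllib.request

# Endpoint named in the issue; adjust if the plugin listens elsewhere.
PLUGIN_URL = "http://localhost:3476/docker/cli/json"


def fetch_plugin_args(url=PLUGIN_URL, timeout=2.0):
    """Return the plugin's JSON payload, or None if the plugin is unreachable."""
    try:
        with urllib.request.urlopen(url, timeout=timeout) as resp:
            return json.loads(resp.read().decode("utf-8"))
    except (urllib.error.URLError, OSError, ValueError):
        # No plugin, a timeout, or a non-JSON body: treat as "no GPU support".
        return None


def plugin_args_to_run_kwargs(payload):
    """Translate the plugin payload into docker-py run() keyword arguments.

    Assumed payload shape (not confirmed by the issue):
      {"VolumeDriver": "...", "Volumes": ["name:/path:ro"], "Devices": ["/dev/..."]}
    """
    kwargs = {}

    devices = payload.get("Devices") or []
    if devices:
        # docker-py expects "host_path:container_path:permissions" strings.
        kwargs["devices"] = ["{0}:{0}:rwm".format(d) for d in devices]

    volumes = {}
    for spec in payload.get("Volumes") or []:
        parts = spec.split(":")
        if len(parts) < 2:
            continue  # skip entries that do not name a mount point
        mode = parts[2] if len(parts) > 2 else "rw"
        volumes[parts[0]] = {"bind": parts[1], "mode": mode}
    if volumes:
        kwargs["volumes"] = volumes

    if payload.get("VolumeDriver"):
        kwargs["volume_driver"] = payload["VolumeDriver"]

    return kwargs


# Usage (needs docker-py 2.x and a running daemon; the image name is a placeholder):
#   import docker
#   payload = fetch_plugin_args()
#   extra = plugin_args_to_run_kwargs(payload) if payload else {}
#   docker.from_env().containers.run("some-cuda-image", "nvidia-smi", **extra)
```

If the plugin is not reachable, `fetch_plugin_args` returns `None` and the container can be started without the extra arguments, so non-GPU nodes keep working unchanged.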

malchun self-assigned this May 11, 2017
