Cuda devices monitoring #218

Merged

@igor-davidyuk (Contributor) commented Oct 28, 2021

An attempt to allow assigning GPU devices through OpenFL.
The PR introduces an optional 'device monitor' plugin for Envoy and two information flows:

  1. GPU status goes from Envoy through Director to the Frontend.
  2. The GPU utilization policy travels the same path in the reverse order, from the Frontend through the Director to the Envoy.
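
The two flows above can be sketched roughly as follows. This is a minimal illustration, not the code from this PR: `DeviceStatus`, `StaticDeviceMonitor`, the `Director` methods, and the default policy value are all hypothetical names and choices made for the example, and the real plugin would query the GPU driver instead of returning fixed readings.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass
class DeviceStatus:
    """Hypothetical per-GPU record an Envoy's device monitor might report."""
    index: int
    memory_total_mb: int
    memory_used_mb: int


class StaticDeviceMonitor:
    """Stand-in for a device monitor plugin: returns fixed readings.

    A real plugin would query the driver; that part is deliberately omitted.
    """

    def __init__(self, devices: List[DeviceStatus]) -> None:
        self._devices = list(devices)

    def get_status(self) -> List[DeviceStatus]:
        return list(self._devices)


class Director:
    """Keeps the latest status per Envoy and the policy set by the Frontend."""

    def __init__(self) -> None:
        self.envoy_status: Dict[str, List[DeviceStatus]] = {}
        self.policy = 'CPU_ONLY'  # assumed default for this sketch

    # Flow 2, first hop: Frontend -> Director.
    def set_policy(self, policy: str) -> None:
        self.policy = policy

    # Flow 1, first hop: Envoy -> Director. The return value carries the
    # policy back to the Envoy (flow 2, second hop).
    def report_status(self, envoy: str, status: List[DeviceStatus]) -> str:
        self.envoy_status[envoy] = status
        return self.policy

    # Flow 1, second hop: Director -> Frontend.
    def list_status(self) -> Dict[str, List[DeviceStatus]]:
        return dict(self.envoy_status)


director = Director()
gpu_envoy = StaticDeviceMonitor([DeviceStatus(0, 16000, 1200)])
cpu_envoy = StaticDeviceMonitor([])  # an Envoy with no GPUs reports nothing

policy_for_gpu_envoy = director.report_status('envoy_gpu', gpu_envoy.get_status())
policy_for_cpu_envoy = director.report_status('envoy_cpu', cpu_envoy.get_status())
```

Piggybacking the policy on the status reply keeps both flows on a single Envoy-initiated call, which is one plausible design; the PR itself may separate them.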

The Director_Pytorch_Kvasir_UNET example is modified to utilize the new infrastructure.
There are 2 envoys, one that utilizes GPUs and one that does not.

Device assignment for an experiment is controlled by a 'device assignment policy', which may be 'CUDA_PREFERRED' or 'CPU_ONLY'.
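
As a rough sketch of how such a policy might be resolved on an Envoy (an assumption for illustration, not the logic merged here): the helper `resolve_device` is hypothetical, it treats any policy other than 'CPU_ONLY' as the CUDA-preferring one, and it falls back to the CPU when the Envoy reports no GPUs. The `cuda:<index>` strings follow the PyTorch device naming convention.

```python
from typing import Sequence


def resolve_device(policy: str, cuda_indices: Sequence[int]) -> str:
    """Return a device string for one Envoy under the given policy."""
    # CPU is used when the policy demands it or when no GPU is available.
    if policy == 'CPU_ONLY' or not cuda_indices:
        return 'cpu'
    # Any other policy is treated as the CUDA-preferring one in this sketch;
    # the first reported GPU is picked, which is a simplifying assumption.
    return f'cuda:{cuda_indices[0]}'
```

With this helper, an Envoy without GPUs resolves to the CPU under either policy, which matches the two-Envoy setup described for the example.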

TODO:

  • Update docs
  • Update example Envoy config files
  • Update interactive API tests

Code discussion was in #162

@igor-davidyuk changed the title from "Cuda devices rebase 3" to "Cuda devices rebase Third attempt" Oct 28, 2021
@igor-davidyuk changed the title from "Cuda devices rebase Third attempt" to "Cuda devices monitoring" Oct 29, 2021
@alexey-gruzdev merged commit e8ff532 into securefederatedai:develop Oct 29, 2021
@github-actions bot locked and limited conversation to collaborators Oct 29, 2021
@alexey-gruzdev added this to the v1.2.1 milestone Nov 1, 2021
@alexey-gruzdev added the enhancement (New feature or request) label Nov 19, 2021