Skip to content
This repository has been archived by the owner on Jun 6, 2024. It is now read-only.

add gpu temperature detection #2757

Merged
merged 1 commit into from
May 15, 2019
Merged

add gpu temperature detection #2757

merged 1 commit into from
May 15, 2019

Conversation

xudifsd
Copy link
Member

@xudifsd xudifsd commented May 14, 2019

required by DLWS, may also be useful to pai.

return self.gpu_util == o.gpu_util and \
self.gpu_mem_util == o.gpu_mem_util and \
self.pids == o.pids and \
self.ecc_errors == o.ecc_errors and \
self.minor == o.minor and \
self.uuid == o.uuid
self.uuid == o.uuid and \
self.temperature == o.temperature


def parse_smi_xml_result(smi):
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

It's better to use NVIDIA System Management Interface (NVML), or its Python bindings to get GPU status, which is backwards compatible, while nvidia-smi's output is not guaranteed to be backwards compatible according to its docs.

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Ok, will try to investigate and use them in future PR.

@coveralls
Copy link

Coverage Status

Coverage remained the same at 53.064% when pulling e6d19fd on dixu/gpu-temperature into a32a51c on master.

3 similar comments
@coveralls
Copy link

Coverage Status

Coverage remained the same at 53.064% when pulling e6d19fd on dixu/gpu-temperature into a32a51c on master.

@coveralls
Copy link

Coverage Status

Coverage remained the same at 53.064% when pulling e6d19fd on dixu/gpu-temperature into a32a51c on master.

@coveralls
Copy link

Coverage Status

Coverage remained the same at 53.064% when pulling e6d19fd on dixu/gpu-temperature into a32a51c on master.

@xudifsd xudifsd merged commit 101ccad into master May 15, 2019
@xudifsd xudifsd deleted the dixu/gpu-temperature branch May 15, 2019 03:26
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

4 participants