use too much CPU resource #54

matthew-z · 2018-11-08T14:50:02Z

Hi, gpustat -i uses about 80% CPU on my machine, is it expected or a bug?
In contrast, nvidia-smi -l 1 uses less than 10%.

OS: Ubuntu 18.04
Nv Driver: 410.48
CUDA: 10.0.130
CPU: AMD Threadripper 1900x
GPU: 2080Ti + 1080

The text was updated successfully, but these errors were encountered:

Stonesjtu · 2018-11-09T01:18:15Z

whats your interval value

matthew-z · 2018-11-09T05:10:43Z

I didn't set, but gpustat -i 1 will reproduce the same result.

Stonesjtu · 2018-11-09T07:30:33Z

Can you test if watch -n 1 nvidia-smi and watch -n gpustat use the same amount of CPU time.

matthew-z · 2018-11-09T07:35:23Z

I tested, and with watch -n 1 they use the same amount of CPU time (about 0-20%).

matthew-z · 2018-11-09T07:40:53Z

Hi, I just found that the CPU time problem of gpustat -i can be solved by running nvidia-smi daemon first.

wookayin · 2018-11-09T21:20:40Z

A difference is that in the watch mode (i.e. gpustat -i) handle resources are fetched at every time step, which is somewhat expensive. Therefore we could optimize in a way that GPU handles are fetched only once in the beginning, and use the (cached) resources. This would be possible in the watch mode as the gpustat process won't terminate until interrupted.

wookayin · 2018-11-09T21:23:10Z

Top four most expensive operations:

 36.50%  36.50%    4.59s     4.59s   nvmlDeviceGetHandleByIndex (pynvml.py:945)
 16.50%  16.50%    1.85s     1.85s   nvmlDeviceGetPowerUsage (pynvml.py:1289)
  9.50%   9.50%    1.18s     1.18s   nvmlDeviceGetUtilizationRates (pynvml.py:1379)
  7.50%   7.50%   0.805s    0.805s   nvmlDeviceGetComputeRunningProcesses (pynvml.py:1435)

Stonesjtu · 2018-11-10T03:26:38Z

@wookayin Good point, I'm working on that.

Querying power status (current draw and limit -- may be cached?) is quite expensive. Thus we can skip querying power status unless it is explicitly requested. This leads to 50% less CPU usage in my case. The default behavior of `new_query()` still includes power information. TODO: Add unit test case for the behavior

wookayin · 2019-02-24T02:12:58Z

Working on this as #61.

In my case querying power usage is most expensive, so I made it optional whenever possible. Could anybody check whether it leads to less CPU usage?

rkooo567 · 2023-05-31T01:12:08Z

Has this issue been resolved? I am observing this behavior from https://github.com/ray-project/ray/ when we run gpustat.new_query() repetitively at GCE.

rkooo567 · 2023-05-31T01:14:29Z

Lots of time is spent on NvmlInit & shutdown & nvmlDeviceGetHandleByIndex

Lots of time is spent on nvmlInit() and nvmlShutdown() for each new_query call. When running in a loop mode (-i), we do not need to initialize and shutdown the nvml library because nvml APIs will be used throughout the lifespan of the gpustat process. Upon importing `gpustat.pynvml`, nvmlInit() will always be called.

wookayin · 2023-11-24T07:33:54Z

In the recent versions of pynvml, nvmlDeviceGetHandleByIndex doesn't seem to be a bottleneck according to profiling result (If this is still slow, please let me know) so I did not optimize on redundant calls of nvmlDeviceGetHandleByIndex. #166 makes nvmlInit() called only once, so it should have some performance benefit.

Stonesjtu added a commit that referenced this issue Nov 9, 2018

Add reference to issue #54

51d63ee

wookayin added the enhancement label Nov 10, 2018

wookayin added this to the 0.6 milestone Nov 10, 2018

Stonesjtu mentioned this issue Feb 8, 2019

[WIP] Fix redundant nvml init by cached handles #56

Closed

wookayin mentioned this issue Feb 24, 2019

Improve speed by sparingly querying nvml #61

Closed

wookayin modified the milestones: 0.6, 1.0 Jul 22, 2019

This comment was marked as resolved.

Sign in to view

wookayin modified the milestones: 1.0, 1.1 Aug 13, 2021

wookayin modified the milestones: 1.1, 1.2 Mar 2, 2023

wookayin mentioned this issue Nov 22, 2023

perf: Improve performance by reducing redundant pynvml.nvmlInit call #166

Merged

wookayin closed this as completed in #166 Nov 24, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

use too much CPU resource #54

use too much CPU resource #54

matthew-z commented Nov 8, 2018 •

edited

Loading

Stonesjtu commented Nov 9, 2018

matthew-z commented Nov 9, 2018

Stonesjtu commented Nov 9, 2018

matthew-z commented Nov 9, 2018

matthew-z commented Nov 9, 2018

wookayin commented Nov 9, 2018 •

edited

Loading

wookayin commented Nov 9, 2018

Stonesjtu commented Nov 10, 2018

wookayin commented Feb 24, 2019 •

edited

Loading

This comment was marked as resolved.

This comment was marked as resolved.

rkooo567 commented May 31, 2023

rkooo567 commented May 31, 2023

wookayin commented Nov 24, 2023

use too much CPU resource #54

use too much CPU resource #54

Comments

matthew-z commented Nov 8, 2018 • edited Loading

Stonesjtu commented Nov 9, 2018

matthew-z commented Nov 9, 2018

Stonesjtu commented Nov 9, 2018

matthew-z commented Nov 9, 2018

matthew-z commented Nov 9, 2018

wookayin commented Nov 9, 2018 • edited Loading

wookayin commented Nov 9, 2018

Stonesjtu commented Nov 10, 2018

wookayin commented Feb 24, 2019 • edited Loading

This comment was marked as resolved.

This comment was marked as resolved.

rkooo567 commented May 31, 2023

rkooo567 commented May 31, 2023

wookayin commented Nov 24, 2023

matthew-z commented Nov 8, 2018 •

edited

Loading

wookayin commented Nov 9, 2018 •

edited

Loading

wookayin commented Feb 24, 2019 •

edited

Loading