-
Notifications
You must be signed in to change notification settings - Fork 5.9k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Netdata is not killing nvidia-smi process on exit #7143
Comments
I cant reproduce it. I cant check it on a real pc - ubuntu 12 is too old. What i did:
I see
Then i do
And check again
I see no zombie process, Killed by
@vobruba-martin you are saying that |
The Before shut down:
Immediately after calling
After some time when there is no netdata process:
|
Does it exit if you do?
|
Can you kill those nvidia-smi processes that are left over? Are they zombie processes? Are they unkillable? What is your hardware exactly? Is it possible that this is a hardware issue? Maybe upgrading your GPU driver solves this issue. There are reports in the internet of unkillable nvidia-smi processes from various tools. |
@vobruba-martin please check #7372, the problem should be fixed (or not)
And install it. |
try to execute
if it hangs, then there is no reason to try #7372 |
No!
It doesn't hang.
Will try it later if the info above will not help.
Yes, I can kill them with
HP ProLiant DL380p Gen8 Server with NVIDIA Tesla T4 GPU. |
Just modify your --- collectors/python.d.plugin/nvidia_smi/nvidia_smi.chart.py (date 1574785782000)
+++ collectors/python.d.plugin/nvidia_smi/nvidia_smi.chart.py (date 1574846528946)
@@ -346,14 +346,7 @@
self.poller = NvidiaSMIPoller(poll)
def get_data(self):
- if not self.poller.is_started():
- self.poller.start()
-
- if not self.poller.is_alive():
- self.debug('poller is off')
- return None
-
- last_data = self.poller.data()
+ last_data = self.poller.run_once()
if not last_data:
return None
|
@ilyam8 Yes, the problem is fixed with this change. |
@vobruba-martin i made it configurable, you need to off loop mode in the config file after #7372 is merged. loop_mode: no |
I've found one disadvantage of
|
You can filter these messages using syslog filters. If you put this line in the
|
See https://www.rsyslog.com/doc/v5-stable/configuration/filters.html Ok it is not
ubuntu12 |
Thanks! |
Bug report summary
If I shut down netdata I see that nvidia-smi process is not killed. Number of running nvidia-smi processes keeps growing if I do several netdata restarts.
OS / Environment
Ubuntu 12.04
Netdata version (ouput of
netdata -V
)netdata v1.18.0-44-nightly
Component Name
python.d/nvidia_smi
Steps To Reproduce
Enable nvidia-smi module. Start&Stop netdata.
Expected behavior
nvidia-smi process should not be running.
The text was updated successfully, but these errors were encountered: