"omiagent" segfault libnss_dns on Linux (scx provider) #96
Comments
Is there any activity on this? Even after almost a year we are still having these same problems with a number of our nodes. |
This is still a very prominent issue in Azure. Is there seriously no work being put into this anymore? It's a broken tool that's causing production VMs to run out of disk space. |
We ended up creating a cron job to delete the core files (hopefully frequently enough to keep the disk from filling) rather than wait for a fix from Microsoft that, it appears, will never come. |
@srice01 Are you still having this issue with the latest versions?
|
Yes (I am using CentOS 7.6.1810).
|
@srice01 Could you please open a support ticket and ask them to engage me (joburati)? |
I am assuming you mean for me to create a support ticket in Azure. This is support request 119092422001455. |
Thanks @srice01, will get in touch with you via the ticket and try to get this moving. |
@srice01 Good news, I was able to fix the problem on your image. The issue is that the DSCForLinux extension installs version 1.1.1-294 of the dsc package, and this version causes omiagent to segfault. Installing version 1.1.1-926 fixes the issue. All those cases are related to this issue:
I have already submitted a fix to bump the version of the dsc package: I am following up with PG internally for them to merge and push the fix: Meanwhile, you can fix the issue by installing the package manually:
I hope this helps. |
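For anyone checking a fleet of nodes, the version comparison described above can be sketched in Python. This is only an illustration: the helper names `parse_dsc_version` and `has_segfault_fix` are hypothetical, the version format `major.minor.patch-build` is inferred from the two version strings quoted above, and treating every build at or after 1.1.1-926 as fixed is an assumption rather than something confirmed in this thread.

```python
import re
from typing import Tuple

# Version reported above as fixing the omiagent segfault.
FIXED_DSC_VERSION = "1.1.1-926"


def parse_dsc_version(version: str) -> Tuple[int, ...]:
    """Split a string such as '1.1.1-294' into a tuple of integers.

    Assumes the 'major.minor.patch-build' layout seen in this thread.
    """
    match = re.fullmatch(r"(\d+)\.(\d+)\.(\d+)-(\d+)", version.strip())
    if match is None:
        raise ValueError(f"unrecognised dsc version: {version!r}")
    return tuple(int(part) for part in match.groups())


def has_segfault_fix(installed: str, fixed: str = FIXED_DSC_VERSION) -> bool:
    """Return True if `installed` is at or after the fixed version.

    Assumption: builds newer than the fixed one also contain the fix.
    """
    return parse_dsc_version(installed) >= parse_dsc_version(fixed)


if __name__ == "__main__":
    for candidate in ("1.1.1-294", "1.1.1-926"):
        print(candidate, has_segfault_fix(candidate))
```

The installed version string has to come from your package manager; how you query it is left out here on purpose.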
@johanburati - This is indeed good news. Given that the DSCForLinux extension is installed by Azure (not by us), I take it your changes are to make sure the fixed version is installed by default in future? |
If you are having this issue, check Azure/azure-linux-extensions#875 for details and a solution. |
It has been 24 hours since I installed the update and I have seen no core dumps, so I believe this is now resolved. |
Copied over from microsoft/omi#491 (please see this for full communication on this issue).
On our RM provisioned VMs in Azure we noticed that the root partition is filling up with large numbers of "core.###" files in the /var/opt/omi/run directory.
Further investigation shows segmentation faults (in /var/log/messages) as follows:
Environment information:
Operating System: CentOS Release 7.4.1708 (fully patched, that is, "yum update" shows no updates pending).
So far the workaround has been to write a cron job (!) to periodically wipe the core files but obviously this is not an ideal situation.
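For reference, a cleanup of this kind can be sketched in Python as below. It is a minimal sketch, not the actual job we run: the function name `remove_core_files`, the `min_age_seconds` parameter, and the one-hour threshold are all hypothetical choices. Only the directory and the `core.###` naming come from the description above.

```python
import time
from pathlib import Path
from typing import List

# Directory where the core files accumulate, as described above.
CORE_DIR = Path("/var/opt/omi/run")


def remove_core_files(directory: Path, min_age_seconds: float = 0.0) -> List[Path]:
    """Delete regular files named core.* in `directory`.

    Files modified less than `min_age_seconds` ago are left alone, so a
    dump that is still being written or inspected is not removed.
    Returns the list of paths that were deleted.
    """
    removed: List[Path] = []
    if not directory.is_dir():
        return removed
    now = time.time()
    for path in directory.glob("core.*"):
        # Only touch plain files; skip directories and symlinks.
        if path.is_symlink() or not path.is_file():
            continue
        try:
            if now - path.stat().st_mtime < min_age_seconds:
                continue
            path.unlink()
        except OSError:
            # The file vanished or could not be removed; move on.
            continue
        removed.append(path)
    return removed


if __name__ == "__main__":
    # Hypothetical threshold: keep dumps from the last hour.
    for deleted in remove_core_files(CORE_DIR, min_age_seconds=3600):
        print(f"removed {deleted}")
```

Run from cron at whatever interval keeps the root partition from filling. This only hides the symptom; omiagent keeps crashing until the underlying problem is fixed.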
Further information from "JumpingYang001":
The following debug info shows that omiagent loaded the scx provider:
(gdb) info sharedlibrary
(*): Shared library is missing debugging information.
(gdb)
The crash is at 0x00007fa58e405e00, which is in /lib64/libnss_dns.so.2. That matches the segmentation faults in your /var/log/messages.
Here are the threads:
Please let me know if any further debug information is required.