NCPA not working properly #643
df -gt
Filesystem GB blocks Used Free %Used Mounted on
mount
node mounted mounted over vfs date options
detnas.onemagnify.corp /dbrestore /mnt/dbrestore nfs3 Jun 14 01:26 rw,bg,hard,rsize=32768,wsize=32768,timeo=600,vers=3,sec=sys
df -gt /mnt/oracledump
Filesystem GB blocks Used Free %Used Mounted on
date
Wed Jun 17 13:14:05 EDT 2020 |
rpm -qa|grep -i ncpa
ncpa-2.2.1-1.ppc
oslevel -s
7100-05-05-1939
hostname
vendprodcdb.marketingassociates.com
uname -uM
IBM,9119-MHE IBM,02219A857
prtconf|head -15
System Model: IBM,9119-MHE
date
Wed Jun 17 15:48:09 EDT 2020 |
Will the mountpoint be shown in the output of the following command?:
|
I don't see that mountpoint in the output of this command:
[root@manxi01 ~]# /usr/local/nagios/libexec/check_ncpa.py -H vendprodcdb.marketingassociates.com -t 'mytoken12345' -P 5693 -M 'disk/logical' -l
But the mount point still exists on the server vendprodcdb.marketingassociates.com. Not sure why it is not being monitored by Nagios. |
I found this line in previous posts from you:
Is this mountpoint an NFS share? If it is an NFS share, can you try to set
#
# Display all mounted disk partitions
# (essentially setting all=True here: https://psutil.readthedocs.io/en/latest/#psutil.disk_partitions)
# Default: 1
#
all_partitions = 1
#
# Excluded file system types removes these fs types from the disk metrics
# (This is mostly only noteable on UNIX systems but also works on Windows if you need it)
# Default: aufs,autofs,binfmt_misc,cifs,cgroup,configfs,debugfs,devpts,devtmpfs,
# encryptfs,efivarfs,fuse,fusectl,hugetlbfs,mqueue,nfs,overlayfs,proc,pstore,
# rpc_pipefs,securityfs,selinuxfs,smb,sysfs,tmpfs,tracefs
#
exclude_fs_types = aufs,autofs,binfmt_misc,cifs,cgroup,configfs,debugfs,devpts,devtmpfs,encryptfs,efivarfs,fuse,fusectl,hugetlbfs,mqueue,nfs,overlayfs,proc,pstore,rpc_pipefs,securityfs,selinuxfs,smb,sysfs,tmpfs,tracefs
Then restart the ncpa_listener service and try it again. |
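For readers following along, here is a rough sketch of how an `exclude_fs_types` list could hide a mount from the disk metrics. It is not NCPA's actual code. The `Partition` tuple, both helper functions, and the sample partitions are hypothetical, and exact string matching on the file system type is an assumption the thread does not confirm.

```python
from typing import Iterable, List, NamedTuple, Set


class Partition(NamedTuple):
    """Hypothetical stand-in for one entry of a partition listing."""
    device: str
    mountpoint: str
    fstype: str


def parse_exclude_list(raw: str) -> Set[str]:
    """Split a comma-separated exclude_fs_types value into a set of names."""
    return {item.strip().lower() for item in raw.split(",") if item.strip()}


def visible_partitions(partitions: Iterable[Partition],
                       excluded: Set[str]) -> List[Partition]:
    """Keep partitions whose fstype is not excluded (exact match: an assumption)."""
    return [p for p in partitions if p.fstype.lower() not in excluded]


if __name__ == "__main__":
    # Illustrative data only; not taken from the affected server.
    sample = [
        Partition("/dev/hd4", "/", "jfs2"),
        Partition("nas:/export", "/mnt/share", "nfs"),
    ]
    with_nfs = parse_exclude_list("nfs,tmpfs,proc")
    without_nfs = parse_exclude_list("tmpfs,proc")
    print([p.mountpoint for p in visible_partitions(sample, with_nfs)])
    print([p.mountpoint for p in visible_partitions(sample, without_nfs)])
```

Under these assumptions, the NFS mount only appears once `nfs` is removed from the exclude list, which is why the setting is worth checking first.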
The given setting is already in place on the server.
cat /usr/local/ncpa/etc/ncpa.cfg|grep -i exclude
# Excluded file system types removes these fs types from the disk metrics
exclude_fs_types = aufs,autofs,binfmt_misc,cgroup,configfs,debugfs,devpts,devtmpfs,encryptfs,efivarfs,fuse,fusectl,hugetlbfs,mqueue,overlayfs,proc,pstore,rpc_pipefs,securityfs,selinuxfs,smb,sysfs,tmpfs,tracefs
cat /usr/local/ncpa/etc/ncpa.cfg|grep -i all_partitions
all_partitions = 1
hostname
vendprodcdb.marketingassociates.com
date
Thu Aug 13 13:29:16 EDT 2020 |
Any update? |
Can you post the output of the following commands?:
|
ls -lah /proc/diskstats
ls: Not a recognized flag: h
ls -la /proc/diskstats
ls: 0653-341 The file /proc/diskstats does not exist.
cat /proc/diskstats
cat: 0652-050 Cannot open /proc/diskstats.
ls -la /sys/block
ls: 0653-341 The file /sys/block does not exist.
ps aux | grep -i ncpa_listener
ps -ef|grep -i ncpa_listener
nagios 6751022 5833536 0 Aug 09 - 47:33 /usr/local/ncpa/ncpa_listener -n
date
Thu Aug 20 13:42:04 EDT 2020 |
Can you try to run ncpa with root privileges?:
|
I tried to run the ncpa agent with root privileges, but after changing the UID / GID to root the service does not come up.
bash-4.2# lssrc -a|grep -i ncpa |
Can you provide the latest lines of ncpa_listener.log from when you tried to start ncpa with root privileges? |
Can you set the |
lssrc -a|grep -i ncpa
ncpa_passive 5898946 active
cat /usr/local/ncpa/etc/ncpa.cfg|grep -i loglevel
Default: loglevel = info (debug, info, warning, error)
loglevel = debug
Default: loglevel = info (debug, info, warning, error)
loglevel = debug
date
Mon Aug 24 10:29:55 EDT 2020
Please let us know how long this debug logging should stay enabled. |
ls -ltr /var/log/ncpa_listener.log
ls: 0653-341 The file /var/log/ncpa_listener.log does not exist.
date
Mon Aug 24 10:33:01 EDT 2020
I don't see any logging there. |
Is the ncpa_listener now running with root privileges? |
The |
I can't see any suspicious behavior in the log file. Can you please try these two commands?:
|
[root@manxi01 ~]# /usr/local/nagios/libexec/check_ncpa.py -H vendprodcdb.marketingassociates.com -t 'mytoken12345' -P 5693 -M 'disk/physical' -l |
[root@manxi01 ~]# /usr/local/nagios/libexec/check_ncpa.py -H vendprodcdb.marketingassociates.com -t 'mytoken12345' -P 5693 -M 'disk/mountl' -l |
My mistake, there is a typo in the metrics portion of the command. Sorry.
Could you install the python module |
[root@manxi01 ~]# /usr/local/nagios/libexec/check_ncpa.py -H vendprodcdb.marketingassociates.com -t 'mytoken12345' -P 5693 -M 'disk/mount' -l
I'm checking on PSUTILS for AIX, as the affected system is AIX 7.2. I will update you shortly on that. |
PSUTILS is installed now.
rpm -qa|grep -i psutils
psutils-1.23-1.ppc
date
Tue Aug 25 14:45:32 EDT 2020 |
Cool, thank you!
|
We are doing this in a production environment. Please let me know if this affects any running services on the server. |
It should not have any impact. This is the way ncpa gets some disk information. |
./test_disk.sh
python: A file or directory in the path name does not exist. |
Can you try this command?:
|
/usr/bin/env python
python: A file or directory in the path name does not exist. |
I checked for the PSUTIL package and I don't see any available for installation on the AIX 7.1 server. |
Any update on this? |
As mentioned in #670 (comment), psutils will be updated with the next build (2.3.0). Maybe this will fix your issue too. |
Version |
I'm actually having problems with the AIX builds at the moment. Unfortunately, I don't think we will be able to build it again on there until Python 3, because Python 2 doesn't build well from source on the AIX systems I have access to and cx_Freeze doesn't work properly with Python 2.7 on AIX. |
Closing this since we won't be doing a new official AIX build until sometime after NCPA 3 is released. |
[nagios@manxi01.marketingassociates.com ~]$ /usr/local/nagios/libexec/check_ncpa.py -H vendprodcdb.marketingassociates.com -t 'mytoken12345' -P 5693 -M 'disk/logical/|mnt|oracledump' -w 80 -c 90
UNKNOWN: The node (/mnt/oracledump) requested does not exist. You may be trying to access the '/' node.
[root@manxi01 ~]# /usr/local/nagios/libexec/check_ncpa.py -H vendprodcdb.marketingassociates.com -t 'mytoken12345' -P 5693 -M 'disk/logical/|mnt|oracledump' -w 80 -c 90 -v -D
Connecting to: https://vendprodcdb.marketingassociates.com:5693/api/disk/logical/%7Cmnt%7Coracledump/?token=mytoken12345&warning=80&critical=90&check=1
File returned contained:
{
"returncode": 3,
"stdout": "UNKNOWN: The node (/mnt/oracledump) requested does not exist. You may be trying to access the '/' node."
}
UNKNOWN: The node (/mnt/oracledump) requested does not exist. You may be trying to access the '/' node.
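The `-v -D` output above shows the API URL that check_ncpa.py requests for `-M 'disk/logical/|mnt|oracledump'`. A minimal sketch that rebuilds the same URL, so the endpoint can be queried directly (for example with a browser or curl) when debugging a missing node. `build_check_url` is a hypothetical helper, not part of check_ncpa.py, and the URL layout is taken only from the debug output shown here.

```python
from urllib.parse import quote, urlencode


def build_check_url(host: str, port: int, token: str, mountpoint: str,
                    warning: int, critical: int) -> str:
    """Rebuild the disk/logical check URL seen in the debug output."""
    # NCPA addresses mount points with '|' in place of '/',
    # e.g. /mnt/oracledump becomes |mnt|oracledump.
    node = quote(mountpoint.replace("/", "|"), safe="")
    query = urlencode({
        "token": token,
        "warning": warning,
        "critical": critical,
        "check": 1,
    })
    return f"https://{host}:{port}/api/disk/logical/{node}/?{query}"


if __name__ == "__main__":
    print(build_check_url(
        "vendprodcdb.marketingassociates.com", 5693, "mytoken12345",
        "/mnt/oracledump", warning=80, critical=90,
    ))
```

Requesting the parent path `/api/disk/logical/` with the same token should list the nodes the agent actually exposes, which makes it easy to see whether the mount is missing on the agent side rather than in the check command.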