
fix for hrStorageIndex agility #15028

Merged
merged 5 commits into from Jul 7, 2023
Merged

Conversation

peejaychilds
Contributor


It is possible that HOST-RESOURCES-MIB::hrStorageTable has changed between discovery and polling. If we assume it hasn't, we can end up assigning the values of one mount point to another. This can cause issues not only with display but also with alerting.

We can compare the expected description with the one found at the stored hrStorageIndex; if they are not equal, we filter the table to return the entry whose description does match.
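LibreNMS implements this in PHP (includes/polling/storage/hrstorage.inc.php); the following is a rough, language-neutral sketch in Python of the lookup logic described above. The function name and table shape are illustrative assumptions, not the actual LibreNMS code.

```python
def resolve_storage_entry(table, stored_index, expected_descr):
    """Return the hrStorageTable row matching expected_descr.

    'table' maps hrStorageIndex -> row dict (as walked from
    HOST-RESOURCES-MIB::hrStorageTable). If the row at the index saved
    during discovery no longer carries the expected description, fall
    back to a scan by description -- the "quickfix until discovery runs"
    behaviour from this PR.
    """
    entry = table.get(stored_index)
    if entry is not None and entry['hrStorageDescr'] == expected_descr:
        return entry  # index still valid, use it directly
    # Index moved (e.g. a tmpfs mount shifted the table); re-match by description.
    for index, row in table.items():
        if row['hrStorageDescr'] == expected_descr:
            print(f"Storage {expected_descr} changed index "
                  f"{stored_index} > {index}, applying quickfix until discovery runs")
            return row
    return None  # entry vanished entirely; skip the update this cycle
```

If no entry matches the stored description, the safe choice is to skip the update rather than write another mount point's values into the row.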

We have found that on JunOS EVO, whenever anyone logs into the box it mounts

tmpfs 3.1G 0 3.1G 0% /run/user/{uid}

which changes the ordering of HOST-RESOURCES-MIB::hrStorageTable and causes all sorts of issues, such as storage alarms firing because one partition's values got muddled up with those of a suppressed partition (e.g. a loopback mount) with 0% free space.

Please note

Please read this information carefully. You can run ./lnms dev:check to check your code before submitting.

  • Have you followed our code guidelines?
  • If my Pull Request does some changes/fixes/enhancements in the WebUI, I have inserted a screenshot of it.
  • If my Pull Request makes discovery/polling/yaml changes, I have added/updated test data.

Testers

If you would like to test this pull request then please run: ./scripts/github-apply <pr_id>, e.g. ./scripts/github-apply 5926
After you are done testing, you can remove the changes with ./scripts/github-remove. If there are schema changes, you can ask on discord how to revert.

@PJGuyTen

PJGuyTen commented May 9, 2023

Thank you for looking into this! I've been trying to figure out a fix for this for many versions!

@Jellyfrog
Member

Jellyfrog commented May 9, 2023

Would it be possible to index on something else, like mountpoint instead? Guessing it breaks the design too much?

@peejaychilds
Contributor Author

Would it be possible to index on something else, like mountpoint instead? Guessing it breaks the design too much?

HOST-RESOURCES-MIB::hrStorageTable doesn't have the mount point unfortunately.

    29 => 
    array (
      'hrStorageIndex' => '29',
      'hrStorageType' => 'hrStorageFixedDisk',
      'hrStorageDescr' => 're0:/dev/sda6, mounted on /config',
      'hrStorageAllocationUnits' => '4096',
      'hrStorageSize' => '251878',
      'hrStorageUsed' => '375',
      'hrStorageAllocationFailures' => '0',
    ),
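Since the table above has no dedicated mount-point column, indexing by mount point would mean parsing it out of hrStorageDescr, whose format is vendor-specific. A hedged sketch in Python (the helper name and regex are illustrative assumptions) shows why that is fragile as a stable key:

```python
import re

def mount_point_from_descr(descr):
    """Best-effort mount point extraction from hrStorageDescr.

    JunOS EVO embeds it as '..., mounted on /path'; net-snmp on Linux
    typically reports the bare path (e.g. '/boot'); other agents report
    labels like 'Memory buffers' with no path at all. The format is
    implementation-specific, which is why hrStorageDescr cannot be
    parsed uniformly across vendors.
    """
    m = re.search(r'mounted on (\S+)$', descr)
    if m:
        return m.group(1)
    return descr if descr.startswith('/') else None
```

This is why the fix matches on the full description string rather than trying to derive a mount point.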

@murrant
Member

murrant commented May 10, 2023

Mounts changing hrStorageIndex is a really annoying bug in your snmpd implementation...

@peejaychilds
Contributor Author

@murrant tested, works OK.

Test with JunOS EVO box

Pre-Test: set device 42 into a non-polling poller group (used group 5, TYO)
Test Command - ./poller.php -d -v -h 42 -m storage
Examine re0:/dev/sda1, mounted on /boot (id=1579)

Control Test
On one dispatcher perform the following

  1. 'unpatch my patch' >> git checkout includes/polling/storage/hrstorage.inc.php
  2. run ./poller.php -d -v -h 42 -m storage > hr-test1
  3. log user into device
  4. run ./poller.php -d -v -h 42 -m storage > hr-test2
  5. log user out of device
  6. run ./poller.php -d -v -h 42 -m storage > hr-test3
grep 1579] hr-te*
hr-test1:SQL[UPDATE `storage` set `storage_used`=?,`storage_free`=?,`storage_size`=?,`storage_units`=?,`storage_perc`=? WHERE `storage_id` = ? ["19242496","186197504","205440000",512,9,1579] 1.07ms]
hr-test2:SQL[UPDATE `storage` set `storage_used`=?,`storage_free`=?,`storage_size`=?,`storage_units`=?,`storage_perc`=? WHERE `storage_id` = ? ["39051264","0","39051264",8192,100,1579] 0.91ms]
hr-test3:SQL[UPDATE `storage` set `storage_used`=?,`storage_free`=?,`storage_size`=?,`storage_units`=?,`storage_perc`=? WHERE `storage_id` = ? ["19242496","186197504","205440000",512,9,1579] 1.03ms]
grep "/boot" hr-te* | grep :hrStorage
hr-test1:hrStorageDescr.25 = re0:/dev/sda1, mounted on /boot
hr-test2:hrStorageDescr.26 = re0:/dev/sda1, mounted on /boot
hr-test3:hrStorageDescr.25 = re0:/dev/sda1, mounted on /boot

Results -> Notice the index changes, and storage_used, storage_free, etc. follow the index, so they are set incorrectly in 'test2' for id=1579.

Patch Test

On second dispatcher perform the following

  1. 'unpatch' >> git checkout includes/polling/storage/hrstorage.inc.php
  2. apply new patch via ./scripts/github-apply 15028
  3. run ./poller.php -d -v -h 42 -m storage > hr-test1
  4. log user into device
  5. run ./poller.php -d -v -h 42 -m storage > hr-test2
  6. log user out of device
  7. run ./poller.php -d -v -h 42 -m storage > hr-test3
grep 1579] hr-te*
hr-test1:SQL[UPDATE `storage` set `storage_used`=?,`storage_free`=?,`storage_size`=?,`storage_units`=?,`storage_perc`=? WHERE `storage_id` = ? ["19242496","186197504","205440000",512,9,1579] 1.37ms]
hr-test2:SQL[UPDATE `storage` set `storage_used`=?,`storage_free`=?,`storage_size`=?,`storage_units`=?,`storage_perc`=? WHERE `storage_id` = ? ["19242496","186197504","205440000",512,9,1579] 0.46ms]
hr-test3:SQL[UPDATE `storage` set `storage_used`=?,`storage_free`=?,`storage_size`=?,`storage_units`=?,`storage_perc`=? WHERE `storage_id` = ? ["19242496","186197504","205440000",512,9,1579] 0.83ms]
grep "/boot" hr-te* | grep :hrStorage
hr-test1:hrStorageDescr.25 = re0:/dev/sda1, mounted on /boot
hr-test2:hrStorageDescr.26 = re0:/dev/sda1, mounted on /boot
hr-test3:hrStorageDescr.25 = re0:/dev/sda1, mounted on /boot
 grep quick hr*
hr-test2:Storage re0:/dev/sda5, mounted on /etc changed index 27 > 28, applying quickfix until discovery runs
hr-test2:Storage re0:/dev/sda6, mounted on /config changed index 28 > 29, applying quickfix until discovery runs
hr-test2:Storage re0:/dev/sda7, mounted on /var changed index 29 > 30, applying quickfix until discovery runs
hr-test2:Storage re0:/dev/sda2, mounted on /soft changed index 26 > 27, applying quickfix until discovery runs
hr-test2:Storage re0:/dev/loop0, mounted on /data/var/external changed index 9 > 10, applying quickfix until discovery runs
hr-test2:Storage re0:/dev/sda1, mounted on /boot changed index 25 > 26, applying quickfix until discovery runs

Results -> Notice the index changes, HOWEVER storage_used, storage_free, etc. follow the description, so they are set correctly in 'test2' for id=1579.

Post-Test: set the device back to poller group 0.

@murrant murrant merged commit 7e22b12 into librenms:master Jul 7, 2023
8 checks passed
@librenms-bot

This pull request has been mentioned on LibreNMS Community. There might be relevant details there:

https://community.librenms.org/t/23-7-0-changelog/21841/1

TheMysteriousX pushed a commit to TheMysteriousX/librenms that referenced this pull request Aug 9, 2023
* fix for hrStorageIndex agility

* test for array

* Handle not found data

* Handle description changed correctly

* remove debug

---------

Co-authored-by: Tony Murray <murraytony@gmail.com>
peejaychilds added a commit to peejaychilds/librenms that referenced this pull request Oct 26, 2023
* fix for hrStorageIndex agility

* test for array

* Handle not found data

* Handle description changed correctly

* remove debug

---------

Co-authored-by: Tony Murray <murraytony@gmail.com>
@peejaychilds peejaychilds deleted the hrstorage_fix branch November 22, 2023 02:32