This PR adds 2 more charts:
Both charts show the number of errors/s for each memory controller.
and 3 new alarms:
The charts and the alarms have been configured to not be added if there are errors at all. They will be added when the first errors occurs (in which case, alarms will be dispatched too).
To check if your system has this capability, fetch for http://your.netdata.ip:19999/netdata.conf and search for this section:
# directory to monitor = /sys/devices/system/edac/mc
# enable ECC memory correctable errors = auto
# enable ECC memory uncorrectable errors = auto
If the later 2 lines exist, your system has this capability. You can also set them to yes (instead of auto), to view the (empty) charts at the memory section of the dashboard and verify there are alarms for them. (if they are not empty, they should show up by themselves).
detect ECC memory correctable and uncorrectable errors; fixes #1508
added health.d/memory.conf to make files
fix ondemand switch for hwcorrupted memory
move hwcorrupted chart to ecc family