smartd_log: wrong chart type for several attributes #7388
Problem reported by @Stoatwblr. I tend to agree.
This should be a very quick fix. Can we do it?
In general, no SMART attributes should be treated as incremental unless we want a rate of change. They’re all inherently absolute counters or absolute gauges, because that’s quite simply how SMART works. All of the stuff listed in the OP should indeed be absolute, and they’re also all definitely counters (they all count discrete events). Most of the other attributes with things like [...] Any of the temperature attributes should be absolute gauges with special (and rather complicated) handling interpreting the raw values directly. I’m 99% certain that we handle all of these correctly, though. Possibly also see some of the discussion in #4285, as there was a lot of research that happened there relating to the [...]
Sorry, I was otherwise engaged. There are a few other entries which need fixing too.
Most of the "INCREMENTAL" entries should be looked at closely, however the important ones are:
read_total_err_corrected
read_total_unc_errors
write_total_err_corrected
write_total_unc_errors
verify_total_err_corrected
verify_total_unc_errors
spin_up_retries
calibration_retries
reallocated_sectors_count
program_fail_count
erase_fail_count
reallocation_event_count
When monitoring drives, it's easy to miss blips, and what really matters is the total health numbers over time. (It's worth reading the SMART attribute tables for explanations of each item.)
If these traces sit at zero, it's far too easy to get a false sense of security.
Originally posted by @Stoatwblr in #7383 (comment)
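To illustrate the point about blips, here is a minimal sketch in Python of how the same raw counter readings look under the two chart types. The sample values and the helper names `incremental` and `absolute` are hypothetical and are not taken from the smartd_log collector. The incremental helper is simplified to a plain difference between consecutive readings and ignores any per-second scaling.

```python
def incremental(samples):
    """Change between consecutive raw readings (simplified 'incremental' view)."""
    return [curr - prev for prev, curr in zip(samples, samples[1:])]


def absolute(samples):
    """Raw readings as reported, aligned with the incremental series."""
    return list(samples[1:])


# Hypothetical raw values of a counter such as reallocated_sectors_count,
# one reading per collection interval. Eight sectors are reallocated once.
raw = [0, 0, 0, 8, 8, 8, 8]

print("incremental:", incremental(raw))  # one spike, then back to zero
print("absolute:   ", absolute(raw))     # stays at the running total
```

With these assumed readings, the incremental series shows a single non-zero point and then returns to zero, while the absolute series keeps showing the total of 8, which is the behaviour the report asks for.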