change edac ecc errors from incremental to absolute #16970

Merged

Member

@ilyam8 ilyam8 commented Feb 8, 2024

Summary

Fixes: #16969

This PR changes the following metrics from incremental to absolute:

  • mem.edac_mc: correctable/uncorrectable for each Memory Controller.
  • mem.edac_mc_dimm: correctable/uncorrectable for each DIMM module.

See the reasoning in #16969. I am inclined to agree that reporting these counters as incremental is incorrect, so I decided to change the existing metrics rather than add new ones. I consider the incremental representation a bug that produces a misleading metric. If you disagree, please describe a use case where you need rates.
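To illustrate the difference, here is a minimal sketch in Python. It is not Netdata's collector code: the function names and the sample readings are hypothetical, and it assumes the kernel exposes the error counts as monotonically increasing totals. It shows how the same counter readings look when charted as incremental versus absolute.

```python
# Illustrative only: hypothetical helpers, not part of Netdata.

def as_incremental(samples):
    """Per-interval difference between consecutive counter readings."""
    return [cur - prev for prev, cur in zip(samples, samples[1:])]


def as_absolute(samples):
    """The raw counter value at each collection after the first."""
    return list(samples[1:])


# Hypothetical correctable-error totals read on successive collections.
ce_count = [0, 0, 1, 1, 1, 3, 3]

print(as_incremental(ce_count))  # [0, 1, 0, 0, 2, 0]
print(as_absolute(ce_count))     # [0, 1, 1, 1, 3, 3]
```

With the incremental form, an error shows up for a single interval and the chart then returns to zero. With the absolute form, the total stays visible after the error occurs.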

Netdata aggregates charts by context, so I changed the contexts to avoid incorrect aggregation. I don't think there is a way around this. Again, I did this because I consider the old behavior a bug.

Test Plan

Install, check charts and alarms.

Additional Information
For users: How does this change affect me?

@github-actions github-actions bot added the area/health, area/collectors, collectors/proc, and area/metadata labels Feb 8, 2024
@ilyam8 ilyam8 marked this pull request as ready for review February 8, 2024 11:12
@ilyam8 ilyam8 force-pushed the fix-collectors-make-ecc-errors-absolute branch from cb3c1e1 to 5fd8b46 Compare February 8, 2024 11:14
@ilyam8 ilyam8 force-pushed the fix-collectors-make-ecc-errors-absolute branch from 5fd8b46 to e55e65a Compare February 8, 2024 11:31
Contributor

@thiagoftsm thiagoftsm left a comment

PR is working as expected, LGTM!

@ilyam8 ilyam8 merged commit 8cc28e8 into netdata:master Feb 8, 2024
131 checks passed
@ilyam8 ilyam8 deleted the fix-collectors-make-ecc-errors-absolute branch February 8, 2024 17:06
Contributor

k0ste commented Feb 8, 2024

Thanks!
Could this be backported to 1.44.3?

Member Author

ilyam8 commented Feb 8, 2024

@k0ste No, but I think we will release v1.45.0 soon, hopefully in two weeks.

Labels
area/collectors, area/health, area/metadata, collectors/proc
Development

Successfully merging this pull request may close these issues.

[Feat]: add CE & UE counter for EDAC collector
3 participants