feat(metrics): Another statsd metric to measure bucket duplication [INGEST-421] #1128
Add a set metric that measures the number of unique bucket keys observed at a point/interval in time. Combined with other metrics, this can help us measure how much bucket duplication happens because of horizontal scaling.
This requires us to implement some way of hashing bucket keys such that statsd can consume them. `BucketKey` already implements `Hash` for use in hashmaps. We cannot, however, use the std hasher:

- The docs for `DefaultHasher` explicitly state that the hashes may change across Rust releases (this would probably not be much of a blocker for our purpose, but it's annoying to keep in mind).
- `SipHasher` is deprecated (but could probably still be used).
- We already have `crc32fast` in our dependency tree, so let's depend on it explicitly and just use that.
There's still one caveat: `Hash` impls may call different `Hasher` methods depending on CPU architecture (as stated in https://docs.rs/deterministic-hash/1.0.1/deterministic_hash/), but I think we can live with that for now.
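To illustrate the approach, here is a minimal sketch of feeding `BucketKey`'s existing `Hash` impl into a stable CRC32-backed `Hasher`. The `BucketKey` fields below are hypothetical stand-ins (the real type lives in the codebase), and to keep the sketch dependency-free a small bitwise CRC32 stands in for `crc32fast::Hasher`, which already implements `std::hash::Hasher` and would be used directly in the actual change:

```rust
use std::hash::{Hash, Hasher};

// Bitwise CRC-32 (IEEE polynomial, reflected) update step.
fn crc32_update(mut crc: u32, data: &[u8]) -> u32 {
    for &byte in data {
        crc ^= byte as u32;
        for _ in 0..8 {
            // mask is 0xFFFF_FFFF when the low bit is set, 0 otherwise.
            let mask = (crc & 1).wrapping_neg();
            crc = (crc >> 1) ^ (0xEDB8_8320 & mask);
        }
    }
    crc
}

/// Stable hasher: the same input bytes always produce the same u32,
/// independent of the Rust release.
struct Crc32Hasher {
    state: u32,
}

impl Crc32Hasher {
    fn new() -> Self {
        Crc32Hasher { state: !0 }
    }
    fn finalize(&self) -> u32 {
        !self.state
    }
}

impl Hasher for Crc32Hasher {
    fn write(&mut self, bytes: &[u8]) {
        self.state = crc32_update(self.state, bytes);
    }
    fn finish(&self) -> u64 {
        self.finalize() as u64
    }
}

// Hypothetical stand-in for the real BucketKey in the codebase.
#[derive(Hash)]
struct BucketKey {
    project_id: u64,
    metric_name: String,
}

// Feed the key's existing Hash impl into the stable hasher to get a
// value suitable for submitting to a statsd set metric.
fn bucket_key_hash(key: &BucketKey) -> u32 {
    let mut hasher = Crc32Hasher::new();
    key.hash(&mut hasher);
    hasher.finalize()
}

fn main() {
    let key = BucketKey {
        project_id: 42,
        metric_name: "endpoint.response_time".to_owned(),
    };
    println!("bucket key hash: {:#010x}", bucket_key_hash(&key));
}
```

Note the architecture caveat still applies here: `#[derive(Hash)]` may emit different `write_*` calls for pointer-sized fields on different targets, which is why the CRC only guarantees stability for identical byte streams.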