feat(metrics): Split buckets into partitions (dry run) [INGEST-1472] #1425
Conversation
relay-metrics/src/aggregation.rs
Outdated
.entry(key.project_key)
    .or_default()
    .push(HashedBucket {
        hashed_key: key.as_integer_lossy(), // TODO: Do we need a more reliable hasher?
Not sure if we can rely on this hasher, this comment seems pretty clear:
relay/relay-metrics/src/aggregation.rs
Lines 894 to 895 in a9d9390
// XXX: The way this hasher is used may be platform-dependent. If we want to produce the
// same hash across platforms, the `deterministic_hash` crate may be useful.
I think I looked into this, didn't find any great, fast crates, and then @jan-auer pointed out that we already have FNV for this.
Considering we're doing sharding of Kafka topics by org id at some point, I think it's time well spent to look into a hashing function that is deterministic and portable (can be replicated in Python), and I am not sure if this function here fulfills either of those criteria.
we already have FNV for this.
Good point, will replace with FnvHasher.
FnvHasher
If you have the time, ensure we're picking a hashing function that is easily usable in Python in any case. While portability is not required for this story, we will need it in future tasks for traffic steering.
There's a Python impl for FNV, and it looks simple enough to write ourselves if necessary:
https://pypi.org/project/fnvhash/
https://en.wikipedia.org/wiki/Fowler%E2%80%93Noll%E2%80%93Vo_hash_function
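The portability argument is easy to see in code: FNV-1a is only a handful of operations on two published constants, so the same few lines can be replicated in Python byte for byte. A minimal sketch (hand-rolled here rather than pulled from the fnv crate, so it stands alone):

```rust
// Minimal FNV-1a (64-bit) sketch using the published constants.
// The identical loop translates line-for-line to Python, which is the
// portability property discussed above.
const FNV_OFFSET_BASIS: u64 = 0xcbf2_9ce4_8422_2325;
const FNV_PRIME: u64 = 0x0000_0100_0000_01b3;

fn fnv1a_64(data: &[u8]) -> u64 {
    let mut hash = FNV_OFFSET_BASIS;
    for &byte in data {
        hash ^= u64::from(byte);
        hash = hash.wrapping_mul(FNV_PRIME);
    }
    hash
}

fn main() {
    // Well-known FNV-1a test vectors:
    assert_eq!(fnv1a_64(b""), 0xcbf2_9ce4_8422_2325);
    assert_eq!(fnv1a_64(b"a"), 0xaf63_dc4c_8601_ec8c);
}
```

Because the algorithm operates on raw bytes with wrapping 64-bit arithmetic, it is deterministic across platforms, unlike the default SipHash-based std hasher, which is seeded per process.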
relay-metrics/src/aggregation.rs
Outdated
// XXX: The way this hasher is used may be platform-dependent. If we want to produce the
// same hash across platforms, the `deterministic_hash` crate may be useful.

// TODO(jjbayer): Use FnvHasher here
As discussed, FNV is fine and easy to implement; let's do it.
relay-metrics/src/aggregation.rs
Outdated
relay_statsd::metric!(
    histogram(MetricHistograms::BucketsPerBatch) = batch.len() as f64,
    partition_key = partition_tag.as_str(),
    batch_index = format!("{i}").as_str(),
Please don't tag this: the cardinality is theoretically unbounded. Instead, add a histogram metric counting the number of batches within a partition, i.e. metric!(histogram(..) = capped_batches.len());
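The suggested shape can be sketched as follows: rather than emitting one tagged data point per batch, emit a single histogram value per flush recording how many batches the partition was split into. Here a plain chunking function stands in for CappedBucketIter, and the relay_statsd::metric! call is shown only as a comment since it is a project-internal macro:

```rust
// Stand-in for CappedBucketIter: split buckets into size-capped batches
// and report only the batch *count*, which has bounded cardinality.
fn count_batches(buckets: &[u32], max_batch_size: usize) -> usize {
    buckets.chunks(max_batch_size).count()
}

fn main() {
    let buckets: Vec<u32> = (0..10).collect();
    let batches = count_batches(&buckets, 3);
    // relay_statsd::metric!(histogram(BatchesPerPartition) = batches as u64);
    assert_eq!(batches, 4); // 10 buckets with a cap of 3 -> batches of 3, 3, 3, 1
}
```

The key difference from the original diff is that the batch index never becomes a tag value, so the metric's tag cardinality stays constant no matter how many batches a flush produces.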
relay-metrics/src/aggregation.rs
Outdated
let capped_batches =
    CappedBucketIter::new(buckets.into_iter(), self.config.max_flush_bytes);
let partition_tag = match partition_key {
    Some(partition_key) => format!("{partition_key}"),
I think we prefer to use .to_string() instead of format!("{..}").
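Both spellings produce the same string; .to_string() just states the intent directly and skips the formatting machinery at the call site. A quick illustration:

```rust
fn main() {
    let partition_key: u64 = 17;
    // Equivalent results; `.to_string()` is the more direct spelling
    // when no format string is actually needed.
    assert_eq!(partition_key.to_string(), format!("{partition_key}"));
    assert_eq!(partition_key.to_string(), "17");
}
```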
relay-statsd/src/lib.rs
Outdated
f();

*METRICS_CLIENT.write() = old_client;
I think you might have those changes locally, but this needs to be done per-thread
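The per-thread concern can be sketched with a thread_local override: swapping a single global client races with other test threads, whereas a thread-local swap only affects the current thread. This is an assumption-laden sketch, with a stand-in Client type and helper name rather than relay's actual statsd client:

```rust
use std::cell::RefCell;

// Stand-in client type; relay's real client differs.
#[derive(PartialEq, Debug)]
struct Client(&'static str);

thread_local! {
    // One client per thread, so test overrides cannot race each other.
    static METRICS_CLIENT: RefCell<Client> = RefCell::new(Client("global"));
}

fn with_client<F: FnOnce()>(temp: Client, f: F) {
    // Swap in the temporary client for this thread only.
    let old = METRICS_CLIENT.with(|c| c.replace(temp));
    f();
    // Restore the previous client afterwards.
    METRICS_CLIENT.with(|c| *c.borrow_mut() = old);
}

fn main() {
    with_client(Client("test"), || {
        METRICS_CLIENT.with(|c| assert_eq!(*c.borrow(), Client("test")));
    });
    // Outside the closure the thread sees its original client again.
    METRICS_CLIENT.with(|c| assert_eq!(*c.borrow(), Client("global")));
}
```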
@@ -20,7 +20,7 @@ relay-system = { path = "../relay-system" }
 serde = { version = "1.0.114", features = ["derive"] }
 serde_json = "1.0.55"
 failure = "0.1.8"
-crc32fast = "1.2.1"
+fnv = "1.0.7"
This dependency was already in Cargo.lock.
// Create a 64-bit hash of the bucket key using FnvHasher.
// This is used for partition key computation and statsd logging.
fn hash64(&self) -> u64 {
    let mut hasher = FnvHasher::default();
With fnv::FnvHasher we can auto-derive Hash; with hash32, we cannot.
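The point about auto-deriving is that fnv::FnvHasher implements std::hash::Hasher, so a key struct can simply #[derive(Hash)] and be fed into it. A self-contained sketch with a hand-rolled FNV-1a hasher standing in for the fnv crate, and an illustrative BucketKey (not relay's actual field set):

```rust
use std::hash::{Hash, Hasher};

// Hand-rolled FNV-1a hasher; fnv::FnvHasher plays this role in relay.
struct Fnv1a(u64);

impl Default for Fnv1a {
    fn default() -> Self {
        Fnv1a(0xcbf2_9ce4_8422_2325)
    }
}

impl Hasher for Fnv1a {
    fn write(&mut self, bytes: &[u8]) {
        for &b in bytes {
            self.0 ^= u64::from(b);
            self.0 = self.0.wrapping_mul(0x0000_0100_0000_01b3);
        }
    }
    fn finish(&self) -> u64 {
        self.0
    }
}

// Because the hasher implements std::hash::Hasher, Hash can be derived.
// Illustrative fields only.
#[derive(Hash)]
struct BucketKey {
    project_key: u64,
    metric_name: String,
}

fn hash64(key: &BucketKey) -> u64 {
    let mut hasher = Fnv1a::default();
    key.hash(&mut hasher);
    hasher.finish()
}

fn main() {
    let a = BucketKey { project_key: 1, metric_name: "d:duration".into() };
    let b = BucketKey { project_key: 1, metric_name: "d:duration".into() };
    // Equal keys hash equally within one build; note that derived Hash
    // feeds integers in native byte order, so full cross-platform
    // portability still depends on how the key is serialized.
    assert_eq!(hash64(&a), hash64(&b));
}
```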
Convert the dry run implemented in #1425 into an actual batching mechanism that splits metrics buckets into logical partitions. The partition_key has to be passed through ProjectCache, EnvelopeManager and UpstreamRelay to be set as a header on the outgoing envelope request.
Measure what distributions we would obtain if we split flush buckets not only by project, but also by partition, where a bucket's partition is determined by hashing its bucket key modulo a configurable number of partitions.
#skip-changelog