
fix(metrics): Track memory footprint more accurately [INGEST-1132] #1288

Merged (10 commits, Jun 7, 2022)

Conversation

@jjbayer (Member) commented on Jun 3, 2022:

#1284 introduced a cost model for measuring the memory footprint of the metrics buckets stored in the aggregator. It has two flaws:

  1. It did not account for the fixed size overhead of a BucketValue (it only counted the values stored inside).
  2. It did not account for the size overhead of storing the BucketKey.

This PR attempts to fix both issues.
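
For context, here is a minimal sketch of such a cost function, assuming a simplified BucketValue (the variants, payload types, and per-variant estimates are illustrative assumptions, not Relay's actual definitions). The fixed mem::size_of term covers the enum overhead from flaw 1; in the same spirit, the BucketKey's cost would be added on top when a bucket is inserted (flaw 2):

    use std::collections::BTreeSet;
    use std::mem;

    // Illustrative stand-in for Relay's BucketValue; the real enum has more
    // variants and different payload types.
    enum BucketValue {
        Counter(f64),
        Distribution(Vec<f64>),
        Set(BTreeSet<u32>),
    }

    impl BucketValue {
        // Estimated footprint in bytes: the fixed enum size (the overhead the
        // original model missed) plus the heap allocations behind each variant.
        fn cost(&self) -> usize {
            let fixed = mem::size_of::<Self>();
            let dynamic = match self {
                BucketValue::Counter(_) => 0,
                BucketValue::Distribution(values) => values.capacity() * mem::size_of::<f64>(),
                BucketValue::Set(set) => set.len() * mem::size_of::<u32>(),
            };
            fixed + dynamic
        }
    }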

relay-metrics/src/aggregation.rs (outdated):
    - // Choosing a BTreeMap instead of a HashMap here, under the assumption that a BTreeMap
    - // is still more efficient for the number of project keys we store.
    - cost_per_project_key: BTreeMap<ProjectKey, usize>,
    + cost_per_project_key: HashMap<ProjectKey, usize>,
@jjbayer (Member, Author):

I did not run any benchmarks, but figured that a HashMap is a safer choice w.r.t. scaling.

A reviewer (Member) replied:

It's fine either way; a BTreeMap might have an edge at small sizes.

[Five further review threads on relay-metrics/src/aggregation.rs, resolved (outdated)]
    @@ -1287,7 +1339,7 @@ impl Aggregator

          let flush_at = self.config.get_flush_time(timestamp, project_key);
          let bucket = value.into();
    -     added_cost = bucket.cost();
    +     added_cost = entry.key().cost() + bucket.cost();
A reviewer (Member) commented:

While at it, it would be good to deal with the Occupied branch too. See #1287 (comment)

@jjbayer (Member, Author) replied:

The Occupied branch is handled correctly (though not elegantly):

    let cost_before = bucket_value.cost();
    value.merge_into(bucket_value)?;
    let cost_after = bucket_value.cost();
    added_cost = cost_after.saturating_sub(cost_before);

The advantage of computing the cost twice and subtracting is that we only need a single function for computing bucket value cost. If we did something like added_cost = if something_added { value.incremental_cost() } else { 0 }, we would also need to implement an incremental_cost function on both BucketValue and MetricValue.
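
As a self-contained illustration of that before/after pattern, here is a sketch using a simplified set-valued bucket (the SetBucket type and its cost function are assumptions for the example, not Relay's actual types). A merge may add nothing new, in which case the cost delta is zero, and saturating_sub guards the subtraction:

    use std::collections::BTreeSet;
    use std::mem;

    // Simplified stand-in for a set-valued bucket; not Relay's actual type.
    struct SetBucket(BTreeSet<u32>);

    impl SetBucket {
        // Fixed struct size plus an estimate of the heap behind the entries.
        fn cost(&self) -> usize {
            mem::size_of::<Self>() + self.0.len() * mem::size_of::<u32>()
        }

        fn merge(&mut self, other: &SetBucket) {
            self.0.extend(other.0.iter().copied());
        }
    }

    fn main() {
        let mut bucket = SetBucket([1, 2, 3].into_iter().collect());
        let incoming = SetBucket([3, 4].into_iter().collect());

        let cost_before = bucket.cost();
        bucket.merge(&incoming);
        let cost_after = bucket.cost();

        // Only `4` is new; `3` was already present, so one entry was added.
        let added_cost = cost_after.saturating_sub(cost_before);
        assert_eq!(added_cost, mem::size_of::<u32>());
    }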

@jjbayer jjbayer marked this pull request as ready for review June 7, 2022 07:37
@jjbayer jjbayer requested a review from a team June 7, 2022 07:37

    - let value = std::mem::replace(&mut entry.value, BucketValue::Counter(0.0));
    - cost_tracker.subtract_cost(key.project_key, value.cost());
    + let value = mem::replace(&mut entry.value, BucketValue::Counter(0.0));
    + cost_tracker.subtract_cost(key.project_key, key.cost() + value.cost());
A reviewer (Member) commented:
Can we call subtract_cost twice instead? Then overflow/underflow handling is also dealt with in one place.
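
A minimal sketch of that suggestion, assuming an illustrative CostTracker (the field names, the ProjectKey stand-in, and the usage in main are assumptions, not Relay's actual definitions). With all saturating arithmetic inside subtract_cost, callers can subtract the key cost and the value cost in two separate calls:

    use std::collections::HashMap;

    type ProjectKey = u64; // stand-in for Relay's ProjectKey type

    // Illustrative cost tracker; the real one in aggregation.rs may differ.
    #[derive(Default)]
    struct CostTracker {
        total_cost: usize,
        cost_per_project_key: HashMap<ProjectKey, usize>,
    }

    impl CostTracker {
        // Underflow handling lives in one place: every subtraction saturates
        // at zero, so callers may subtract each component separately.
        fn subtract_cost(&mut self, project_key: ProjectKey, cost: usize) {
            self.total_cost = self.total_cost.saturating_sub(cost);
            if let Some(c) = self.cost_per_project_key.get_mut(&project_key) {
                *c = c.saturating_sub(cost);
            }
        }
    }

    fn main() {
        let mut tracker = CostTracker::default();
        tracker.cost_per_project_key.insert(1, 100);
        tracker.total_cost = 100;

        // Per the suggestion: one call per cost component instead of summing.
        tracker.subtract_cost(1, 24); // key cost
        tracker.subtract_cost(1, 48); // value cost
        assert_eq!(tracker.total_cost, 28);
    }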

@jjbayer jjbayer merged commit 192a8eb into master Jun 7, 2022
@jjbayer jjbayer deleted the ref/track-metrics-footprint-2 branch June 7, 2022 14:41
jan-auer added a commit that referenced this pull request Jun 9, 2022
* master:
  ref(metrics): Stop logging relative bucket size (#1302)
  fix(metrics): Rename misnamed aggregator option (#1298)
  fix(server): Avoid a panic in the Sentry middleware (#1301)
  build: Update dependencies with known vulnerabilities (#1294)
  fix(metrics): Stop logging statsd metric per project key (#1295)
  feat(metrics): Limits on bucketing cost in aggregator [INGEST-1132] (#1287)
  fix(metrics): Track memory footprint more accurately (#1288)
  build(deps): Bump dependencies (#1293)
  feat(aws): Add relay-aws-extension crate which implements AWS extension as an actor (#1277)
  fix(meta): Update codeowners for the release actions (#1286)
  feat(metrics): Track memory footprint of metrics buckets (#1284)