test/persist: add test that compaction is happening as expected #10541

ruchirK · 2022-02-08T23:59:13Z

This commit adds a smoke test to check that persisted tables are being compacted
as they should be, using metrics for the number of batches in a persistent trace
to assert that the number decreases as expected.

This test is a little bit brittle, as it is very coupled to the way trace compaction
works today. A better alternative would be some kind of system table that can
automatically return details like the since frontier or the number of trace
batches. That is left as a followup.

Motivation

Tips for reviewer

Checklist

This PR has adequate test coverage / QA involvement has been duly considered.
This PR adds a release note for any user-facing behavior changes.

Touches MaterializeInc#10533 This commit adds a smoke test to check that persisted tables are being compacted as they should be, using metrics for the number of batches in a persistent trace to assert that the number decreases as expected. This test is a little bit brittle, as it is very coupled to the way trace compaction works today. A better alternative would be some kind of system table that can automatically return details like the since frontier or the number of trace batches. That is left as a followup.

ruchirK · 2022-02-09T00:03:19Z

verified that this fails without the fix for #10533 Hopefully, this test can help us prevent future regressions!

aljoscha · 2022-02-09T08:16:02Z

I think the test looks fine, but yes, it's also closely tied to how compaction works today. Could we maybe just do a lot of inserts, then , then verify that the blob count is below some reasonable threshold?

philip-stoev

Thank you very much! Lets see if there is any flakiness introduced and we will reevaluate.

ruchirK · 2022-02-09T16:46:28Z

We can actually do better than passively waiting for flakiness

altaria-2:materialize Test$ git diff
diff --git a/test/persistence/mzcompose.py b/test/persistence/mzcompose.py
index 1cc13734a..3e47767f8 100644
--- a/test/persistence/mzcompose.py
+++ b/test/persistence/mzcompose.py
@@ -168,3 +168,8 @@ def workflow_compaction(c: Composition) -> None:
     c.rm("materialized", "testdrive-svc", destroy_volumes=True)

     c.rm_volumes("mzdata")
+
+
+def workflow_stress_compaction(c: Composition) -> None:
+    for i in range(0, 200):
+        workflow_compaction(c)
altaria-2:materialize Test$

running this now to see if we observe any failures over 200 runs. will merge if thats green!

ruchirK · 2022-02-09T16:57:58Z

I think the test looks fine, but yes, it's also closely tied to how compaction works today. Could we maybe just do a lot of inserts, then , then verify that the blob count is below some reasonable threshold?

I thought about this and felt like it was non-trivial to write a test that would not give either a large percentage of false negatives (flakes) or false positives (fail to see a regression). When you send a large number of inserts its a bit opaque how many trace batches will get produced and its similarly opaque when those trace batches will get compacted. That makes it hard to figure out:

how should we set the baseline number of trace batches to look for?
how long should we wait to declare failure?

This test concretely observes the number of trace batches increasing and decreasing, and should break if we change trace compaction - but hopefully I'll be the only person hit by that for a while, and we can revisit the test when we have better introspection primitives

ruchirK · 2022-02-09T19:08:16Z

Aljoscha approved from Slack and this test ran 200 times without error so merging! TFTR!

ruchirK requested review from aljoscha and philip-stoev February 8, 2022 23:59

ruchirK force-pushed the rk-10533-followup branch from 2614946 to 0d4a8fa Compare February 9, 2022 00:02

philip-stoev approved these changes Feb 9, 2022

View reviewed changes

ruchirK merged commit 6059761 into MaterializeInc:main Feb 9, 2022

ruchirK mentioned this pull request Feb 14, 2022

persist: trace compaction seems broken #10300

Closed

materialize-bot mentioned this pull request Feb 16, 2022

release: v0.21.0-rc1 required reviews #10727

Closed

29 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

test/persist: add test that compaction is happening as expected #10541

test/persist: add test that compaction is happening as expected #10541

ruchirK commented Feb 8, 2022

ruchirK commented Feb 9, 2022

aljoscha commented Feb 9, 2022

philip-stoev left a comment

ruchirK commented Feb 9, 2022

ruchirK commented Feb 9, 2022

ruchirK commented Feb 9, 2022

test/persist: add test that compaction is happening as expected #10541

test/persist: add test that compaction is happening as expected #10541

Conversation

ruchirK commented Feb 8, 2022

Motivation

Tips for reviewer

Checklist

ruchirK commented Feb 9, 2022

aljoscha commented Feb 9, 2022

philip-stoev left a comment

Choose a reason for hiding this comment

ruchirK commented Feb 9, 2022

ruchirK commented Feb 9, 2022

ruchirK commented Feb 9, 2022